Proteins and Proteomics
Companion Web Site 


A COMPANION WEB SITE (www.proteinsandproteomics.org) to Proteins and Proteomics: A
Laboratory Manual provides supplemental information about this fast-moving field of
research. The site will include:


* References linked to Medline.

* Links to other databases of value to working scientists.

* Selected figures from the book for use in troubleshooting. 

* Assistance in troubleshooting HPLC and 2D electrophoresis problems. 


* A chapter on the analysis of carbohydrate from proteins (Oxley et al.; see abstract below). 


Analysis of Carbohydrate from Glycoproteins 


David Oxley,* Graeme Currie,† and Antony Bacic†


*The Babraham Institute, Babraham, Cambridge CB2 4AT, United Kingdom; †Plant Cell Biology Research
Centre, School of Botany, University of Melbourne, Victoria 3110, Australia


ABSTRACT 


The identification and analysis of glycans associated with proteins is a formidable chal- 
lenge due to the complexity and diversity of these carbohydrate structures. This chap- 
ter attempts to guide the "nonexpert" through the strategies used for the detection and 
analysis of carbohydrates on proteins. The "best" approach is very much dictated by the 
availability of material and the extent of information required. Many of the techniques
can be used in most biochemistry/molecular biology laboratories, whereas others are 
very much in the bailiwick of the specialist laboratories that have access to highly 
sophisticated instrumentation. This chapter will answer at the very least the question 
most asked: Is my protein glycosylated? For many investigators, this level of information
may suffice. For those inspired to ask the next logical question (What is the structure
of this carbohydrate?), this chapter will provide sufficient background to proceed
with confidence. 

In addition to a brief history of carbohydrate research and a presentation of carbohydrate
nomenclature, the chapter explains the fundamental steps involved in the analysis
of glycoprotein-derived glycans. Each section within the chapter is designed to provide
a concise understanding of the methods available to achieve the goal of the section,
some of the limitations of each method, and references to some excellent texts on prac- 
tical matters. Four widely used protocols are included, which detail the removal of gly- 
cans from glycoproteins and the preparation of monosaccharides for analysis by gas 
chromatography coupled with mass spectrometry. 

* A protocol on the use of a multicompartment electrolyzer for the isoelectric fractionation
of samples prior to 2D gel electrophoresis (Herbert et al.; see abstract below).


Sample Preparation for High-resolution Two-dimensional Electrophoresis 
by Isoelectric Fractionation in a Multicompartment Electrolyzer


Ben R. Herbert,* Pier Giorgio Righetti,† John McCarthy, Jasmine Grinyer,*
Annalisa Castagna,* Matthew Laver,* Matthew Durack,* Gerard Rummery,*
Rebecca Harcourt,* and Keith L. Williams*


*Proteome Systems, North Ryde, Sydney, NSW 1670, Australia; †University of Verona, Department of
Agricultural and Industrial Biotechnologies, Strada Le Grazie No. 15, 37134 Verona, Italy


ABSTRACT 


Two common problems when using a broad pH gradient with two-dimensional gel 
electrophoresis for the separation of proteins are low resolution of hydrophobic, high- 
ly acidic or basic proteins and poor detection of low-abundance proteins. Increasing 
resolution and enhancing detection on 2D gels is possible with the use of narrow and 
ultra-narrow range (1-3 pH units and <1 pH unit, respectively) immobilized pH gradients
(IPGs). However, when narrow pH gradients are loaded with an entire cell
lysate, a large proportion of the protein sample is not isoelectric within the separation 
range of the pH gradient. These “extraneous” proteins severely disturb the separation, 
because they have pIs outside the pH range of the IPG. This phenomenon is aggravated
at high protein loads. Although it can be almost eliminated by loading a small
amount of protein, this unfortunately is of very limited use for proteomics. 

One solution to these problems is to employ a multicompartment electrolyzer 
(MCE), an instrument that fractionates protein samples isoelectrically prior to the
creation of 2D maps. The resulting protein fractions match the pH intervals to be 
adopted as the first dimensions of the subsequent 2D maps. The fractionated protein 
mixture, devoid of proteins with isoelectric points outside of the range of the IPGs,
can be loaded in a 2D map at much higher levels, thus ensuring greater sensitivity and
detection of low-abundance proteins. Isoelectric fractionation using the MCE is fully 
compatible with subsequent 2D protocols, because it is based on a focusing technique 
that yields highly concentrated samples devoid of salts and buffers. The current MCE 
instrument uses commercially available, amphoteric, buffered membranes that are 
matched to the pH endpoints of commonly used IPGs. 


Additional information will be added after the book is published. To access the Web Site: 
1. Open the home page of the site.


2. Follow the simple registration procedure that begins on that page (no unique access code 
is required, since the site is open to anyone who completes the registration process). 


3. Your e-mail address and password (selected during the registration process) become your 
log-in information for subsequent visits to the site. 


The FAQ section of the site contains answers about the registration procedure. For addi- 
tional assistance with registration, to inform us of other Web address changes, and for all 
other inquiries about the proteinsandproteomics.org Web Site, please e-mail
support@proteinsandproteomics.org or call 1-800-843-4388 (in the continental U.S. and Canada) or
516-422-4100 (all other locations) between 8:00 A.M. and 5:00 P.M. Eastern U.S. time.


Preface 


NOW THAT THE FIRST DRAFT OF THE HUMAN GENOME SEQUENCE is in the public domain, the
primary focus of biologists is rapidly shifting toward gaining an understanding of how genes 
function, i.e., the functional roles of the full complement of encoded proteins. As well as 
defining structural characteristics of proteins, this task requires an understanding of the tem- 
poral and spatial location of proteins within the cell, including the intricate nature of how 
proteins interact with one another. Analytical protein chemistry, or proteomics as it is now 
commonly known, has a vital role in this daunting task. As information began to flow from 
the various genome projects, it became apparent to Cold Spring Harbor Laboratory Press 
that there was a growing need to provide researchers with a source of reliable proteomics
protocols. Not long after, at the urging of my colleague Joe Sambrook, author of the enormously
successful manual Molecular Cloning, I was invited to tackle the challenge of writing a
laboratory manual of analytical methods and protocols for proteomics studies. Thus, Proteins
and Proteomics: A Laboratory Manual was conceived. 

Proteins and Proteomics is aimed at those who wish to isolate proteins and peptides for 
subsequent proteomic analysis. It is written for an audience ranging from early graduate stu- 
dents to experienced investigators. Proteins and Proteomics is not an encyclopedic book cov- 
ering all possible proteomics methods. Rather, the book covers only those proteomics meth- 
ods and technologies that are in current use in my laboratory or those of trusted colleagues. 
In each chapter, I have endeavored to provide sufficient background knowledge to underpin
the accompanying protocols. In areas outside the immediate ken of the protein chemist, such 
as glycobiology and informatics, I have sought the contributions of experts to cover these 
specific fields of experimentation. Accordingly, I thank Antony Bacic and Parag Mallick and 
their colleagues for valued contributions in glycobiology and informatics, respectively. 

A work such as this does not see the light of day without much outside help and support. 
My first thanks go to my friends and colleagues in the Parkville precinct in Melbourne, who 
have responded generously to my sometimes intemperate requests to provide illustration 
material or technical review. I am greatly indebted to the editorial and production staff at 
Cold Spring Harbor Laboratory Press for their dedication and tireless efforts in checking
references, facts, faulty constructions, and stylistic abominations and keeping me on schedule
(almost). I owe a special debt of gratitude to Judy Cuddihy for her cheerful optimism that 
raised my spirits, Kaaren Janssen and Maryliz Dickerson for their guidance in the beginning 
of the project, Tamara Howard for diligent fact checking, Inez Sialiano for coordinating the 
project, Dorathy Brown for editorial assistance, Susan Schaefer for page layout, Denise Weiss 
for her elegant design of the book, and most of all, Michael Zierler for his unstinting support 
as Senior Developmental Editor in steering the book to completion. I also acknowledge the 
generous support of Jan Argentine, my Managing Editor, and John Inglis, the Director of 
Cold Spring Harbor Laboratory Press, for overseeing the project. 

Finally, I owe a special debt of gratitude to Mary Whitham and Pamela Jones for their
secretarial assistance. I must also thank Simone Pakin especially for her superb
information-gathering skills and for her dedication and shockproof resilience that helped us both survive
the process. My thanks also go to the members of the Joint ProteomicS Laboratory (JPSL),
especially Robert Moritz, Hong Ji, David Frecklington, Lisa Connolly, James Eddes, Eugene
Kapp, and Gavin Reid, for the support that they have given me in more ways than I can list
here. I also acknowledge the rich intellectual and collegial environment at the Ludwig
Institute for Cancer Research and especially the ungrudging support given to me by its
director, Tony Burgess. Without his sustaining enthusiasm, this book would not have been
possible.

To this long list of creditors I must finally add my partner, Donna Dorow, who has had
to suffer with increasing patience the growing and seemingly never-ending demands that the 
“book” came to make on our time (the never-ending weekend and late night writing 
episodes), my attention, and temper. 


Richard J. Simpson 


Foreword


STUDENTS AND EXPERIENCED RESEARCHERS FACE SIMILAR challenges when attempting to delve
into a new field of research. They need to learn new terminology, concepts, and theories,
define current research topics in the field, and master a new set of methods and techniques. 
Finding suitable resources may be as difficult as mastering the subject matter itself, especially
when the topic is an emerging and rapidly evolving field, such as proteomics. Often, textbooks,
glossaries, and reference manuals are sparse, if they exist at all, and the available
information must be gleaned from numerous articles in the primary literature. 

In this comprehensive book, Richard Simpson and a group of leading proteomics experts 
attempt the impossible: to condense theory, background information, protocols, and infor- 
mation resources into a single volume. They succeed. This manual contains virtually every- 
thing one would hope to find in both a textbook and laboratory manual: overviews, intro- 
ductory materials, and a theoretical base for proteomics. Detailed protocols for common pro- 
teomics experiments, complete source lists for the tested materials, and web links to crucial 
reagent resources are also thoughtfully provided. 

Proteins and Proteomics: A Laboratory Manual is an invaluable information tool both for
the experienced protein chemist who bravely ventures into the new world of proteomics and
for the novice to proteins and proteomes. By focusing on what is currently considered the
bedrock of proteomics technologies, Professor Simpson ensures that, in spite of the rapid
advances that characterize contemporary proteomics research, this volume will remain
relevant and current for years to come.


Ruedi Aebersold 
Professor, Institute for Systems Biology


Biological macromolecules are the main actors in the makeup of life.... To understand biology and
medicine at a molecular level...we need to visualize the activity and interplay of large macromolecules such
as proteins. To study protein molecules, principles for their separation and determination of their
individual characteristics had to be developed. One of the most important chemical techniques used today
for the analysis of biomolecules is mass spectrometry (MS), one of the subjects of the 2002 Nobel prize
award.

The 2002 Nobel prize in Chemistry was awarded “for the development of methods for identification 
and structure analyses of biological macromolecules” with one half going jointly to John B. Fenn 
(Virginia Commonwealth University, Richmond, USA) and Koichi Tanaka (Shimadzu Corporation,
Kyoto, Japan) “for their development of soft desorption ionization methods for mass spectrometric analy- 
ses of biological macromolecules.” 

This is a revolutionary breakthrough. Chemists and biologists can now rapidly and reliably identify 
what proteins a sample contains. Hence, scientists can both “see” the proteins and understand how they 
function within cells.


NOW THAT MORE THAN 40 GENOMES, INCLUDING THE HUMAN GENOME, have been fully sequenced
and are in the public domain, the next challenge for biologists will be to connect gene to
function, genotype to phenotype, to find out what the genes really do! The rapid pace of genome
sequencing efforts during the past several years has resulted in many newly discovered genes
that have been ascribed no function or a function that at best has been poorly described. For
an up-to-date monitor of complete and ongoing sequencing projects, see the GOLD Web Site
(Genomes OnLine Databases at http://wit.integratedgenomics.com/GOLD). This impetus to
understand the function of newly discovered genes is leading biologists toward the systematic
analysis of the expression levels of the components that constitute a biological system, chiefly
mRNA (transcriptomics), proteins (proteomics), and metabolites (metabolomics) (see Figure
1.1). Because proteins are central to biological function and obvious candidates for drug
targeting, proteomics is enjoying a rapidly increasing level of attention.

FIGURE 1.1. Functional genomics or phenomics. Genomics provides an overall description of the complete set of genetic
instructions (the genes) contained within the genome that are available to a cell, i.e., the "blueprint" of a cell. Functional genomics, on the
other hand, represents a systematic approach to elucidating the function of the novel genes revealed by complete gene sequences.
(Phenomics has been suggested as an all-embracing term to describe functional genomics.) Functional genomics adopts a hierarchical
strategy aimed at gaining a comprehensive and integrative view of the workings of living cells. There are a number of different
approaches to the functional analysis of novel genes. These can be grouped into four domains: genome (the complete set of
genes for an organism and its organelles), transcriptome (the complete set of mRNA molecules), proteome (the complete set of
proteins), and metabolome (the complete set of metabolites, the low-molecular-weight intermediates). Researchers have now added the
suffix “ics” to describe the utility for analyzing these domains. For example, the task of comparing the mRNA profiles using DNA
arrays is now referred to as cellular (or tissue) transcriptomics, and the task of separating the cell's proteins and comparing their
expression profiles is referred to as expression proteomics. Studying all of the proteins encoded by a genome (the proteome) without
focusing on a particular cell type, growth conditions, and subcellular localization is the domain of global proteomics. Focus on
protein expression within a particular cell type and/or subcellular organelle is the domain of targeted proteomics. It is now clear that these
domains are not an end in themselves, but a vehicle to understanding an organism's entire metabolism, now referred to as
metabolomics (Raamsdonk et al. 2001; Oliver 2002). Thus far, the fully sequenced genome studies have yielded many insights into the
functional properties of proteins, especially the emergence of networks of interacting proteins (the term “interactome” has been
coined to describe protein-protein networks). Understanding interactions between encoded proteins of a given genome is a critical
first step in functional genomic analysis (Xenarios and Eisenberg 2001; Gerstein et al. 2002). To understand biology at the system level,
and to develop models that explain the dynamics of cellular and organismal function (rather than the characteristics of isolated parts),
all of these domains must be integrated (the “legome”; total systems biology can be likened to assembling all of the component parts
of a "Lego" set) in a quantitative and temporal manner (for a review on systems biology, see Brenner 1999b; Kitano 2002).

FIGURE 1.2. 2D gel electrophoresis of proteins from a whole-cell extract stained with Coomassie Brilliant Blue. (a) Wild-type
C57/Black/6J murine colonic crypts; (b) polyps from multiple intestinal neoplasia (MIN) mice (Cole et al. 2000). The synthetic gel
images were generated using PDQuest software. Differentially expressed proteins are marked in light gray. The insets show that
carbonic anhydrase (CAII) and GST are both highly expressed in wild-type crypts and MIN polyps, whereas expression of several
CAII isoforms (outlined box) is dramatically reduced in MIN polyps. (Reproduced, with permission, from Simpson and Dorow
2001 [© Elsevier Science].)


DEFINING PROTEOMICS 


Proteomics or, more appropriately, functional proteomics refers to the branch of discovery 
science focusing on proteins. Initially, the term was used to describe the study of the 
expressed proteins of a genome using two-dimensional (2D) gel electrophoresis, and mass 
spectrometry (MS) to separate and identify proteins and sophisticated informatics approaches
for deconvoluting and interrogating data. This approach is now referred to as "expression"
or "global profiling" proteomics (Figures 1.2 and 1.3). The scope of proteomics has now
broadened to embrace the study of "protein-protein" interactions (protein complexes), 
referred to as cell-mapping proteomics (Blackstock and Weir 1999) (see panel below on THE 
MANY FACES OF PROTEOMICS). 


The term proteome, coined in 1994 as a linguistic equivalent to the concept of genome, is 
used to describe the complete set of proteins that is expressed, and modified following 
expression, by the entire genome in the lifetime of a cell. It is also used in a less universal 
sense to describe the complement of proteins expressed by a cell at any one time (from 
Nature 1999). Today, proteomics is a scientific discipline that promises to bridge the gap 
between our understanding of genome sequence and cellular behavior; it can be viewed as 
more of a biological assay or tool for determining gene function. 

FIGURE 1.3. Differential in-gel electrophoresis (DIGE) for the identification of cancer markers. (Top, left) In DIGE, an emerging
technology for proteome analysis (Unlu et al. 1997), two pools of proteins (e.g., normal cells and cancer cells procured from the
same tumor sample using laser-capture microdissection) (Emmert-Buck et al. 1996) are labeled with
1-(5-carboxypentyl)-1'-propylindocarbocyanine halide (Cy3) N-hydroxysuccinimidyl ester and
1-(5-carboxypentyl)-1'-methylindodicarbocyanine halide (Cy5) N-hydroxysuccinimidyl ester fluorescent dyes, respectively.
The Cy3- and Cy5-labeled proteins are mixed and then
separated in the same 2D gel. The 2D gel protein profiles can be rapidly imaged by the fluorescent excitation of either the Cy3 or
Cy5 dye. (Cy3-labeled gel images are collected at an excitation wavelength of 540 nm and at an emission wavelength of 590 nm,
whereas the Cy5-labeled gel images are collected at an excitation wavelength of 620 nm and an emission wavelength of 680 nm.)
A comparison of the resulting images allows quantitation of each protein spot. (Right, a) Cy3 image of proteins from normal cells;
(b) Cy5 image of proteins from tumor cells. Because both protein pools are electrophoretically separated in the same gel, those
proteins existing in both pools will migrate to the same location in the 2D gel, thereby minimizing the inherent reproducibility
problem associated with 2D gels. Quantitation of the protein profile can be rapidly and accurately achieved over a wide dynamic
range (e.g., four orders of magnitude) based on fluorescent intensity (Patton 2000). The separated proteins in the 2D gel are next
visualized by SYPRO Ruby staining (right, panel c). (The SYPRO Ruby-stained image is scanned at an excitation wavelength of 400
nm and an emission wavelength of 630 nm.) Protein spots of interest are excised and in-gel-digested with trypsin, and the
peptides extracted for the purpose of identification by mass spectrometry (MS) methods described in Chapters 7 and 8. (1D)
Identification. (Adapted, with permission, from Zhou et al. 2002.)


THE MANY FACES OF PROTEOMICS


* Proteomic analysis (or analytical protein chemistry). The large-scale identification 
and characterization of proteins, including their posttranslational modifications, such 
as phosphorylation and glycosylation. Analysis is done with the aid of mass spectrome- 
try or Edman degradation. For analysis of protein phosphorylation, see Chapter 9; and 
for amino-terminal sequence analysis using the Edman degradation procedure, see 
Chapter 6. 


* Expression proteomics (or differential display proteomics). Two-dimensional gels are
used for global profiling of expressed proteins in cell lysates and tissues. This conventional
approach is being challenged by non-2D gel methods, such as liquid-based isoelectric
focusing (IEF) or ion-exchange chromatography/reversed-phase high-performance
liquid chromatography (RP-HPLC). Proteins are typically identified by mass
spectrometry (MS). In many situations, these methods are complemented by DNA-based
array methods. Includes quantitative proteomics (for a review of proteomics strategies
for the quantitative analysis of paired protein samples [e.g., normal vs. diseased]
utilizing stable isotope labeling combined with chromatographic separations, see Chapter
8 and Patterson 2000a,b).


* Cell-mapping proteomics (or cataloging of protein-protein interactions). Protein-protein
interactions and intracellular signaling circuitry are determined by the identification
of protein complexes (obtained by affinity purification and protein identifications
by MS) or by direct DNA readout (e.g., yeast two-hybrid, phage display, ribosome
display, and RNA-peptide fusions). For reviews on protein networks, see Legrain (2002)
and Mayer and Hieter (2000); on cell-mapping proteomics, see Blackstock and Weir
(1999), Lakey and Raggett (1998), and Duan et al. (2002); and on mapping protein-protein
interactions with combinatorial biology methods that rely on direct DNA readout,
see Pelletier and Sidhu (2001).


WHY PROTEOMICS IN ADDITION TO GENOMICS? 


Large-scale Genome Sequencing: What Have We Learned? 


One of the most exciting biological achievements to emerge during the past 40 years has been
the completion of draft DNA sequences of the human genome, published by the
International Human Genome Sequencing Consortium (a publicly funded project) (Lander
et al. 2001) and by Celera Genomics (a commercial effort) (Venter et al. 2001). These
Herculean efforts provide a blueprint of the information needed to create a human being and
reveal, for the first time, the organization of a vertebrate’s DNA (for an overview of this
project, see Baltimore 2001). One of the interesting findings about the human genome is the
number of genes found. The public project estimates that there are 31,000 protein-encoding
genes, whereas Celera finds ~26,000, with many more still to be found. (A current estimate is
that the number of protein-encoding genes may be on the order of 60,000.)

Interestingly, the number of coding genes in the human sequence is not dramatically
different from the numbers reported for phylogenetically remote organisms: 6,000 for a yeast
cell, 13,000 for a fly, 18,000 for a worm, and 26,000 for a plant (Genomes OnLine Databases
at http://wit.integratedgenomics.com/GOLD). The number of genes reported for multicellular
organisms is not highly accurate because of the limitations of existing ab initio gene
prediction methods used to identify genes (Dunham et al. 1999). The existence of an open
reading frame (ORF) in genomic data does not necessarily imply the existence of a functional
gene. In human DNA, gene prediction by ab initio methods is notoriously difficult because
of the extensive alternative splicing (Black 2000), lower density of exons, and high proportion
of interspersed repetitive sequences. Given the unreliability of ab initio gene prediction
software, all genes will need to be experimentally identified and annotated. For example, the
error rate in the annotations for 340 genes from the Mycoplasma genitalium genome was ~8%
(Brenner 1999a). Hence, verification of a gene product by proteomic analysis is an important
first step in annotating the genome.


Disparity between mRNA Profiling and Protein Profiling 


No simple correlation exists between changes in mRNA expression levels (transcriptomics) 
and those in protein levels (proteomics). Indeed, the link between transcript levels and pro- 
tein levels in a given cell or tissue is tenuous, to say the least, and it is clear that array-based 
gene expression monitoring or other gene expression methods for measuring mRNA abun- 
dances, alone, are insufficient for analyzing the cell's protein complement (for a review of 
global gene expression methodologies, see Lockhart and Winzeler 2000). Recent studies show 
a marked disparity between the relative expression levels of mRNAs and those of their
corresponding proteins (Anderson and Seilhamer 1997; Gygi et al. 1999a). A further complication
arises when considering the complementarity of genomics and proteomics. Despite the adage 
that one gene gives rise to one protein, the situation in eukaryotic cells is more likely six to 
eight proteins per gene (Strohman 1994). Thus, there may be several hundred thousand 
human proteins after splice variants and essential posttranslational modifications are included.
For example, 22 different forms of human α-1-antitrypsin have been observed in human
plasma (Hoogland et al. 1999). Fortunately, such biological complexity can be unraveled
using proteomic studies to understand how cells modulate and integrate signals. 
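
As a rough check on the "several hundred thousand" figure, the estimate can be reproduced by multiplying the draft-genome gene counts quoted earlier in this chapter by the six to eight proteins per gene suggested above. The short Python sketch below is illustrative only; the function names and the decision to combine these particular numbers are assumptions made for this example, not a calculation taken from the cited studies.

```python
# Back-of-the-envelope estimate of the number of distinct human protein forms.
# Gene counts are the draft-genome estimates quoted in this chapter; the
# proteins-per-gene range is the six-to-eight figure cited above.
# All names here are hypothetical and chosen for illustration.

GENE_ESTIMATES = {"public consortium": 31_000, "Celera": 26_000}
PROTEINS_PER_GENE = (6, 8)  # (low, high) protein forms per gene


def estimate_protein_count(gene_count, proteins_per_gene):
    """Return the estimated number of protein forms for one multiplier."""
    return gene_count * proteins_per_gene


def proteome_range(gene_count, per_gene=PROTEINS_PER_GENE):
    """Return the (low, high) estimate of protein forms for a gene count."""
    low, high = per_gene
    return (estimate_protein_count(gene_count, low),
            estimate_protein_count(gene_count, high))


if __name__ == "__main__":
    for source, genes in GENE_ESTIMATES.items():
        low, high = proteome_range(genes)
        print(f"{source}: {low:,} to {high:,} protein forms")
```

Under these assumptions, both gene counts give estimates in the range of roughly 150,000 to 250,000 protein forms, which is consistent with the order of magnitude stated in the text.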


Origins of Cellular Complexity 


From the genome sequencing efforts to date, it is clear that the physiological complexity of 
organisms is not merely a consequence of gene numbers. For instance, humans (although 
composed of ~10,000,000,000,000 cells) have fewer than twice as many genes as the 959-cell 
nematode, Caenorhabditis elegans. Rather, evolution of the increased complexity of higher- 
order organisms is due to a number of other mechanisms, such as alternative splicing 
(Mironov et al. 1999; Black 2000), diversification of gene regulatory networks, and the ability
of intracellular signaling pathways to interact with one another (Weng et al. 1999;
Davidson et al. 2002). Biological signaling pathways can interact to form complex networks
comprising a large number of components. Such complexity arises from the overlapping 
functions of components, from the connections among components, and from the spatial 
relationship between components in the cell. Additionally, many cellular processes are per- 
formed and regulated not by individual proteins but by proteins acting in large protein 
assemblies or macromolecular complexes. For instance, the eukaryotic ribosome, which 
translates RNA into protein, consists of ~80 unique proteins (Wool et al. 1995), and the RNA
polymerase II transcription complex in eukaryotic cells, which transcribes protein-coding
genes into mRNA, comprises at least 50 different proteins (Pugh 1996). For a further
discussion of protein-protein interactions, see Chapter 10.
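
The comparison that opens this section can be made concrete with a few lines of arithmetic. The Python sketch below is a minimal illustration that uses only numbers quoted in this chapter (the human and worm cell counts, the two draft human gene estimates, and the 18,000-gene figure for the worm); the variable and function names are hypothetical.

```python
# Compare how cell number and gene number scale between human and C. elegans,
# using only the figures quoted in this chapter. Names are illustrative.

HUMAN_CELLS = 10_000_000_000_000   # ~10^13 cells
WORM_CELLS = 959                   # C. elegans
WORM_GENES = 18_000
HUMAN_GENE_ESTIMATES = {"public consortium": 31_000, "Celera": 26_000}


def fold_difference(larger, smaller):
    """Return how many times larger the first quantity is than the second."""
    return larger / smaller


def compare_to_worm():
    """Return human-to-worm fold differences for cell and gene numbers."""
    results = {"cells": fold_difference(HUMAN_CELLS, WORM_CELLS)}
    for source, genes in HUMAN_GENE_ESTIMATES.items():
        results[f"genes ({source})"] = fold_difference(genes, WORM_GENES)
    return results


if __name__ == "__main__":
    for label, fold in compare_to_worm().items():
        print(f"{label}: {fold:,.2f}-fold")
```

With these inputs the cell count differs by about ten orders of magnitude, whereas either gene estimate differs by less than a factor of two, which is the disparity the text attributes to mechanisms other than gene number.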

The traditional view of protein function tends to focus on the biochemical activity of a
single protein molecule such as the catalysis of a given reaction or the binding of a ligand to
its cognate receptor. This local function is often referred to as the "molecular function" of a
protein. However, an expanded view of protein function is beginning to emerge in the
post-genome era, with a protein being defined as an element in its network of interactions. This
notion of expanded function has been variously referred to as "contextual function" or
"cellular function" (see Kim 2000). The contemporary view of function is that each protein in
living matter operates as an integral component of an intricate web of interacting molecules.
For excellent reviews on this subject, see Weng et al. (1999) and Eisenberg et al. (2000).


INTEGRATED BIOLOGY (TOTAL SYSTEMS BIOLOGY)


To achieve a full understanding of how a complex organism works, biologists must develop
an integrated (or global) view of a cell’s mRNA and protein complements and a detailed
knowledge of how these complements change with development and the environment
(especially in disease). Mathematically, expression profiles from both mRNA and protein
are required to fully understand how a gene network operates (see Hatzimanikatis and Lee
1999). For example, an integrated genomic and proteomic analysis of a systematically
perturbed glucose/galactose-utilizable pathway in yeast concluded that an analysis of both
mRNAs and proteins is crucial for understanding biological systems (see Ideker et al.
2001).

Protein-protein interactions are a crucial component of this integrated biology. Already,
a large proportion of known protein-protein interactions in yeast have been identified by
genome-scale yeast two-hybrid assays (Legrain and Selig 2000; Schwikowski et al. 2000;
Uetz et al. 2000; Hazbun and Fields 2001; Ito et al. 2001a,b; Legrain et al. 2001) and direct
affinity capture methods (Gavin et al. 2002; Ho et al. 2002). However, the interactions
detected by these physical methods may include nonspecific interactions of no biological
significance. Biologically important protein-protein interactions require that the
interacting partners be in specific protein states (interactions may result in the transition of one
protein state to another), but physical methods, like two-hybrid assays and affinity capture,
do not distinguish between protein states of a given protein molecule. The following is a
list of attributes that define protein states (also see Figure 1.4).


* Covalent modification (e.g., phosphorylation, glycosylation, lipidation, nitrosylation,
acetylation, and ubiquitination). A protein may occur in its “active” or “inactive” form
depending on the state of covalent modifications, such as phosphorylation (Hunter
2000a,b).


* Cellular localization. Depending on the biological status of the cell, a protein molecule
may reside in one or several cellular locations, such as the nucleus, cytosol, plasma
membrane, mitochondria, and endoplasmic reticulum.


* Presence of ligands. The binding of small molecules and ions (e.g., heme, metal ion,
glucose, ATP, ADP, GTP, and GDP) to proteins alters protein states, affecting properties such
as rates of enzyme catalysis and allostery.


* Alternate splicing. Different forms of a protein molecule may result from alternate
splicing of the gene product.


* Proteolytic cleavage. Truncated forms of a protein molecule may result from specific
amino- or carboxy-terminal cleavage or internal cleavage. These truncations alter the
state and activity of the protein. For example, certain proteolytic enzymes are produced
as inactive precursors (zymogens), which must be cleaved to generate an active enzyme.
By restricting the synthesis, location, or activity of the necessary proteases, cells have a
means of regulating proteolytic activity with respect to time and cellular localization
(Khan and James 1998; Kobe and Kemp 1999). Examples include proteases involved in
blood clotting, catabolic digestion (e.g., pepsin, rennin, trypsin, chymotrypsin, and
carboxypeptidase are secreted as inactive precursors), apoptosis (e.g., the key effector
molecules, the caspases, are present as inactive zymogens [Shi 2002]), cleavage of viral
precursor proteins to functional units, and pattern formation in multicellular organisms
(Khan and James 1998). Proteolytic activity is further controlled by specialized proteins
that specifically inhibit the active proteases (Bode and Huber 1992; Khan and James
1998).


* Oligomeric state. A protein molecule may exist in a multiprotein complex or as a
homodimer or homo-oligomer. From yeast interactome studies, it is estimated that at
least 78% of yeast proteins occur in complexes (Gavin et al. 2002).


* Protein conformation. Three-dimensional structure information on different protein
states is important for understanding their biological behavior. For example, protein
function is often regulated by allosteric mechanisms (Monod et al. 1963), in which effector
molecules bind to regulatory sites distinct from the active site, usually inducing
conformational changes that alter the activity. Allosteric effectors usually bear no structural
resemblance to the substrates of their target protein, the classic example being end products
of metabolic pathways acting at early steps of the pathway to exert feedback control.
Intrasteric regulation (Kemp and Pearson 1991), on the other hand, includes autoregulation
of protein kinases and phosphatases by internal amino acid sequences that resemble
the substrate (such internal amino acids are often referred to as pseudosubstrates).
This type of regulation is considered the counterpart of allosteric control (see Figure
1.5). Examples of protein kinases whose regulation is mediated by intrasteric autoregulatory
sequences include twitchin kinase, titin kinase, CAMK-1, insulin receptor kinase,
and MAP kinase ERK2 (for reviews, see Kobe and Kemp 1999; Huse and Kuriyan 2002).


For a descriptive database of biological protein interactions organized in terms of protein
states and state transitions, see LiveDIP (http://www.dip.doe-mbi.ucla.edu/) (Duan et
al. 2002). Additional information on DIP (the Database of Interacting Proteins) can be
found in Chapter 11 and in Table 1.1, which provides a list of Web-accessible databases
containing information on protein-protein interactions.

One of the difficulties (and challenges) in studying protein-protein interactions is that
at any given time, the pool of molecules of a protein inside a cell most likely represents one
or several of the protein states of that particular protein, depending on the cellular context.
This is a major impetus of proteomics, especially cell-mapping proteomics, which aims to 
describe all protein-protein interactions (both spatially and temporally) within a given 
cell. The challenge of proteomics is to utilize existing technologies (and to develop new 
technologies) to define all of the protein states for a given protein molecule. Such infor- 
mation is of crucial importance in post-genome biology, especially total systems biology, 
since it will shed light on the molecular mechanisms underlying biological processes. For 
reviews, see Xenarios and Eisenberg (2001) and Gerstein et al. (2002) for protein-protein 
interactions, and for systems biology, see Brenner (1999b) and Kitano (2002). 
FIGURE 1.4. Physical versus biological protein-protein interactions. Physical interactions are shown as binary relationships
between pairs of proteins. Each protein exists as a collection of states (forms) in the protein state space. For a summary of protein
states and their attributes, see the panel on INTEGRATED BIOLOGY. For example, protein X exists in any of the protein states X1, X2,
etc., which may represent different posttranslationally modified forms of protein X, conformational states, or different cellular
localizations of this protein; protein Y can exist in multiple protein states as well. A given protein state, e.g., X1, interacts only with
a given protein state of its interacting partner. A database that describes protein interactions by protein states and state transitions,
designated LiveDIP, has been described recently (http://www.dip.doe-mbi.ucla.edu) (Duan et al. 2002). (Adapted, with
permission, from Duan et al. 2002.)
FIGURE 1.5. Representation of intrasteric regulation. (Top panel) Intrasteric regulation of enzymes by intramolecular interactions
through an intrasteric autoregulatory sequence (IARS) that is contiguous with the catalytic domain containing the active site. The
enzyme is maintained in an inactive state through the binding of the IARS in an intramolecular fashion that masks the active site.
Allosteric activation by an activatory ligand or protein results in the release of the IARS from the active site. (Middle panel)
Intrasteric regulation of homodimeric enzymes by intermolecular interactions, through an IARS in trans. The IARS interacting with
one subunit is a part of another subunit. (Bottom panel) Intrasteric interaction of heteromeric enzymes by intermolecular interactions,
through an intrasteric regulatory sequence (IRS) on a distinct subunit. For a review of active-site-directed protein regulation,
see Kobe and Kemp (1999). (Reproduced, with permission, from Kobe and Kemp 1999.)
FIGURE 1.6. Proteomics: Bridging the gap between gene and product. Four key technology platforms are essential components
for most proteomic strategies: sample preparation, obtaining protein information (albeit peptide mass/sequence information),
protein information (use of informatics), and targeted proteomics (study of protein-protein interactions, posttranslational
modifications [PTMs], cellular localization, etc.). For a description of the Edman degradation procedures, see Chapter 6. An outline of
mass spectrometry (MS) methods for obtaining protein information is given in Chapter 8.


IDENTIFICATION AND ANALYSIS OF PROTEINS 


Four key platform technologies are crucial to any proteomics strategy aimed at elucidating 
the function of an unknown gene (Figure 1.6): 


* Sample preparation and handling (see Chapters 2, 3, 4, 8, and 10).

* Determination of partial amino acid sequence information (see Chapters 6, 7, and 8).

* Protein identification and quantification (see Chapters 7, 8, and 11).

* Cell mapping (see Chapter 10).


These platform strategies require different, yet complementary, types of expertise. Figure 
1.7 provides an outline of the various identification and analysis strategies for solving any 
proteomics problem. 


Protein Separation Strategies 


One of the rate-limiting steps in any proteomic analysis study is obtaining, and then
handling, sufficient quantities of a target protein(s) from its original biological source. Before
embarking on such a study, it is very important to consider the dynamic range of the sample.
For example, the dynamic range of protein abundance in a biological sample can be as high 
as 10° (i.e., protein abundance may range from 10 copies/cell for transcription factors up to 
1,000,000 copies/cell for the more abundant molecules). The task of purifying trace-abun- 
dant proteins (e.g., growth factors and their receptors, or transcription factors) from natural 
biological sources is often extremely difficult, involving extremely large quantities of starting 
FIGURE 1.7. Proteomic strategies used for the identification and analysis of proteins. (SEC) Size-exclusion chromatography; (IEX)
ion-exchange chromatography; (RP-HPLC) reversed-phase high-performance liquid chromatography (see Chapter 5); (HIC)
hydrophobic interaction chromatography; (2DE) two-dimensional gel electrophoresis (see Chapter 4); (1DE) one-dimensional gel
electrophoresis (see Chapter 2); (FFE) free-flow electrophoresis; (CZE) capillary zone electrophoresis; (FRET) fluorescence
resonance energy transfer.


material (Table 1.2). Typically, a 1-2-millionfold purification is required to acquire a homo- 
geneous target protein if identification is by amino-terminal sequence analysis using Edman 
chemistry. Considerably less material is required and purification to homogeneity is not 
essential if identification is by MS-based methods. 

The classical method for quantitative and qualitative expression proteomics combines 
protein separation by high-resolution 2D gel electrophoresis (Klose 1975; O'Farrell 1975; 
Görg et al. 1988; also see Chapter 4) with MS or MS/MS identification of selected protein
spots (see Chapter 8 and reviews by Yates 1998a,b; Patterson 2000a,b; Griffin et al. 2001). 
Because even the best 2D gels can routinely separate no more than 1500 proteins, this tech- 
nique is limited to the most abundant proteins if a crude protein mixture (e.g., whole-cell 
lysate) is used (see Figure 1.8) (Gygi et al. 2000). Unlike other separation methods, such as 
RP-HPLC (see Chapter 5) and free-flow electrophoresis, which can tolerate large amounts of 
sample, 2D electrophoresis is limited by the amount of material that can be applied to the 
first-dimension immobilized pH gradient gel (~150 µg to low milligram quantities; see
Chapter 4). Hence, 2D gels have limited “scale up” capability. For this reason, it is often
desirable to “trace enrich” for a particular subclass of proteins. By analyzing proteins in a cellular
compartment or organelle, it is possible to reduce the complexity and differences in abun- 
dance of a subset of proteins within a cell (for a list of various cellular organelles and com- 
partments that have been subjected to detailed proteomic analysis, see Table 3.2). 
TABLE 1.2. Examples of low-abundance proteins and peptides isolated from natural biological sources

Protein | Source | Yield (µg) | Reference
Multipotential colony-stimulating factor | Pokeweed mitogen-stimulated mouse spleen-cell-conditioned medium (10 liters) | 1 | Cutler et al. (1985)
Human A33 antigen | Human colon cancer cell lines (107? cells) | 2.5 | Catimel et al. (1996)
Platelet-derived growth factor (PDGF) | Human serum (200 liters) | 180 | Heldin et al. (1981)
Granulocyte colony-stimulating growth factor (G-CSF) | Mouse lung-conditioned medium (3 liters) | 40 | Nicola et al. (1983)
Granulocyte-macrophage colony-stimulating growth factor (GM-CSF) | Mouse lung-conditioned medium (18 liters) | 12 | Burgess et al. (1986)
Coelenterate morphogen | Sea anemone (200 kg) | 20 | Schaller and Bodenmuller (1981)
Peptide YY (PYY) | Porcine intestine (4000 kg) | 600 | Tatemoto (1982)
Tumor necrosis factor (TNF) | HL66 tissue culture medium (18 liters) | 20 | Wang and Creasey (1985)
Murine transferrin receptor | N5-1 myeloma cells (10 cells) | 20 | van Driel et al. (1984)
Fibroblast growth factor (FGF) | Bovine brain (4 kg) | 33 | Gospodarowicz et al. (1984)
Transforming growth factor-β (TGF-β) | Human placenta (8.8 kg) | 47 | Frolik et al. (1983)
Human interferon | Human leukocyte-conditioned medium (10 liters) | 21 | Rubinstein et al. (1979)
Muscarinic acetylcholine receptor | Porcine cerebrum (600 g) | 6 | Haga and Haga (1985)
β₂-Adrenergic receptor | Rat liver (400 g) | 2 | Graziano et al. (1985)

Adapted, with permission, from Simpson and Nice (1989).
FIGURE 1.8. Relationship between protein copy number and cell quantity. Assuming 100% recovery of protein
from cells, 10⁶ cells containing 1000 copies of a specific protein molecule per cell yield 1.6 fmol of
this protein (this value is calculated using Avogadro's number, which dictates that there are 6.02252
× 10²³ molecules in 1 mole or gram-molecular weight of a given substance). For a 25-kD protein, this
translates to 40 pg of protein present. (Courtesy of Ben Herbert, Proteome Systems Ltd., Sydney, Australia.)
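The copy-number arithmetic behind this kind of estimate can be sketched in a few lines of Python. This is a minimal illustration only: the function name `protein_amount`, the `recovery` parameter, and the example inputs (10⁶ cells, 1000 copies per cell, a 25-kD protein) are assumptions chosen for the example, and the current value of Avogadro's number is used.

```python
AVOGADRO = 6.02214076e23  # molecules per mole


def protein_amount(n_cells, copies_per_cell, mass_da, recovery=1.0):
    """Return (moles, grams) of one protein obtained from n_cells.

    mass_da is the molecular mass in daltons (numerically g/mol);
    recovery is the fraction of protein recovered (1.0 = 100%).
    """
    molecules = n_cells * copies_per_cell * recovery
    moles = molecules / AVOGADRO
    grams = moles * mass_da
    return moles, grams


# Illustrative inputs (assumed): 1e6 cells, 1000 copies/cell, 25-kD protein.
moles, grams = protein_amount(1e6, 1000, 25_000)
print(f"{moles * 1e15:.2f} fmol, {grams * 1e12:.1f} pg")  # 1.66 fmol, 41.5 pg
```

Lowering `copies_per_cell` to 10 in the same call shows why trace-abundant proteins demand very large amounts of starting material.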
FIGURE 1.9. Cell mapping: Affinity capture methods.


Cell Mapping and Identification of Proteins in Complexes 


One way to observe interacting proteins involved in a given biological process is to specifi- 
cally enrich for these proteins (see Chapter 10). Typically, this requires knowledge of the 
activity of at least one protein in the multiprotein complex. Under nondenaturing condi- 
tions, interacting proteins can be enriched from complex protein mixtures (e.g., cell lysates) 
using methods (see Figure 1.9) such as: 


* Coimmunoprecipitation or “pull-down” techniques using antibodies directed against one
of the component proteins (Adams et al. 2002).


* Coprecipitation using affinity-tagged recombinant proteins and antibodies directed
against the “tag” epitope (see Chapter 10, Protocol 1) (Séraphin et al. 2002).


* Protein-affinity-interaction chromatography (e.g., using recombinant glutathione
S-transferase [GST]-fusion proteins and glutathione-affinity chromatography) (see Chapter
10, Protocol 2) (Einarson and Orlinick 2002).


* Isolation of intact multiprotein complexes (e.g., nuclear pore complexes, ribosome
complexes, and spliceosomes).


TABLE 1.3. Representative macromolecular complexes identified by proteomics methods

Multiprotein complex | Number of protein components identified in complex | References
S. cerevisiae nuclear pore complex | ~30 | Rout et al. (2000); Allen et al. (2001)
S. cerevisiae 80S ribosome | 75 | Link et al. (1999)
Human spliceosome complex | 46 | Neubauer et al. (1998)
S. cerevisiae U1 snRNP complex* | 20 | Neubauer et al. (1997)
E. coli chaperonin GroEL | ~300 | Houry et al. (1999)
S. cerevisiae 26S proteasome | 224 | Verma et al. (2000)

*snRNP = small nuclear ribonucleoprotein.
Determination of Partial Amino Acid Sequence


Usually, the final step of most proteomic studies, independent of the purification method
employed, utilizes either SDS-PAGE (see Chapter 2) or 2D acrylamide gels (see Chapter 4) to
separate the proteins for identification and characterization. Following electroblot transfer to
an inert membrane, such as polyvinylidene difluoride (PVDF), intact proteins can be identified
directly by amino- or carboxy-terminal amino acid sequence analysis or indirectly from
peptides generated by in-gel or on-membrane digestion of the protein with a protease, usually
trypsin (see Figure 6.12 and associated text). MS-based methods usually identify a protein, not
by analyzing it directly, but by analyzing the peptides derived from proteolytic digestion. The
main advantage of this approach is the ease with which proteolytically generated peptides can
be recovered from gel slices (and inert membranes), compared with the difficulty of recovering
intact proteins from acrylamide gels. Moreover, a small number of peptides usually yield
sufficient information to permit protein identification (by peptide mass fingerprinting [PMF]
and/or MS/MS of individual peptides). (For a detailed description of methods and strategies
for the mass spectrometric identification of proteins, see Chapter 8.) The steps typically
involved in the MS-based identification of a protein are illustrated in Figure 1.10.


ESSENTIAL ELEMENTS OF A MASS SPECTROMETER
Mass spectrometers consist of three essential components: 


* An ionization source, which converts molecules in either solution or solid form into
gas-phase ions.


* A mass analyzer, which separates the gas-phase ions according to their individual
mass-to-charge ratios (m/z).


* An ion detector, which measures the m/z of each ion.


A mass analyzer uses a physical property (e.g., time-of-flight [TOF]) or electric or magnetic
field (quadrupole or ion trap) to separate ions of a particular m/z that are subsequently
measured by striking the ion detector. An important contribution of MS to proteomics has been
the development in the mid 1980s of two soft ionization techniques that create ions of proteins
and peptides.

* Matrix-assisted laser desorption ionization (MALDI) creates ions by excitation with a laser
of a sample that is mixed or dissolved with an excess amount of a matrix component. The
laser energy strikes the crystalline matrix (which has an absorption wavelength that matches
closely with the laser wavelength) and causes rapid excitation of the matrix and subsequent
ejection of a plume of matrix and analyte (protein/peptide) ions into the gas phase. The singly
charged ions are then guided to the mass analyzer and the detector by electrostatic lenses.
MALDI is generally suitable for high-throughput analysis of complex mixtures of analytes
and is used typically in conjunction with TOF analyzers to produce accurate measurement
of molecular weight in the low ppm level (see Figures 1.10b and 1.11a,b).

* Electrospray ionization (ESI) creates gas-phase ions by applying a potential to a flowing
liquid that contains the analyte and solvent molecules. A fine spray of microdroplets is
generated upon application of a high electrical tension through a needle. Solvent is removed as the
droplets enter the mass spectrometer by heat or some other form of energy such as energetic
collisions with an inert gas (in some instruments, a heated capillary is placed following the
electrospray needle to facilitate solvent evaporation). In contrast to MALDI, ESI yields
multiply charged ions that require mass spectral deconvolution. The detection limits that can be
achieved with ESI have improved dramatically by reducing the flow rate to nanoliter/minute
levels using capillary chromatography (see Chapters 7 and 8). ESI is used typically in
conjunction with quadrupole or ion-trap mass analyzers to produce information by tandem
mass spectrometry (MS/MS) which is diagnostic of amino acid sequence (see Figures 1.10c
and 1.11c,d). For a more detailed description of MS, see Chapter 8.
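The need to deconvolute multiply charged ESI ions can be illustrated with a minimal Python sketch. It assumes positive-ion mode with protonated species, so that an ion of neutral mass M carrying z protons appears at m/z = (M + z × 1.007276)/z. Two adjacent peaks in a charge envelope then suffice to recover z and M. The function names and the 17,000-dalton example mass are hypothetical and serve only to illustrate the calculation; they do not describe any particular instrument's software.

```python
PROTON = 1.007276  # mass of a proton in daltons


def mz_of(mass, z):
    """m/z of a species of neutral mass `mass` carrying z protons."""
    return (mass + z * PROTON) / z


def deconvolute_pair(mz_high, mz_low):
    """Recover (z, neutral mass) from two adjacent charge-state peaks.

    mz_high is the peak with charge z; mz_low is its neighbor with
    charge z + 1 (a higher charge gives a lower m/z).
    """
    z = round((mz_low - PROTON) / (mz_high - mz_low))
    mass = z * (mz_high - PROTON)
    return z, mass


# Hypothetical 17,000-Da protein observed in its 10+ and 11+ charge states.
peak_10 = mz_of(17000.0, 10)
peak_11 = mz_of(17000.0, 11)
z, mass = deconvolute_pair(peak_10, peak_11)
print(z, round(mass, 3))
```

In practice, deconvolution software averages over many charge states rather than a single pair, which reduces the effect of measurement error on the recovered mass.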
FIGURE 1.10. A strategy for mass spectrometric identification of proteins and posttranslational modifications. (a) Affinity capture
using a bait protein to isolate protein-binding partners from a cell lysate. A GST fusion protein containing the SOCS-1 SOCS box
sequence (Zhang et al. 1999) is used to illustrate the principle of affinity capture. SDS-PAGE analysis of affinity column eluates from
GST-control (−) and GST-SOCS-1-SOCS-box glutathione (+) are shown. After 1D gel electrophoresis, the gel was stained with
Coomassie and the protein bands of interest excised and subjected to trypsin digestion (see Chapters 2 and 7). (b) Analysis of an aliquot
of the tryptic peptide mixture using a MALDI quadrupole/orthogonal acceleration TOF mass spectrometer (see Chapter 8). The resultant
spectrum represents a peptide mass fingerprint (PMF) of a protein. The peptide masses can be entered into an algorithm, which
matches them against theoretically predicted peptides of proteins in publicly available databases (see Chapter 8). (c) Analysis of another
aliquot of the peptide mixture using an electrospray ion-trap mass spectrometer, which is coupled on-line to a capillary RP-HPLC.
(Top panel) Peptide masses at a given time. A typical experiment entails isolating the most intense ion in the spectrum (i.e., m/z 765.6),
and performing collision-induced dissociation to generate sequence ions. (Bottom panel) From the resultant MS/MS spectrum, amino
acid sequence information can be derived via manual interpretation or by using an algorithm that correlates the experimental spectrum
with those in a database (see Chapter 8). (d) For phosphopeptide analysis, peptide-containing fractions from a capillary RP-HPLC
separation are subjected to MALDI-TOF analysis operated in “linear mode” (top panel) and also in “reflectron mode” (bottom
panel). It can be seen that the major peptide ion detected in “linear mode” (m/z 1881.8) is metastable, resulting in a major ion loss of
98 daltons (-H₃PO₄) and a lesser ion loss of 80 daltons (-HPO₃) that are observable in “reflectron mode.” Ion losses of 98 and 80 daltons,
or multiples thereof, indicate that a parent ion is phosphorylated (see Chapter 9) (Zugaro et al. 1998).


[Figure 1.11 panels: (a) Mass spectrometer: ion source, mass analyzer (TOF), detector. (b) MALDI-TOF mass spectrum.
(c) Tandem mass spectrometer: ion source, mass analyzer-1, collision cell, mass analyzer-2, detector. (d) MS/MS
(ESI-Quad) spectrum, plotted against m/z, with the y-ion series marked on the peptide TITLEVEPSDTIENVK.
Human ubiquitin sequence shown beneath panel b: MQIFVKTLTGKTITLEVEPSDTIENVKAKIQDKEGIPPDQQRLIFAGKQLEDGRTLSDYNIQK...]

FIGURE 1.11. Basic elements of MS and tandem MS. (a) Schematic of a linear MALDI-TOF instrument depicts ions leaving the
ion source and entering the mass analyzer, the TOF tube (see Chapter 8). Ions of different masses travel along the TOF tube at
different speeds. The time they take to hit the detector is correlated with their mass. (b) Mass spectrum of a peptide mixture using a
MALDI-TOF instrument. The peptide ions comprise a “peptide mass fingerprint” (PMF) for a specific protein. This information
permits the identification of the protein from which the peptides were derived (see Chapter 8). Tryptic peptides derived from
human ubiquitin (sequence below) are shown as an example. (c) Schematic of a triple-quadrupole tandem mass spectrometer (see
Chapter 8). Ions enter the first quadrupole (Mass analyzer-1) where they are analyzed. One of these ions is selected to enter the
collision cell whereupon it is fragmented upon collision with helium gas. The fragment ions leave the collision cell and are separated
in the next quadrupole (Mass analyzer-2) before being scanned out to the detector. (d) MS/MS spectrum of a human ubiquitin
peptide. The fragment ions in this spectrum correspond to amino acid residues TITLEVEPSDTIENVK. Proteins can be identified
from such peptide sequence information.
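As a rough illustration of how fragment ions encode sequence, the following Python sketch computes the singly charged b- and y-ion series for the ubiquitin peptide TITLEVEPSDTIENVK named in the caption. It assumes standard monoisotopic residue masses, unmodified residues, and the usual conventions that a b ion is the sum of its residues plus a proton and a y ion is the sum of its residues plus water plus a proton. The function name `by_ions` is hypothetical.

```python
# Monoisotopic residue masses (Da) for the 20 standard amino acids.
RESIDUE_MASS = {
    "G": 57.02146, "A": 71.03711, "S": 87.03203, "P": 97.05276,
    "V": 99.06841, "T": 101.04768, "C": 103.00919, "L": 113.08406,
    "I": 113.08406, "N": 114.04293, "D": 115.02694, "Q": 128.05858,
    "K": 128.09496, "E": 129.04259, "M": 131.04049, "H": 137.05891,
    "F": 147.06841, "R": 156.10111, "Y": 163.06333, "W": 186.07931,
}
WATER = 18.01056
PROTON = 1.00728


def by_ions(peptide):
    """Return (b, y): singly charged m/z lists for b1..b(n-1) and y1..y(n-1)."""
    masses = [RESIDUE_MASS[aa] for aa in peptide]
    b, y = [], []
    running = 0.0
    for m in masses[:-1]:            # amino-terminal fragments
        running += m
        b.append(running + PROTON)
    running = 0.0
    for m in reversed(masses[1:]):   # carboxy-terminal fragments
        running += m
        y.append(running + WATER + PROTON)
    return b, y


peptide = "TITLEVEPSDTIENVK"
b, y = by_ions(peptide)
for i, (bi, yi) in enumerate(zip(b, y), start=1):
    print(f"b{i:<2} {bi:9.4f}   y{i:<2} {yi:9.4f}")
```

The difference between consecutive y ions (or b ions) equals a residue mass, which is what allows a sequence to be read from an MS/MS spectrum. Leucine and isoleucine have identical masses and cannot be distinguished this way.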


In contrast to peptides, the molecular mass of intact proteins is usually insufficient to
allow database identification. (This problem is compounded by the fact that very little is
known about the extent of posttranslational modifications of most proteins.) However, the
identification of intact proteins directly by MS will most likely improve with the development
of “top-down” sequencing strategies (McLafferty 1999; Sze et al. 2002; see the information
panels on FRAGMENTATION MECHANISMS OF PROTONATED PEPTIDES IN THE GAS PHASE and
“TOP-DOWN” PROTEIN SEQUENCE ANALYSIS USING MS/MS in Chapter 8) where intact proteins are
fragmented within a specialized Fourier transform-ion cyclotron resonance (FT-ICR)
mass spectrometer, and the derived fragments are selected for further fragmentation by
collision with an inert gas. This produces sufficient information for identification (see the
information panel on “TOP-DOWN” PROTEIN SEQUENCE ANALYSIS USING MS/MS in Chapter 8).
MASS SPECTROMETRIC IDENTIFICATION METHODS USED IN PROTEOMICS


* MALDI and peptide mass fingerprinting (PMF). MALDI-MS is used to determine the
accurate mass of a group of peptides derived from a protein by digestion with a
sequence-specific protease, usually trypsin, thus generating a peptide mass map or peptide mass
fingerprint. Because trypsin cleaves proteins on the carboxy-terminal side of the amino acids
arginine and lysine, the masses of tryptic peptides can be predicted theoretically for any entry
in a protein sequence database. Experimentally obtained peptide masses are compared with those
obtained theoretically, and the protein can be identified correctly if there are a sufficient
number of peptide matches for a protein in the database. PMF is thus popular for identifying
proteins from species for which complete genome sequences have been determined.
The method is not suited for searches of EST databases, because the ESTs represent
only portions of gene-coding sequences, which may be too short to cover a sufficient
number of peptides observed in the experimentally obtained PMF. Digests of complex
protein mixtures are not suited for PMF, because it is unclear which peptides in the complex
peptide mixture originate from the same protein. Rather, PMF is better suited for
identification of proteins separated by 2D gels, where ancillary information about protein
molecular weights and isoelectric points can be used to aid in identification. High mass
accuracy is critical for unambiguous identifications.


* Electrospray ionization and tandem mass spectrometry. Individual peptides from a peptide
mixture are isolated in the first step in the mass spectrometer and fragmented by
collision-induced dissociation (CID, i.e., by collision with an inert gas in a collision cell) during
the second step in order to sequence the peptide (hence, the term tandem mass
spectrometry). The fragments obtained by this method are derived from the amino or carboxyl
terminus of the peptide and are designated b or y ions, respectively (for a discussion
of the principles of gas-phase fragmentation, see the information panel on FRAGMENTATION
MECHANISMS OF PROTONATED PEPTIDES IN THE GAS PHASE in Chapter 8). CID spectra
contain redundant pieces of information, such as overlapping b- and y-series ions, multiple
internal ions from the same peptide, and immonium ions. This redundancy makes
CID spectra a rich source of sequence information that is highly specific to an individual
peptide. These data also contain information that is diagnostic of the amino acid sequence
of peptides. Because peptides fragment in the CID process in a predictable manner,
sequences in the database can be used to predict an expected fragmentation pattern for
comparison with that observed in the experimental MS/MS spectrum (see Figure 1.11d). Advantages of this
approach, compared to PMF, are that a protein in a complex mixture can be identified
from the CID spectrum of a single peptide, and matching one or more tandem mass spectra
to peptide sequences in the same protein provides a high level of confidence in the
identification process. Hence, the MS/MS identification approach is amenable to searching
EST databases (Verhagen et al. 2000). For a full description of MS-based approaches
for identifying proteins, see Chapter 8.
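The logic of peptide mass fingerprinting described in the list above can be sketched in Python as an in silico tryptic digest followed by a mass comparison within a tolerance. This is a simplified illustration and not the scoring scheme of any particular search program: it assumes cleavage after lysine or arginine except before proline, no missed cleavages, unmodified residues, and singly protonated monoisotopic masses. The function names, the 20 ppm tolerance, and the synthetic "observed" masses are assumptions made for the example. Human ubiquitin is used as the database entry.

```python
import re

# Monoisotopic residue masses (Da) for the 20 standard amino acids.
RESIDUE_MASS = {
    "G": 57.02146, "A": 71.03711, "S": 87.03203, "P": 97.05276,
    "V": 99.06841, "T": 101.04768, "C": 103.00919, "L": 113.08406,
    "I": 113.08406, "N": 114.04293, "D": 115.02694, "Q": 128.05858,
    "K": 128.09496, "E": 129.04259, "M": 131.04049, "H": 137.05891,
    "F": 147.06841, "R": 156.10111, "Y": 163.06333, "W": 186.07931,
}
WATER = 18.01056
PROTON = 1.00728

UBIQUITIN = ("MQIFVKTLTGKTITLEVEPSDTIENVKAKIQDKEGIPPDQQRLIFAGKQLEDGR"
             "TLSDYNIQKESTLHLVLRLRGG")


def tryptic_digest(sequence):
    """Cleave after K or R, but not before P (no missed cleavages)."""
    return [p for p in re.split(r"(?<=[KR])(?!P)", sequence) if p]


def mh_plus(peptide):
    """Monoisotopic mass of the singly protonated peptide, [M+H]+."""
    return sum(RESIDUE_MASS[aa] for aa in peptide) + WATER + PROTON


def count_matches(observed, sequence, tol_ppm=20.0):
    """Count observed masses within tol_ppm of any theoretical peptide."""
    theoretical = [mh_plus(p) for p in tryptic_digest(sequence)]
    return sum(
        any(abs(obs - theo) / theo * 1e6 <= tol_ppm for theo in theoretical)
        for obs in observed
    )


peptides = tryptic_digest(UBIQUITIN)
print(peptides)

# Synthetic "observed" peaks: three ubiquitin peptides with a 5 ppm error,
# plus one unrelated peak that should not match.
observed = [mh_plus(p) * (1 + 5e-6)
            for p in ("TITLEVEPSDTIENVK", "ESTLHLVLR", "TLSDYNIQK")]
observed.append(1234.5678)
print(count_matches(observed, UBIQUITIN), "of", len(observed), "peaks matched")
```

A real search ranks every database entry by such matches and weighs them statistically. Tightening `tol_ppm` reduces chance matches, which is why high mass accuracy matters for unambiguous identification.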


DIFFERENTIAL DISPLAY PROTEOMICS (COMPARATIVE PROTEOMICS) 


A fundamental aspect of proteomics research is the determination of protein expression
levels between two different states of a biological system (i.e., relative quantification of protein
levels), such as that encountered between normal and diseased cells or tissues. This is often
referred to as differential display or comparative proteomics. Although using image analysis
to contrast the position and intensity of stained spots in a 2D electrophoresis gel has been
used for comparative proteomics (Görg et al. 2000; Zhou et al. 2002b), this approach lacks
adequate dynamic range and is more qualitative than quantitative. To add a quantitative
dimension to proteomics, especially non-2D electrophoresis gel-based proteomic strategies,
an alternative technique based on stable isotope dilution and MS has emerged. The stable isotope
dilution approach has been used extensively in the pharmaceutical industry to quantitatively
measure small molecules. The method relies on the premise that two analytes that are
chemically identical but of different stable isotope compositions (e.g., ²H, ¹³C, ¹⁵N, ¹⁸O)
can be resolved in a mass spectrometer, and that their relative signal intensities
hence yield a true measure of the relative abundance of the two analytes in the sample
(Browne et al. 1981; De Leenheer and Thienpont 1992). Although MS is not quantitative per
se due to variances in detector response and ionization yields of different analytes, observed
peak ratios of isotopic analogs are highly accurate because the isotopic species are chemically
identical and they are analyzed in the same experiment using the same instrument. For
comparative proteomic analyses, proteins from a reference sample are compared with a second
sample that is isotopically labeled with a heavy stable isotope. Following isotopic labeling,
which can be performed either in vivo or in vitro, proteins are proteolytically digested
and the generated peptides are analyzed by liquid chromatography (LC)-MS/MS to determine
both the identity of proteins and the relative ratio of proteins expressed (see Figure
1.12). Because all of the peptides in the sample exist in pairs of identical amino acid sequence
(i.e., with the same physicochemical properties) but different isotope patterns (different
masses), they are expected to behave identically during isolation, fractionation, and ionization.
By determining the ratio of intensities of the lower- and higher-mass components of a
peptide, the relative abundance of that peptide (and hence protein) can be established. For a
summary of various in vitro and in vivo isotopic labeling approaches, see Table 1.4 and
Chapter 8, Protocols 9 and 10.
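The ratio calculation at the heart of stable isotope dilution can be sketched in Python as follows. This is a minimal illustration under stated assumptions: each peptide contributes one (light, heavy) pair of peak intensities, the protein-level ratio is taken as the median of its peptide ratios (one simple choice among several), and the 8-dalton mass shift and the intensity values are invented for the example. The function names are hypothetical.

```python
from statistics import median


def pair_spacing(mass_shift_da, z):
    """m/z spacing between the light and heavy forms of a peptide of charge z."""
    return mass_shift_da / z


def protein_ratio(peptide_pairs):
    """Median heavy/light intensity ratio over the peptides of one protein.

    peptide_pairs is a list of (light_intensity, heavy_intensity) tuples;
    pairs with no light signal are skipped to avoid division by zero.
    """
    ratios = [heavy / light for light, heavy in peptide_pairs if light > 0]
    return median(ratios)


# Invented intensities for three peptide pairs from the same protein.
pairs = [(1.0e5, 2.1e5), (4.0e4, 7.6e4), (2.5e5, 5.0e5)]
print(protein_ratio(pairs))   # about 2: the heavy-labeled sample has ~2x more
print(pair_spacing(8.0, 2))   # a doubly charged pair is separated by 4 m/z units
```

Because both members of a pair elute and ionize together, the ratio is largely insensitive to differences in ionization yield between peptides, which is the point made in the paragraph above.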
FIGURE 1.12. Scheme for determining protein expression (comparative proteomics). MS can be used as a
universal detector to determine relative protein expression between two samples of interest. In vivo and in
vitro isotopic labeling of proteins is followed by protein purification, proteolytic digestion, and analysis by
LC-MS/MS to determine both the identity of peptides (and hence, proteins) and relative ratio of proteins
expressed (by comparison of isotope ratios).
TABLE 1.4. Summary of various in vitro and in vivo isotopic labeling approaches


In vivo labeling (incorporation of stable isotopes into metabolic products)


Microorganisms: All ¹⁴N atoms are replaced by ¹⁵N using ¹⁵N-substituted media. Because
¹⁵N-substituted media are difficult and expensive to make for mammalian systems, this method
has been restricted to microorganisms that can be grown in this medium.
References: Oda et al. (1999); Lahm and Langen (2000); Conrads et al. (2001)


Mammalian cell lines: Mammalian cell lines are grown in media lacking a standard amino acid
but supplemented with an isotopically labeled form of that amino acid (e.g., deuterated leucine).
References: Ong et al. (2002)


In vitro labeling (incorporation of stable isotopes performed postbiosynthesis by chemical derivatization)


¹⁸O labeling: In this method, ¹⁸O is incorporated from H₂¹⁸O into carboxyl groups of peptides
during proteolysis or deglycosylation. Although this method is robust, after amide bond cleavage,
trypsin continues to exchange ¹⁸O into carboxyl groups of peptides with basic carboxy-terminal
amino acids (the rate of exchange can be structure-specific). Accurate isotope ratio
measurements require careful deconvolution of mass spectral data.
References: Kuster and Mann (1999); Mirgorodskaya et al. (2000); Yao et al. (2001)


Esterification: Carboxyl groups of aspartic and glutamic residues and the carboxyl terminus of
peptides are differentially methylated with either ¹H₃- or ²H₃-methanol.
References: Goodlett et al. (2001)


Isotopically coded affinity tags (ICAT): An isotopically labeled affinity reagent is attached to
cysteine residues in all proteins in the population. (The ICAT reagent is a sulfhydryl-directed
alkylating reagent composed of iodoacetic acid attached to biotin through a short, oligomeric
coupling arm.) Following tryptic digestion, the labeled peptides are affinity-purified using the
incorporated affinity tag (biotin was used in the first iteration of the method) to achieve a
pronounced enrichment (and simplification) of the peptide mixture. In a second iteration of the
method, considerable improvements have been made by attaching the cysteine peptides to solid
beads and photo-releasing the peptides afterward.
References: Gygi et al. (1999b,c); Zhou et al. (2002a)

Alternative isotopically coded affinity tags: A number of variations on the Munchbach et al. (2000); Goodlett et al. 
ICAT method have been proposed. Like ICAT, these methods share the (2001); Cagney and Emili (2002); Regnier 
requirement of in vitro chemical modification of the peptides or proteins et al, (2002) 


and a peptide selection step. The method of Regnier differs from ICAT 
in that it does not include an affinity capture step. 
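
As a worked illustration of the in vivo ¹⁵N approach in Table 1.4, the expected mass difference between the light and heavy forms of a peptide follows from its nitrogen count. The sketch below assumes complete replacement of ¹⁴N by ¹⁵N and an unmodified peptide; the example sequence and the function names are hypothetical.

```python
# Nitrogen atoms per amino acid residue: one backbone nitrogen plus any side-chain nitrogens.
NITROGENS = {aa: 1 for aa in "ACDEFGILMPSTVY"}
NITROGENS.update({"K": 2, "N": 2, "Q": 2, "W": 2, "H": 3, "R": 4})

# Monoisotopic mass difference between 15N and 14N, in daltons.
DELTA_15N = 15.0001089 - 14.0030740


def nitrogen_count(peptide):
    """Total number of nitrogen atoms in an unmodified peptide sequence."""
    return sum(NITROGENS[aa] for aa in peptide)


def n15_mass_shift(peptide):
    """Mass increase (Da) of a peptide, assuming every 14N is replaced by 15N."""
    return nitrogen_count(peptide) * DELTA_15N


peptide = "SAMPLER"  # hypothetical tryptic peptide
print(peptide, nitrogen_count(peptide), round(n15_mass_shift(peptide), 3))
```

Because the shift depends on sequence composition, each peptide pair in a ¹⁵N experiment has its own light/heavy spacing, unlike tag-based methods in which the spacing is fixed by the reagent.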


POSTTRANSLATIONAL MODIFICATIONS 


The vast majority of all eukaryotic proteins are posttranslationally modified and more than 
200 posttranslational modifications (PTMs) of amino acids have been reported thus far 
(Krishna and Wold 1997). Practically all PTMs are associated with either an increase or a 
decrease in molecular mass. Hence, MS is an ideal tool in proteomics studies of PTM identi- 
fication and characterization. It is generally acknowledged that knowing the nature of PTMs 
will facilitate our understanding of protein function. For a list of reported PTMs and their 
molecular masses, see Appendix 1. 

In general, the MS methods used for protein identification are also applicable to the
analysis of PTMs. However, PTM analyses are inherently more difficult than simple
protein/peptide identification for the following reasons:
* Dynamic range is low. PTMs of proteins are typically at a low stoichiometry; hence, high
sensitivity of detection is required. For example, if only 5-10% of a protein kinase substrate
is phosphorylated, then a further 90-95% of the modified protein may be required for
detection at the levels used for the nonphosphorylated form of the substrate.


* Isolation of the modified peptide containing the PTM is required to identify the PTM and
the position (i.e., the site) of that modification within the peptide sequence. In contrast,
proteins can be identified from the amino acid sequence of a single peptide, deduced from its
CID spectrum.


* PTMs are frequently labile. Because the covalent bond between the PTM and amino acid
side chain in the peptide is typically labile, it is sometimes difficult to maintain the peptide
in its modified state during sample preparation and subsequent ionization in the mass
spectrometer.


* PTMs are frequently transient in nature. This makes their "capture" for the purposes of
analysis extremely difficult.
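
One way to read the low-stoichiometry point in the list above is as a simple scaling problem: if only a fraction of the substrate carries the modification, correspondingly more total protein is needed before the modified form reaches the amount that suffices for the unmodified form. The sketch below works through that arithmetic; the function name is hypothetical and the proportional model is a simplifying assumption, not a statement about any particular instrument.

```python
def fold_more_material(stoichiometry):
    """Fold increase in total protein needed so that the modified fraction alone
    matches the amount that suffices for the unmodified protein.

    stoichiometry: fraction of the protein that is modified (0 < value <= 1).
    """
    if not 0 < stoichiometry <= 1:
        raise ValueError("stoichiometry must be in the interval (0, 1]")
    return 1.0 / stoichiometry


for fraction in (0.05, 0.10):
    print(f"{fraction:.0%} modified: about {fold_more_material(fraction):.0f}-fold more protein")
```

Under this assumption, 5-10% phosphorylation corresponds to roughly 10- to 20-fold more starting material, which is why enrichment steps matter so much in PTM analysis.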


The two major PTMs of proteins are phosphorylation and glycosylation. Of these, the
reversible phosphorylation of proteins ranks among the most important PTMs that occur in
the cell. For a detailed discussion and protocols currently used to identify and characterize
phosphorylation, see Chapter 9.


Phosphorylation of Proteins 


Phosphorylation of proteins is a ubiquitous regulatory mechanism in both eukaryotes and
prokaryotes. Intracellular phosphorylation is regulated by protein kinases (dephosphorylation
is controlled by protein phosphatases), which are activated in response to extracellular
signals and trigger cells to switch on or off many diverse processes such as metabolic pathways,
kinase cascade activation, membrane transport, gene transcription, and motor mechanisms.
The human genome contains 575 protein kinase domains, representing 2% of the total
genome; the kinase domain is the third most populous domain in proteins (Lander et al. 2001).
For reviews on the structural roles of protein phosphorylation and the roles of phosphorylation
in signal transduction, see Hunter (2000a,b), Pawson and Nash (2000), Adams (2001), and
Johnson and Lewis (2001).

Protein phosphorylation can be examined in several ways. Table 1.5 provides a comparison
of mass spectrometric, Edman degradation, and ³²P-labeling and phosphopeptide-mapping
methods. For a detailed discussion and protocols for phosphoprotein and phosphopeptide
analysis, see Chapter 9 and recent reviews covering various aspects of this topic (see, e.g.,
Resing and Ahn 1997; Yan et al. 1998; Aebersold and Goodlett 2001; Sickmann and Meyer
2001; Mann et al. 2002).


TABLE 1.5. Comparison of widely used methods for phosphoprotein and phosphopeptide analysis

Methods compared: (A) phosphopeptide mapping of ³²P-labeled proteins and peptides;
(B) amino-terminal sequencing using the Edman degradation procedure; (C) mass spectrometry.

Sensitivity
  (A) most sensitive method
  (B) less sensitive (low pmole levels)
  (C) highly sensitive (low fmole levels)

Radioactivity requirement
  (A) yes
  (B) in some methods
  (C) no

Localization of phosphorylation sites
  (A) mutagenesis studies usually required to detect sites of phosphorylation
  (B) yes (however, tyrosine phosphorylation sites can be difficult)
  (C) yes

Homogeneous protein required
  (A) yes
  (B) yes
  (C) no

Sample throughput
  (A) very slow (this approach is labor-intensive)
  (B) slow (very labor-intensive)
  (C) can be high if automated LC-MS/MS methodologies utilized


FIGURE 1.13. Summary of the various analytical methods for phosphopeptide/phosphoprotein enrichment,
detection, and microanalysis. (TLC) Thin-layer chromatography; (IMAC) immobilized metal affinity
chromatography.

Because only a portion of the proteins in a proteome are phosphorylated at any given 
time, it is essential to employ an enrichment step for phosphoproteins/phosphopeptides 
prior to their detection and subsequent microcharacterization. Figure 1.13 provides a summary
of the various analytical methods for phosphoprotein and phosphopeptide enrichment,
detection, and microanalysis.


Enrichment Methods for Phosphoproteins and Phosphopeptides 


Several antibodies are commercially available that recognize low-abundance
tyrosine-phosphorylated proteins and that can be used for immunoprecipitation. However, these
antibodies are not very effective for enriching phosphopeptides. Currently, no commercially
available antibodies can be used for immunoprecipitating proteins that are phosphorylated on
serine or threonine residues. Phosphopeptides can be enriched using other methods employing
various chromatography separation modalities. For example, phosphopeptides can be concentrated
and desalted using reversed-phase C₁₈ columns prior to ESI-MS (see Chapter 9,
Protocol 5). However, because of the hydrophilic nature of many phosphopeptides, significant
losses can occur since these peptides are not retained on the chromatographic support
(Verma et al. 1997). Other chromatographic supports that can be used to enrich
phosphopeptides include Oligo R3, a perfusion chromatography resin originally used to purify
oligonucleotides (Matsumoto et al. 1997; Neubauer and Mann 1999), and porous graphitic
carbon (Chin and Papac 1999; see Chapter 9).

Affinity selection of phosphopeptides using Fe(III)-, Al(III)-, and Ga(III)-loaded immobilized
metal affinity chromatography (IMAC) columns provides a rapid means for enriching
phosphopeptides. IMAC exploits the high affinity of phosphate groups to bind to
metal-chelated resins, especially Fe(III)- and Ga(III)-loaded resins (Porath 1992; Posewitz and
Tempst 1999; Stensballe et al. 2000; Zhou et al. 2000). Although IMAC has been successfully
used in on-line and off-line formats to analyze phosphopeptides using MS (Mann et al.
2002), a major drawback is that these columns also bind acidic peptides (i.e., those rich in
aspartic acid and glutamic acid). One way of attaining a much higher specificity for
phosphopeptides is to esterify acidic residues (methyl esterification) prior to loading the peptides
onto the IMAC column (Ficarro et al. 2002). For protocols outlining the use of IMAC for
phosphopeptide enrichment, see Chapter 9, Protocols 1, 3, and 4.


Chemical Modifications 


Another way to enrich for phosphopeptides from complex digests is by replacing the
phosphate group with an affinity moiety such as biotin (Meyer et al. 1993; Oda et al. 2001) and
then using affinity capture (e.g., avidin) to isolate the derivatized peptides. This can be
accomplished by β-elimination of phosphoserine and phosphothreonine residues under
strongly alkaline conditions to yield dehydroalanine or dehydroaminobutyric acid residues,
respectively. This approach allows a simple means for derivatizing serine or threonine with
any desired R-group (Adamczyk et al. 2001; Goshe et al. 2001; Oda et al. 2001). Reagents with
a sulfhydryl group (e.g., 1,2-ethanedithiol) add readily to these α,β-unsaturated amides, and
the free sulfhydryl group in the derivatized peptide is available for reaction with a biotinylated
affinity tag (Goshe et al. 2001; Oda et al. 2001). An additional attractive feature of this
approach is that peptides can be isotopically labeled at their initial phosphorylation sites;
thus, in addition to affinity selection with avidin, the phosphopeptides can be quantitated. A
schematic depiction of this chemical modification method is shown in Figure 1.14. However,
the β-elimination strategy does have some serious drawbacks:


* O-phosphorylation sites suffer from a lack of specificity because O-glycosidic linkages are
also affected by β-elimination under identical conditions (Greis et al. 1996). In some regulatory
proteins, the same serine residue may be either O-phosphorylated or O-glycosylated,
and the β-elimination strategy will not distinguish between them (Wells et al. 2001).


* The strategy is inherently compromised by lower reactivity of phosphothreonine and
nonreactivity of phosphotyrosine.


* Yields from β-elimination reactions tend to be substoichiometric.


FIGURE 1.14. Alternative methods for chemical derivatization and solid-state purification of phosphopeptides. The protocol of
Zhou et al. (2001) (left) modifies phosphopeptides with free sulfhydryls that are then trapped by covalent attachment to
iodoacetic-acid-linked glass beads. Acid elution regenerates phosphopeptides, which are then analyzed by MS. In the protocol of Oda et al.
(2001) (right) phosphoproteins are β-eliminated, biotinylated, and purified by avidin affinity chromatography. Biotinylated peptides
are further purified in a second avidin binding step and analyzed by MS. (Reproduced, with permission, from Ahn and Resing 2001.)
Another chemical method for analyzing the phosphoproteome, which is applicable to
phosphotyrosine-containing peptides as well as those containing phosphoserine and
phosphothreonine residues, has recently been described by Zhou et al. (2001). This approach
begins with a proteolytic digest that has been reduced and alkylated to eliminate
cross-reactivity from cysteine residues. Both the amino and carboxyl termini of each peptide are
chemically blocked, and phosphoramidate adducts at the phosphorylated residues are formed by
carbodiimide condensation with cystamine (see Figure 1.14). Free sulfhydryl groups produced
from this step are covalently captured onto glass beads via coupling to iodoacetic acid.
Peptides are then eluted from the glass beads using trifluoroacetic acid, and regenerated
phosphopeptides are analyzed by MS. Although there are multiple reaction steps, this method
may be amenable to automation and high-throughput.


Analysis of Phosphorylation Sites 


Several methods detect phosphorylation, including traditional methods such as thin-layer
chromatography (TLC) and Edman sequencing, and methods that utilize MS. In contrast to
the MS-based methods, the traditional methods typically require the use of radioactively
labeled samples and some prior knowledge of the protein sequence. Proteins can be
radiolabeled in vivo using [³²P]orthophosphate or in vitro using purified protein kinase and
[γ-³²P]ATP. ³²P-phosphorylated proteins are proteolytically digested (typically using trypsin),
and the generated tryptic peptides are fractionated using 2D TLC or HPLC. Isolated
³²P-labeled tryptic peptides are detected by scintillation counting or autoradiography and
typically subjected to phosphoamino acid analysis and Edman sequencing, monitoring for a loss
of radioactivity at each cycle of the Edman degradation (see Chapter 6, Protocol 5). Because
there is usually insufficient peptide to detect the nonlabeled amino acids at each cycle of the
Edman degradation, phosphorylation sites are deduced by aligning the cycles at which
radioactivity appears with a list of predicted peptides to ascertain those that contain serine,
threonine, or tyrosine at the same position. MS-based methods for phosphorylation site
mapping are summarized in Table 1.6 (for more details, see Chapter 9, Protocols 6-8).


PROTEIN MICROARRAYS


A daunting challenge after a genome has been fully sequenced is to understand the functional
roles of all the encoded proteins. Although mRNA (and DNA) microarrays allow investigators
to track the activity of thousands of genes (and by inference, protein function) on a
global scale (Bowtell and Sambrook 2002), these studies are limited by the poor correlation
between mRNA and protein expression levels. They also fail to take into consideration the
significant roles of PTMs in protein-protein interactions. The success of mRNA and DNA
microarrays has triggered efforts to create similar arrays of proteins, which could be used to
study enzyme-substrate, DNA-protein, and protein-protein interactions on a proteomic
scale. Early efforts to create protein arrays were hindered by the problem of positioning
proteins on a solid surface in a correctly folded, fully active form, as well as the logistics of
synthesizing large numbers of proteins in a high-throughput fashion. The first of these problems
was successfully overcome with the report by MacBeath and Schreiber (2000) of protein
arrays containing more than 10,000 proteins on a piece of glass just over half the size of a
microscope slide. To make their arrays, these authors used a robot that was originally
TABLE 1.6. A selection of MS methods for detecting phosphorylation


MALDI

Post-source decay: Phosphopeptides can lose phosphate under conditions of post-source decay
(PSD). MALDI-PSD is performed on reflectron-equipped MALDI-TOF instruments. The molecular
ion of interest is selected and undergoes PSD in the first field-free region of the instrument;
the reflectron component of the instrument then focuses the fragments such that they are
detected at their correct mass (loss of 80 or 98 daltons from the parent molecular ion is
diagnostic for phosphorylated peptides). Serine and threonine tend to show a predominant
neutral loss of 98 daltons (owing to H₃PO₄ loss) as compared with a loss of 80 daltons (owing
to loss of HPO₃), which can be used to differentiate them from tyrosine phosphopeptides, which
generally show only a loss of 80 daltons. This approach can be valuable if used on peptide
mixtures that are first enriched for phosphopeptides using IMAC resins.
References: Annan and Carr (1996)

Alkaline phosphatase treatment: MALDI analysis of peptides before and after treatment with
alkaline phosphatase can be used to detect phosphopeptides (a characteristic mass shift occurs
owing to the loss of phosphate [80 daltons or multiples] after treatment with phosphatase).
See protocol(s) in Chapter 9.
References: Liao et al. (1994); Zhang et al. (1998); Kussman et al. (1999); Zhou et al.
(2000); Larsen et al. (2001)


Electrospray ionization

In-source collision-induced dissociation (CID): If phosphopeptide ions are fragmented in
negative-ion mode, H₂PO₄⁻ (97 daltons), PO₃⁻ (79 daltons), and PO₂⁻ (63 daltons) are detected
as phosphate-specific diagnostic ions (under low-energy CID, phosphotyrosine will be observed
to generate the 79- and 63-dalton ions, but not the 97-dalton ion).
References: Huddleston et al. (1993); Hunter and Games (1994)

Precursor ion scanning: Peptides carrying a phosphate group can be readily detected by
precursor ion scanning because of the loss of phosphate (79 daltons) under alkaline conditions.
A triple-quadrupole mass spectrometer operating in negative mode is generally used for this
application. This approach is highly selective and sensitive and applicable for serine,
threonine, and tyrosine-phosphorylated residues. One shortcoming is that sequencing must be
conducted in positive-ion mode immediately after detecting the loss of phosphate in
negative-ion mode (this requires a change in polarity of the instrument and rebuffering of the
sample).
References: Carr et al. (1996); Wilm et al. (1996); Neubauer and Mann (1999)


designed to synthesize DNA arrays. The high-precision contact-printing robot dips a
quill-like tip into a well containing a single purified protein and then turns to a glass microscope
slide, where it spots a tiny 1-nl drop (150-200 µm in diameter). Ultimately,
the robot is able to place 1600 spots/cm² on the slide. As proof of principle, MacBeath and
Schreiber demonstrated that members of the protein array were able to specifically bind to
other well-characterized binding partners, small drug-like molecules, and protein kinases,
which chemically modify protein targets. For a set of detailed protocols on establishing and
using protein microarrays, see Chapter 10, Protocols 10-14. An alternative way of arraying
functionally active proteins, using microfabricated polyacrylamide gel pads to capture
protein samples and microelectrophoresis to accelerate diffusion, was reported recently by
Arenkov et al. (2000).
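
The spotting figures quoted above can be checked with a little arithmetic. The sketch below assumes spots laid out on a regular square grid, which is an assumption of this example rather than a detail given in the text; the function names are hypothetical.

```python
import math


def spot_pitch_um(spots_per_cm2):
    """Center-to-center spacing (µm) for a square grid at the given spot density."""
    return 1e4 / math.sqrt(spots_per_cm2)  # 1 cm = 10,000 µm


def array_area_cm2(n_spots, spots_per_cm2):
    """Printed area (cm²) needed to hold n_spots at the given density."""
    return n_spots / spots_per_cm2


density = 1600  # spots per cm², as quoted in the text
print(f"pitch: {spot_pitch_um(density):.0f} µm")
print(f"area for 10,000 spots: {array_area_cm2(10_000, density):.2f} cm²")
```

On these assumptions, the spacing of 250 µm leaves a gap of 50-100 µm between neighboring spots that are 150-200 µm in diameter.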

The second obstacle to protein arrays, the generation of proteome-scale proteins, is cur- 
rently being addressed with significant developments in high-throughput large-scale recom- 
binant protein expression and isolation (see, e.g., Cahill 2000; Braun et al. 2002). 
TABLE 1.7. Protein arrays: Classes of capture molecules


mAb
  Source: mouse
  Technique: hybridoma
  References: Goldman (2000)

scFv/Fab diabodies
  Source: antibody libraries
  Technique: phage display, in vitro evolution
  References: Gao et al. (1999); Ryu and Nam (2000); Krebs et al. (2001); Lecerf et al. (2001);
  Raum et al. (2001)

Affinity binding agents
  Source: recombinant fibronectin structures
  Technique: in vitro evolution
  References: Kreider (2000)

Affibodies
  Source: microorganism
  Technique: heterologous expression
  References: Gunneriusson et al. (1999a,b)

Aptamers (DNA/RNA/peptide)
  Source: library
  Technique: SELEX/mRNA display, in vitro evolution
  References: Jayasena (1999); Brody and Gold (2000); Hoppe-Seyler and Butz (2000); Lee and
  Walt (2000); Lohse and Wright (2001); Wilson et al. (2001)

Receptor ligands
  Source: synthetic
  Technique: combinatorial chemistry
  References: MacBeath et al. (1999); Lee and Walt (2000)

Substrates of enzymes
  Source: synthetic; pro- and eukaryotic organisms
  Technique: protein purification, recombinant protein technology (bacterial fusion proteins,
  baculovirus, peptide synthesis)
  References: Arenkov et al. (2000); MacBeath and Schreiber (2000); Zhu et al. (2000)


This table summarizes classes of molecules that have the potential to be used or are actually used as capture molecules in protein microarray systems.
Abbreviations: (Fab) Antigen-binding fragment; (scFv) single-chain variable region fragment; (mAb) monoclonal antibody.
Reproduced, with permission, from Templin et al. (2002).
FIGURE 1.15. Classes of capture molecules for protein microarrays. For specific interaction analysis,
different classes of molecules can be immobilized on a planar surface to act as capture molecules in a
microarray assay. (a) Antigen-antibody interaction; (b) scheme of a sandwich immunoassay; (c) a specific
protein-protein interaction. (d) A different class of binders, where synthetic molecules referred to as
aptamers act as capture molecules. They can be composed of nucleotides, ribonucleotides, or peptides.
(e) Interactions of enzymes with their specific substrates, where a substrate (S) for kinases is
immobilized and phosphorylated (P) by the respective kinase. (f) A typical example for a receptor-ligand
interaction, where synthetic low-molecular-mass compounds are immobilized as capture molecules.
(Reproduced, with permission, from Templin et al. 2002 [© Elsevier Science].)
CAPTURE MOLECULES AND TARGET MOLECULES


In principle, any type of ligand-binding assay that relies on the formation of an immobilized
"capture molecule" product and a "target molecule" present in a biological matrix can be
miniaturized to run in parallel in a protein microarray format. Although antibodies are the
most prominent capture molecules used to identify target molecules, the labor-intensive
nature (and cost) of monoclonal antibodies has led to the development of alternatives. Of
these, the most promising approach has been phage-display methodology (Ryu and Nam
2000; Lecerf et al. 2001) combined with highly diverse, fully synthetic libraries (>10¹¹
independent clones) to generate synthetic antibodies (Gao et al. 1999; Knappik et al. 2000).
Another class of capture molecules is aptamers (i.e., synthetic molecules such as
DNA/RNA/peptides that act as highly specific capture molecules with nM-pM affinities) (see
Table 1.7 and Figure 1.15) (Jayasena 1999; Brody and Gold 2000; Hoppe-Seyler and Butz
2000; Green et al. 2001; Wilson et al. 2001). For recent reviews of protein microarray
technology, see Templin et al. (2002) and Li (2000).
POLYACRYLAMIDE GEL ELECTROPHORESIS (PAGE) is one of the most reliable methods available
for the separation of proteins in complex mixtures and for assessing protein purity. Other
applications of PAGE in protein structure and proteomics research include the following:


* Determination of protein relative molecular mass (Mᵣ).
* Verification of protein concentration.
* Detection of protein modifications.


* Global protein expression profiling using a combination of isoelectric focusing (IEF) and
SDS-PAGE, i.e., two-dimensional gel electrophoresis (see Chapter 4).


* Peptide mapping using partial proteolytic digestion within the stacking gel (Chapter 7).


* First stage of immunoblotting, i.e., electrophoretic transfer to inert immobilizing matrices
(Chapter 6).


* High-throughput proteomic approaches that entail either direct amino- or carboxy-terminal
sequence analysis (Chapter 6) or a combination of in-gel proteolytic digestion (Chapter
7) and mass spectrometry for protein identification (Chapter 8).


Because the electrophoretic mobilities of individual proteins change disproportionately
as either the pH (Figure 2.1) or the concentration of acrylamide (Figure 2.2) is changed, the
possibility that the single component observed under one set of conditions results from the
accidental coelectrophoresis of two or more proteins can be minimized by running
electrophoresis at different pH values or more than one concentration of acrylamide.

Although the majority of investigators still rely on the original Coomassie Brilliant Blue
and silver staining techniques, as well as on radiolabeling, recent advances in the use of
sensitive protein visualization procedures, especially fluorescent staining methods (see Table 2.1),
have greatly increased the overall sensitivity of protein detection in polyacrylamide gels (for
a recent review of fluorescent staining techniques, see Patton 2000).

This chapter presents some basic PAGE separation methods that have been found to be 
useful for a variety of proteomic purposes. A number of staining procedures are also described, 
FIGURE 2.1. Comparison of electrophoretic mobilities of trypsin at 0°C (ug, points) and acid-base
titration curve of trypsin determined at 20°C (HZ, continuous curve). (Reprinted, with permission, from
Duke et al. 1952.)
FIGURE 2.2. Effect of various concentrations of polyacrylamide (T) on the electrophoretic mobility (u) of
various proteins. (LAC) β-lactoglobulin; (OVA) ovalbumin; (OVM) ovomucoid; (PEP) pepsin; (BSA₁)
bovine serum albumin monomer; (BSA₂) bovine serum albumin dimer; (MYO) myoglobin; (γG)
immunoglobulin G. (Reprinted from Kyte 1995 [originally from Morris 1966].)


including an improved Coomassie staining technique (Wong et al. 2000) that is faster and
more sensitive than the colloidal blue method (Neuhoff et al. 1990) and a silver staining
method that is compatible with mass spectrometry (Sinha et al. 2001).


GENERAL PRINCIPLES OF POLYACRYLAMIDE GEL ELECTROPHORESIS 


Most electrophoretic procedures for resolving proteins in use today are based on zonal or
discontinuous electrophoresis in polyacrylamide gels (Raymond and Weintraub 1959; Davis
1964; Ornstein 1964). Zone electrophoresis in polyacrylamide gels, known as polyacrylamide
gel electrophoresis (PAGE), simultaneously exploits differences in molecular size and charge
for the purposes of fractionation. This technique, which is based on the mobility of charged
solutes in an applied electrical field, is influenced not only by charge, but also by voltage,
distance between the electrodes, size and shape of the molecule, temperature, and time. The
relationship of the mobility of a particle to the electric field, to a first approximation, is
expressed as follows: M = v/E, where M is the mobility of the particle (cm² sec⁻¹ V⁻¹), v is the
migration velocity of the particle (cm sec⁻¹), and E is the electric field strength (V cm⁻¹). For a
more detailed description of electrophoresis theory, see Cantor and Schimmel (1980) and
Kyte (1995).
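
As a small worked example of the relationship M = v/E, the sketch below estimates a mobility from a migration distance, a run time, and an applied voltage. The numbers are invented for illustration, the function names are hypothetical, and the field strength is approximated as the applied voltage divided by the distance between the electrodes, which assumes a uniform field.

```python
def field_strength(voltage_v, electrode_distance_cm):
    """Electric field strength E (V/cm), assuming a uniform field between the electrodes."""
    return voltage_v / electrode_distance_cm


def mobility(velocity_cm_per_s, field_v_per_cm):
    """Electrophoretic mobility M = v/E, in cm^2 s^-1 V^-1."""
    return velocity_cm_per_s / field_v_per_cm


# Hypothetical run: a band migrates 5 cm in 1 hour with 100 V applied across 10 cm.
velocity = 5.0 / 3600.0             # cm/s
field = field_strength(100.0, 10.0)  # V/cm
print(f"E = {field:.1f} V/cm, M = {mobility(velocity, field):.2e} cm^2 s^-1 V^-1")
```

The calculation also shows why voltage and electrode spacing appear in the list of influences above: both change E, and therefore the velocity observed for a particle of a given mobility.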


Polyacrylamide Gels May Be Continuous or Discontinuous 


Polyacrylamide gels may be cast with continuous or discontinuous buffer systems and acry- 
lamide concentrations. Continuous gels, originally described by Weber and Osborn (1969), 
contain a constant acrylamide concentration, whereas Laemmli discontinuous gels (Laemmli 
TABLE 2.1. Fluorescence staining methods for PAGE of proteins


Gel Stains

Nile Red dye
  Excitation maxima/emission maxima (nm): 306, 540/640
  Principal applications: 1D gels; IEF gels; lipoprotein detection; blotting applications
  Features: Fair sensitivity (15-50 ng/band); similar performance as Coomassie Blue staining
  methods

SYPRO Ruby protein gel stain
  Excitation maxima/emission maxima (nm): 280, 450/610
  Principal applications: 1D PAGE, 2D gels; mass spectrometry; Edman sequencing
  Features: Highest sensitivity (1-3 ng/band); better performance than the best silver staining
  methods

SYPRO Ruby IEF protein gel stain
  Excitation maxima/emission maxima (nm): 280, 450/610
  Principal applications: IEF gels; mass spectrometry
  Features: Highest sensitivity (1-2 ng/band); better performance than the best silver staining
  methods

SYPRO Orange protein gel stain
  Excitation maxima/emission maxima (nm): 300, 470/570
  Principal applications: 1D SDS-PAGE; mass spectrometry; Edman sequencing; capillary gel
  electrophoresis
  Features: Good sensitivity (4-10 ng/band); little protein-to-protein variability; better
  performance than colloidal Coomassie Blue staining methods

SYPRO Red protein gel stain
  Excitation maxima/emission maxima (nm): 300, 550/630
  Principal applications: 1D SDS-PAGE; mass spectrometry; Edman sequencing; capillary gel
  electrophoresis
  Features: Good sensitivity (4-10 ng/band); little protein-to-protein variability; better
  performance than colloidal Coomassie Blue staining methods

SYPRO Tangerine protein gel stain
  Excitation maxima/emission maxima (nm): 300, 490/640
  Principal applications: 1D SDS-PAGE; blotting applications; zymography; electroelution; mass
  spectrometry; Edman sequencing
  Features: Good sensitivity (4-10 ng/band); little protein-to-protein variability; no organic
  solvents or acids; better performance than zinc-imidazole reverse staining methods


Blot Stains

SYPRO Ruby protein blot stain
  Excitation maxima/emission maxima (nm): 280, 450/618
  Principal applications: Blotting membranesᵃ; immunodetection (western blotting); mass
  spectrometry; Edman sequencing
  Features: Highest sensitivity (1-2 ng/band); better performance than colloidal gold staining
  methods

SYPRO Rose Plus protein blot stain
  Excitation maxima/emission maxima (nm): 350/610
  Principal applications: Blotting membranes; immunodetection (western blotting); mass
  spectrometry; Edman sequencing
  Features: Highest sensitivity (1-2 ng/band); readily reversible; better performance than
  colloidal gold staining methods

SYPRO Rose protein blot stain
  Excitation maxima/emission maxima (nm): 350ᵇ/590, 615
  Principal applications: Blotting membranesᵃ; immunodetection (western blotting); mass
  spectrometry; Edman sequencing
  Features: Good sensitivity (15-30 ng/band); readily reversible; better performance than
  Amido Black staining methods


Reprinted, with permission, from Patton (2000).
ᵃNitrocellulose or PVDF membranes.
ᵇUV epi-illumination only.


1970) are a composite of a short wide-pore "stacking gel" (3-4 %T) (for a definition of T, see
page 45) layered on top of a long, small-pore "resolving gel" (7-25 %T). The stacking and
resolving gels in the Laemmli system are also discontinuous with respect to buffer composition
(i.e., multiphasic buffers are employed that differ with respect to both pH and ionic
composition). These two parameters, i.e., different buffer and acrylamide compositions,
allow samples of relatively large volumes to be concentrated in the upper stacking gel prior
to their separation upon entering the lower resolving gel. In contrast, continuous gels utilizing
monophasic buffers are limited by the requirement that samples be both highly concentrated
and low in volume.


Electrophoretic Separations Are Designed to Maximize the Resolution of Proteins 


The electrophoretic separation of proteins and peptides on polyacrylamide gels must have as 
high a resolution as possible. This resolution is achieved by using a stacking gel with multi- 
phasic buffer systems (see Figure 2.3) (Chrambach and Rodbard 1971). An important advan- 
tage of the stacking gel is the ability to concentrate proteins from very dilute solutions. This 
process significantly improves resolution of the subsequent separation by shrinking the orig- 
inal sample into very thin, highly concentrated starting zones so that all of the protein mol- 
ecules begin the electrophoretic separation at very nearly the same point. Stacking results 
from the formation of a limited high-voltage gradient or Kohlrausch boundary (Kohlrausch 
1897) in which proteins are confined to a thin and highly concentrated zone of intermediate 
mobility between leading chloride and trailing glycine ions. Briefly, at the pH of the stacking 
gel, the sample ion (typically, a mixture of proteins or their detergent derivatives) is initially 
sandwiched between an upper solution and a lower solution containing ions of the same sign 
as the sample ion. When a current is passed through the mixture, one ion (e.g., chloride) 
migrates faster than the other slower ion (e.g., glycine). The sample ion, being of intermedi- 
ate mobility, is compressed between the chloride and glycine ions, forming a narrow zone or 
band of high concentration. Thus, the stacking process is able to sandwich the proteins into 
a disc or narrow band. For electrophoretic separation to occur, the proteins must be released
from the boundary after they have been stacked. Unstacking (resolution) is accomplished by
abruptly increasing the pH and decreasing the pore size of the separating gel (relative to the 


[Figure 2.3 diagram, top to bottom: electrode buffer, Tris-glycine (pH 8.3); stacking gel, Tris-Cl (pH 6.8), in which proteins stack between Cl- and Gly- ions; resolving gel, Tris-Cl (pH 8.8), in which unstacking occurs; electrode buffer (+), Tris-glycine (pH 8.3).]


FIGURE 2.3. Discontinuous multiphasic gel-buffer electrophoresis. Electrophoresis through the stacking
(pH 6.8) and resolving (pH 8.8) gels of a hypothetical sample containing five proteins (P1-5) is shown as a
function of time. At the start (T0), the proteins in the original sample are in a large volume and at a low pH
(pH 6.8). As the voltage is applied, they are compressed to a small volume, or disc, as they migrate through
the spacer (large pore, ~3% acrylamide) by being sandwiched in the boundary between the upper solution
(Tris-glycine buffer, pH 8.3) and the solution of the original sample and the spacer (Tris-chloride, pH 6.8)
(T1). At some time after stacking is complete (T2), the proteins reach the small-pore gel (5-7% acrylamide)
boundary where changes in their mobilities occur when they encounter both decreased pore size
and a higher pH (Tris-chloride, pH 8.8) (T3). Because of the special viscous properties of the gel, proteins
of equal free mobility but of appreciably different molecular weights (different diffusion constants) will
migrate with markedly different mobilities and will be resolved (T4). (Modified, with permission, from
Makowski and Ramsby 1997 [©Oxford University Press].)


stacking gel). Under these conditions, the sample ions (proteins) migrate into the separating
gel and the trailing (glycine) ions continuously overtake and pass the sample ions. Thus, a
comparatively uniform voltage gradient is established in which electrophoretic separation of
samples occurs (Davis 1964).

Since the initial description of the Laemmli procedure, considerable effort has gone into
trying to optimize these electrophoretic conditions. Foremost has been the development of
the multiphasic zone electrophoretic (MZE) theory by Chrambach and Jovin (1983), which
predicted more than 4200 potentially useful buffer systems (this has now been simplified to
19) for optimizing SDS-PAGE. For a practical description of PAGE, see Shi and Jackowski
(1998), Makowski and Ramsby (1997), and Hames (1990).


The Polyacrylamide Gel Matrix Can Be Formed with a Wide Range of Pore Sizes


Polyacrylamide gels are formed from the polymerization of monomeric acrylamide
(CH2=CHCONH2) and the bifunctional cross-linking monomer N,N'-methylenebisacrylamide
(CH2=CHCONHCH2NHCOCH=CH2) as shown in Figure 2.4. Polymerization
of acrylamide and bisacrylamide is initiated by either chemical (e.g., a combination of
TEMED [tetramethylethylenediamine] and ammonium persulfate) or photochemical free-


[Figure 2.4 diagram: structures of the acrylamide and bis-acrylamide monomers and of the cross-linked polyacrylamide gel.]


FIGURE 2.4. Chemical structures of acrylamide, bisacrylamide, and polyacrylamide. (Adapted, with
permission, from Hames 1998 [©Oxford University Press].)


radical-generating systems such as riboflavin-5'-phosphate or methylene blue. Persulfate free
radicals, which form when ammonium persulfate is dissolved in H2O, activate the acrylamide
monomer, and the electron carrier TEMED catalyzes this reaction. Riboflavin can also
generate free radicals in the presence of oxygen and UV light as a result of photodecomposition.
Riboflavin and ammonium persulfate are sometimes used in combination to generate free
radicals. Polyacrylamide polymers can be formed with a wide range of pore sizes by varying
the amount of total acrylamide used per unit volume and the degree of cross-linkage. For
instance, as the proportion of cross-linker is increased, the pore size decreases. It should be
noted that the rate of polymerization also affects the gel pore size. Increasing the rate of
polymerization (e.g., by increasing the temperature of polymerization) can produce a gel with a
small-pore structure, whereas lowering the polymerization temperature can produce
large-pore gels. Factors that influence the efficiency of monomer-polymer conversion can also
affect gel pore size. For example, the use of poor-quality acrylamide and extremes of pH can
result in greater gel porosity due to incomplete conversion of monomer to polymer. Gel
additives can also markedly influence gel pore structure. On the one hand, urea (e.g., 8 M urea is
often added to gels to fractionate oligopeptides in the range of 1,200-10,000 daltons) (Swank
and Munkres 1971) causes the formation of small-pore-size gels by accelerating
persulfate-driven polymerization of monomeric acrylamide to near completion. On the other hand,
inclusion of the polymer polyethylene glycol in the gel solution results in macroporous gel
formation due to lateral acrylamide chain formation (Righetti 1995).


Gel Pore Size Is the Major Factor in Determining Protein and Polypeptide 
Resolution in Polyacrylamide Gels 


Polyacrylamide gels range from soft (low polyacrylamide concentration) to hard (high
polyacrylamide concentration), with the properties of acrylamide mixtures being defined by the
letters T and C (Hjertén 1962). T denotes the total percentage concentration of the
monomers (acrylamide plus N,N'-methylenebisacrylamide) in grams per 100 ml, and C
denotes the percentage (by weight) of the cross-linker N,N'-methylenebisacrylamide relative
to the total amount of monomers. In general, the pore size is inversely proportional to %T.


%T = {[acrylamide (grams) + bisacrylamide (grams)]/100 ml} x 100
%C = {bisacrylamide (grams)/[acrylamide (grams) + bisacrylamide (grams)]} x 100
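
The two definitions above can be checked with a few lines of arithmetic. The following is a minimal sketch, not part of the original protocol; the function names are illustrative, and the input values (29.22 g of acrylamide and 0.78 g of bisacrylamide per 100 ml) are taken from the acrylamide stock recipe given in Protocol 1.

```python
def percent_t(acrylamide_g: float, bis_g: float, volume_ml: float) -> float:
    """Total monomer concentration (%T): grams of monomer per 100 ml."""
    return (acrylamide_g + bis_g) / volume_ml * 100.0


def percent_c(acrylamide_g: float, bis_g: float) -> float:
    """Cross-linker as a percentage (by weight) of total monomer (%C)."""
    return bis_g / (acrylamide_g + bis_g) * 100.0


# Values from the acrylamide stock recipe in Protocol 1 (assumed final volume: 100 ml).
t = percent_t(29.22, 0.78, 100.0)
c = percent_c(29.22, 0.78)
print(f"%T = {t:.1f}, %C = {c:.1f}")  # prints "%T = 30.0, %C = 2.6"
```

The result matches the label of that stock solution, 30%T (2.6%C).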


Judicious selection of acrylamide gel concentration is critical for optimal separation of
proteins by zone electrophoresis. For a Laemmli gel (Laemmli 1970), T is equal to 10% and
C is equal to 2.6%. A guide for choosing the appropriate acrylamide concentration for a
uniform gel for separating a particular molecular-weight range of sample proteins is given in
Table 2.2. Soft gels with concentrations of acrylamide <3% are almost liquid and very
difficult to handle unless they are strengthened by the inclusion of 0.5% agarose. At the other end
of the spectrum, high-concentration acrylamide gels (polyacrylamide gels will form up to
~35% acrylamide) are also very difficult to handle due to their brittleness and are therefore
of limited usefulness.

A major limitation of uniform acrylamide concentration gels is their restricted molecu- 
lar separation range. For example, gels with a uniform pore size large enough to resolve large 
proteins are unlikely to resolve small polypeptides as well. This limitation can be circum- 
vented by using pore gradient gels whose pore size changes from the top to the bottom of the 
gel, thus enabling the separation of proteins and polypeptides over a larger molecular-weight 
range and with higher resolution than a uniform concentration gel. Although more difficult 
to prepare, gradient gels resolve wider ranges of protein relative molecular weights. In practice,
the effective range of a polyacrylamide gel is 5-20% for a uniform gel concentration and


TABLE 2.2. Protein and peptide molecular-weight separation guidelines for polyacrylamide gels

Acrylamide concentration
%T            %C            Mr range of sample polypeptides

Uniform acrylamide concentration gels
[illegible]   [illegible]   23,000 to 300,000
10            [illegible]   15,000 to [illegible]
10            3             1,000 to 100,000 (a)
15            2.6           12,000 to 50,000

Gradient acrylamide gels
3.3-12        [illegible]   14,500 to [illegible] (b)
3-30          8.4           13,000 to 1,000,000
5-20          2.6           14,000 to 276,000
8-15          1.0           14,000 to 330,000

Reproduced, with permission, from Shi and Jackowski (1998).
(a) When using tricine instead of glycine in running buffer (Schagger and von Jagow 1987).
This list is by no means exhaustive and other gel ranges may also be used.
(b) Granzier and Wang (1993).


4-20% for a gradient concentration gel. Table 2.2 provides a guide for choosing a gradient gel 
range appropriate for the size of the proteins being fractionated. Although the effective 
resolving range of a polyacrylamide gel is primarily determined by the pore size of the gel, 
other factors can have important roles, including: 

* Buffer composition: Tricine (pK 8.15), for example, instead of glycine (pK 9.6) in the
running buffer facilitates the resolution of small peptides at high acrylamide concentrations
(Schagger and von Jagow 1987).


* pH: The sieving properties of SDS-PAGE (see below), for example, can be altered by a subtle
change of pH of the resolving gel, which does not require extensive knowledge of MZE
theory (Makowski and Ramsby 1993, 1997). For instance, by increasing the pH from 8.9 to 9.2, the
linear calibration range of a 10% gel can be extended to include low-Mr proteins of 10-20K
without sacrificing the resolution of high-Mr proteins as occurs with high-percentage gels.


* Gel additives: For example, the detergents cetyltrimethylammonium bromide (CTAB),
sulfobetaine 3-12, 3-[(3-cholamidopropyl)dimethylammonio]-1-propanesulfonate
(CHAPS), Triton X-100 (Caglio et al. 1994), and SDS (Caglio et al. 1994; Sobieszek 1994)
and the polymers dextran, ficoll, and polyvinylpyrrolidone (PVP) (Gersten and Bijwaard
1992a,b) inhibit polymerization of acrylamide to varying extents, resulting in large pore
sizes. The inclusion of such additives provides an additional parameter to PAGE, which can
be useful in resolving complex mixtures of proteins, especially large-Mr proteins.


SDS IS USED TO CREATE DENATURING POLYACRYLAMIDE GELS 


An important refinement of PAGE that has gained widespread acceptance is the use of SDS
in polyacrylamide electrophoresis as a simple and powerful way to dissociate proteins into
individual chains and separate them according to their molecular weights (Shapiro et al.
1967). The following are probably the most widely cited SDS-PAGE procedures:


* Continuous buffer system (normally a phosphate-phosphate system) with a uniform gel
(Weber and Osborn 1969).

* Laemmli (Tris-glycine-SDS system; Laemmli 1970) and Neville (borate-sulfate-SDS system;
Neville 1971) modifications of the discontinuous buffer system (Davis 1964; Ornstein
1964) with a uniform gel.

In recent years, continuous and discontinuous buffer systems with gradient gels have
been developed to improve protein and polypeptide resolution, and precast gels are now 
commercially available. Protocol 1 provides details for preparing, running, and storing dis- 
continuous SDS-PAGE gels. Several protein visualization procedures are described here, 
including a rapid, high-temperature Coomassie Blue method (Protocol 3) and a modified sil- 
ver staining method that is compatible with mass spectrometry-based protein identification 
(Protocol 7). The classic Coomassie Blue protein-staining procedure and a rapid variant 
using a commercially available stain-impregnated paper are detailed in Protocols 2 and 4, 
respectively. Finally, two protocols using the fluorescent stains SYPRO Ruby and SYPRO 
Orange are described in Protocols 5 and 6. Additional protocols for staining gels and visual- 
izing proteins are described in Chapters 4 and 7. For a brief description of the history and 
chemical structure of Coomassie Brilliant Blue, see the panel on COOMASSIE BRILLIANT BLUE. 
For a description of possible problems that may arise during the preparation and use of SDS- 
PAGE, see Table 2.6 (Protocol 1). 

Denaturation and binding of the anionic detergent SDS to proteins and peptides gener- 
ally result in a relatively uniform negative charge, because most proteins bind similar 
amounts of SDS (~1.4 g of SDS per gram of polypeptide; Reynolds and Tanford 1970) and 
become highly negatively charged by the addition of strongly acidic sulfonic acid groups. By 
these means, possible changes in electrophoretic behavior of different tertiary protein struc- 
tures are negated since complete unfolding of proteins occurs by SDS binding. Polypeptide 
chains studied by PAGE in the presence of SDS have been shown to migrate according to their 
molecular weights (Weber and Osborn 1969). Because SDS-protein complexes have elec- 
trophoretic mobilities in polyacrylamide gels that are generally directly proportional to the 
logarithm of the length of the polypeptide chain (see Figure 2.5), the molecular weight of an 
unknown protein can be determined to within 10% of its true value by comparing its
electrophoretic mobility with that of a set of standard proteins. A procedure for determining the
apparent molecular mass of a protein using SDS-PAGE is given in Protocol 8. 
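
The comparison with standard proteins described above amounts to fitting a calibration line and interpolating. The sketch below is illustrative only and is not the procedure of Protocol 8: it assumes that log10(Mr) varies linearly with relative mobility (here called Rf, a name introduced for this example) over the range of the standards, and the marker values in `STANDARDS` are hypothetical placeholders, not measured data.

```python
import math


def fit_line(xs, ys):
    """Ordinary least-squares fit of y = slope * x + intercept."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def estimate_mr(rf_unknown, standards):
    """Interpolate Mr from relative mobility; standards is [(Mr, Rf), ...]."""
    rfs = [rf for _, rf in standards]
    log_mrs = [math.log10(mr) for mr, _ in standards]
    slope, intercept = fit_line(rfs, log_mrs)
    return 10 ** (slope * rf_unknown + intercept)


# Hypothetical marker set (Mr, Rf); replace with values measured on your own gel.
STANDARDS = [
    (97000, 0.12), (66000, 0.25), (45000, 0.40),
    (31000, 0.55), (21500, 0.70), (14400, 0.86),
]

print(f"Estimated Mr: {estimate_mr(0.48, STANDARDS):,.0f}")
```

A straight-line fit is only trustworthy within the range spanned by the standards; proteins that migrate anomalously (see below) will give misleading estimates however good the fit.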


COOMASSIE BRILLIANT BLUE


Coomassie Blue R250 and G250 are still the most popular chemicals used for the staining
of protein bands in polyacrylamide gels. Coomassie Blue is an aminotriarylmethane dye
that forms strong but not covalent complexes with proteins, most probably by a
combination of van der Waals forces and electrostatic interactions with NH3+ groups. Despite its
name, Coomassie Brilliant Blue is classified as a magenta dye and is used to stain proteins
on dot blots and after electrophoresis through polyacrylamide gels. At the pH used for
detection and assay of proteins (~pH 1), Coomassie Blue is mostly in its highly
protonated reddish form, with smaller proportions of green and blue forms, which are all in
equilibrium with each other. Only the neutral (blue) form of the dye binds strongly to proteins.
The uptake of dye is approximately proportional to the amount of protein, following the
Beer-Lambert law. Under normal conditions of staining, Coomassie Blue binds strongly to
arginine and lysine residues and with lower affinity to aromatic side chains. The number of
strong binding sites varies widely from protein to protein (from 13 to 7100) and correlates
with the content of basic amino acids, rather than the total mass of the protein. The molar
absorbance of the bound dye varies widely between proteins, from ~34,000 mole^-1 cm^-1
(β-lactoglobulin) to ~58,000 mole^-1 cm^-1 (glutamate dehydrogenase).


Historical Footnote: Coomassie dyes date back to early last century, originally being developed 
for the staining of woolen jumpers. They were named to commemorate the 1896 British occu- 
pation of the capital Kumasie (Coomassie) of the Ashanti tribe, in what has now become mod- 
ern-day Ghana (presumably, the name was given when colonization was still in fashion!). 
Coomassie Brilliant Blue R250 (R stands for red and G for greenish blue) was first used as a
laboratory reagent to stain proteins in 1963. (Chemically, Coomassie Blue G250 is the same as
Coomassie Brilliant Blue R250, but with extra methyl groups.) Robert Webster, a graduate stu- 
dent in the laboratory of Stephen Fazekas de St. Groth at the Australian National University in 
Canberra, was searching for a way to locate influenza virus proteins that had been separated by 
electrophoresis on cellulose acetate strips. At that time, Australia had a thriving wool industry 
and government laboratories were intensively investigating the mechanism of action of various 
classes of dyes used for wool-dyeing. Fazekas and Webster reasoned that these dyes must have a
high affinity for proteins and they obtained samples of a great many dyes from the 
Commonwealth Scientific and Industrial Organization. Included among them was Coomassie 
Brilliant Blue R250, which had been used since the turn of the century in the textile dyeing
industry. Webster soon found that Coomassie Brilliant Blue R250 was a very sensitive stain for 
proteins, but he was frustrated by extreme day-to-day variation in the intensity of the staining. 
At home one night, he suddenly realized that the answer to the problem was to fix the protein 
before staining. He went back to the laboratory and fixed the separated influenza virus proteins 
with sulfosalicylic acid. After these results were published (Fazekas de St. Groth et al. 1963), the 
method was rapidly adapted to stain proteins separated by electrophoresis through polyacry- 
lamide gels (Meyer and Lamberts 1965). To this day, staining with Coomassie Brilliant Blue 
remains a standard method to detect, visualize, and quantitate fixed proteins. 

Because Coomassie Brilliant Blue R250 is now a trademark of Imperial Chemical Industries 
PLC, the dye is generally listed in biochemical catalogs as Brilliant Blue. Two forms of the dye 
are available: Brilliant Blue G and Brilliant Blue R, which are given different numbers (42655 
and 42660) in the Colour Index, a kind of dyer's Bible, in which dyes are classified and arranged 
according to color. Brilliant Blue G is less soluble in water and alcohols than Coomassie Blue
R250. Both dyes stain fixed proteins efficiently. They are used at a concentration of 0.05-0.1%
in methanol-glacial acetic acid-water (50:10:40 v/v). The dyes should be dissolved in methanol
before acetic acid and water are added. 


Adapted, in part, from GRADIPRINT™ issue No. 6, August 1988 (with permission) and background provided by Joe Sambrook.


Some Proteins Behave Anomalously on SDS-PAGE


There are many reports of proteins, especially glycoproteins, behaving anomalously on
SDS-PAGE gels even when SDS and thiol reagent are in excess. Artifactually high Mr estimates
often result, probably due to these proteins not binding the normal amount of SDS (Segrest
and Jackson 1972; Hames 1990), which results in reduced net charge and lower mobilities
during electrophoresis. However, this problem can be alleviated by determining the Mr of
glycoproteins at a number of polyacrylamide gel concentrations to yield an asymptotic Mr that
approximates the real molecular weight (Segrest and Jackson 1972) or, alternatively, by using
polyacrylamide concentration gradient gels (Hames 1990). Proteins with high negative
charge (Kaufman et al. 1984) and positive charge (e.g., histones) or rich in proline residues
(e.g., collagenous proteins) all yield abnormally high Mr values using SDS-PAGE (Hames
1990).


CTAB Can Replace SDS in Some Denaturing PAGE Applications 


Although the anionic detergent SDS is usually the choice for denaturing gel electrophoresis, 
other detergents have been developed for more specialized purposes. For example, neutral 
detergents such as Triton X-100 and Nonidet P-40, as well as some zwitterionic detergents 
(e.g., CHAPS) have been utilized in isoelectric focusing and two-dimensional gel
electrophoresis because they do not alter the charge of proteins and facilitate solubilization (see
Chapter 4). 

SDS has drawbacks in some situations. For example, SDS can cause protein aggregation 
and precipitation as well as abnormal protein migration leading to poor resolution, and it 
crystallizes at low temperatures (Shi and Jackowski 1998). These limitations of SDS have led 
to the development of cationic detergent solubilization and PAGE (Mócz and Bálint 1984; 
Akin et al. 1985; MacFarlane 1989; Atkins et al. 1992). A method for cationic detergent PAGE
using CTAB (Atkins et al. 1992) is given in Protocol 9.


ACETIC ACID-UREA PAGE SEPARATES PROTEINS ON THE BASIS 
OF THEIR SIZE AND CHARGE 


In SDS-PAGE, proteins are fractionated on the basis of their molecular masses by the sieving 
effect of the polyacrylamide gel matrix. However, this system is of limited value for separating
proteins of very similar molecular masses that contain posttranslational modifications
such as acetylation, methylation, and phosphorylation (e.g., histone variants). To analyze 
such modified proteins, PAGE systems have been used in which both molecular size and 
charge of the protein are involved in the separation process. Although the prevailing pH in 
such a system may be any value, a value of 3 is typically used. At this pH, the protein will be 
positively charged and will migrate toward the cathode in an electric field. (The pKa values of
the carboxyl side chains of aspartic acid and glutamic acid are 3.8 and 4.2, respectively, so the 
negative charge contribution from these amino acids to the overall charge of a protein at pH 
3 will be minimal.) For this reason, acid-urea PAGE is commonly employed to separate pro- 
teins of similar size but different charge (Panyim and Chalkley 1969; Harwig et al. 1993). A 
simple acid-urea PAGE procedure is given in Protocol 10. 
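
The parenthetical remark about carboxyl side chains can be made quantitative with the Henderson-Hasselbalch relation. The sketch below is an illustration under simplifying assumptions: it uses the side-chain pKa values quoted above (3.8 and 4.2) and ignores shifts in pKa caused by the local protein environment or by urea. The function name is introduced here for the example.

```python
def fraction_ionized(pka: float, ph: float) -> float:
    """Fraction of an acidic group in its deprotonated (negatively charged) form."""
    return 1.0 / (1.0 + 10.0 ** (pka - ph))


# Side-chain pKa values as quoted in the text; pH 3 is the typical acid-urea gel pH.
for residue, pka in (("Asp", 3.8), ("Glu", 4.2)):
    print(f"{residue}: {fraction_ionized(pka, 3.0):.1%} negatively charged at pH 3")
```

With these numbers, roughly 14% of aspartate and 6% of glutamate side chains carry a negative charge at pH 3, compared with essentially all of them at neutral pH.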


TRICINE SDS-PAGE PROVIDES EXCELLENT SEPARATION OF SMALL POLYPEPTIDES


Using the Laemmli discontinuous system (Laemmli 1970), the mobilities of small proteins
(e.g., Mr <14K) in SDS-PAGE may no longer be proportional to their Mr when the protein
charge properties become significant relative to their size. Although gradient gels in the
3-30% range improve resolution over a broader Mr range, they still cannot fully resolve
oligopeptides with Mr <10K. This has led to the development of SDS-PAGE systems capable
of extending the resolution of oligopeptides into the low-Mr range. Swank and Munkres
(1971) modified the SDS-PAGE method by including 8 M urea in a continuous 0.1 M
Tris-phosphate (pH 6.8) buffer system, resulting in the resolution of oligopeptides with Mr as low
as 2.5K. In a separate approach, a discontinuous buffer system in which the trailing ion in the
Laemmli system (glycine) is replaced by tricine has been developed by Schagger and von
Jagow (1987). Because tricine migrates much faster than glycine at the usual pH values used
in a stacking gel, the stacking limit is shifted toward the low-Mr range so that small
SDS-oligopeptide complexes are well separated from SDS. By these means, oligopeptides in the
range Mr 1-100K can be resolved using acrylamide concentrations as low as 10%. The tricine
SDS-PAGE method is described in Protocol 11.
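
Why tricine outruns glycine in the stacking gel can be illustrated by comparing how much of each trailing ion is in its anionic (mobile) form. The sketch below is a rough illustration, not a full treatment of MZE theory: it assumes that effective mobility scales with the anionic fraction given by the Henderson-Hasselbalch relation, uses the pK values quoted earlier in this chapter (glycine 9.6, tricine 8.15), and ignores differences in the intrinsic mobility of the two ions.

```python
def anion_fraction(pk: float, ph: float) -> float:
    """Fraction of the trailing ion present in its negatively charged form."""
    return 1.0 / (1.0 + 10.0 ** (pk - ph))


TRAILING_IONS = {"glycine": 9.6, "tricine": 8.15}  # pK values quoted in the text

# pH 6.8 is the stacking gel; pH 8.8 is the resolving gel.
for ph in (6.8, 8.8):
    for name, pk in TRAILING_IONS.items():
        print(f"pH {ph}: {name} is {anion_fraction(pk, ph):.2%} anionic")

ratio = anion_fraction(8.15, 6.8) / anion_fraction(9.6, 6.8)
print(f"Anionic fraction at pH 6.8, tricine relative to glycine: {ratio:.0f}-fold")
```

Under these assumptions, tricine is roughly 27-fold more ionized than glycine at pH 6.8. The same calculation shows glycine rising from well under 1% anionic at pH 6.8 to about 14% at pH 8.8, which is consistent with the unstacking behavior described for the Laemmli system.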


NONDENATURING PAGE (NATIVE GEL ELECTROPHORESIS) SEPARATES 
NATIVE PROTEINS AND PROTEIN COMPLEXES 


Although SDS-PAGE is probably the most widely used electrophoretic system for analyzing 
proteins, it should be emphasized that this method separates denatured proteins. Thus, SDS- 
PAGE cannot be used to separate intact protein complexes or native proteins whose biologi- 
cal activity must be retained in the gel for subsequent functional testing such as enzyme activ- 
ity, receptor binding, and antibody binding. In these situations, it is necessary to use
nondenaturing gel electrophoresis, also called native gel electrophoresis, which separates proteins
on the basis of their size and charge properties. Proteins that are more highly charged at the
pH of the separating gel have a greater migration rate in the gel, whereas the acrylamide pore
size (dictated by the acrylamide concentration used and the acrylamide:bisacrylamide ratio)
serves to sieve proteins of different sizes. Variables that affect nondenaturing gel elec- 
trophoresis include: 


* Charge: The charges of proteins differ widely at different pH values. Therefore, choosing a
buffer in a native gel system that will provide optimal resolution depends on the specific
proteins under investigation. The further the electrophoresis buffer pH is from the pI
values of the proteins to be analyzed, the higher the charge on the proteins. This leads to
shorter times required for electrophoretic separation as well as reduced band diffusion.
However, the closer the pH of the electrophoresis buffer to the pI values of the proteins
under investigation, the greater the charge differences between proteins and the greater the
likelihood of accomplishing separation of the proteins being analyzed. Hence, conditions
of optimal resolution of proteins in native gels must be determined by trial and error. Most
proteins are negatively charged at pH 8.8, which is the common pH used for native gel
electrophoresis. Alternatively, nondenaturing gel electrophoresis may be carried out at low pH,
in which case the anode and cathode should be reversed. Although the continuous and
discontinuous buffer systems described for SDS-PAGE can be applied to nondenaturing gel
electrophoresis, the discontinuous systems yield the higher resolution for most native
proteins. It should be noted that some native proteins aggregate and may precipitate at the high
protein concentrations accomplished in the stacking gel of the discontinuous gel system.
Under these conditions, such proteins either fail to migrate into the resolving gel or cause
"streaking." Using a continuous buffer system may circumvent this problem. In general,
almost any buffer with a pH between 3 and 10 and a concentration between 0.01 and 0.1 M
may be used for native gel electrophoresis. Caution must be exercised in choosing an
appropriate buffer system because extreme pH conditions may denature the protein of
interest with subsequent loss of biological activity. Similarly, some protein complexes may
require the presence of a metal ion or reducing agent to retain activity, which must be
added to the electrophoresis buffer. Typical buffer systems in the pH range of 3.8-10.2 used
for native gel electrophoresis include Tris-glycine (pH 8.3-9.5), Tris-borate (pH 7.0-8.5),
Tris-acetate (pH 7.2-8.5), Tris-citrate (pH 7.0-8.5), and β-alanine-acetate (pH 4-5); for
other useful continuous buffer systems, see McLellan (1982).


* Ionic strength: Buffer ionic strength has a crucial role in electrophoresis, and values
typically used are in the range of 0.01-0.1 M. If the ionic strength is too low in native gel
systems, proteins may aggregate nonspecifically, whereas high ionic strengths will result in
increased heat generation during electrophoresis.


* Acrylamide gel porosity: Acrylamide concentrations and acrylamide:bisacrylamide ratios 
may be varied to achieve different sieving effects and to optimize protein separation (see 
Table 2.2). 


* Temperature: Nondenaturing gel electrophoresis is typically performed at 0-4°C in order 
to minimize loss of biological activity due to protein denaturation and/or proteolytic 
degradation. 


A high-pH (pH 8.8) discontinuous native gel PAGE method is described in Protocol 12. 
For a low-pH (pH 3.8) method, see Shi and Jackowski (1998). 


PROTOCOL 1


SDS-PAGE of Proteins


PROBABLY THE MOST WIDELY USED METHOD FOR ANALYZING MIXTURES of proteins is SDS-PAGE.
In this technique, proteins are reacted with the anionic detergent SDS (or sodium lauryl
sulfate) to form negatively charged complexes. Initial heating of the protein sample at 95°C in
the presence of excess SDS and a thiol reagent (e.g., 2-mercaptoethanol or dithiothreitol)
denatures the protein mixture and disrupts any disulfide bonds. Under these conditions, all
reduced polypeptides bind the same amount of SDS on a weight basis (1.4 g of SDS per gram
of polypeptide) independent of amino acid composition and sequence. The resolving power
of SDS-PAGE is greatly enhanced by preceding the protein separation phase with a "stacking
gel," which employs the principles of isotachophoresis to concentrate samples from
relatively large volumes into very small bands (or zones), but does not separate them. In the
separating gel, the negatively charged SDS-protein complexes migrate through the sieve-like
polyacrylamide matrix and are separated solely on molecular-weight differences. SDS-PAGE of
proteins can be performed in either "tube" or "slab gel" systems. The slab gel system is more
widely used for the following reasons:


* Multiple samples can be resolved on the same gel, thereby allowing a better comparison of
test and control samples.


* Molecular weights can be more accurately determined.


* Slab gels lend themselves easily to manipulations such as scanning, drying,
autoradiography, electroblotting, and photography.


Electrophoresis equipment for preparing and using slab gels comes in a fantastic array of
sizes and styles to meet all budgets and tastes. This protocol describes the preparation and use
of SDS-PAGE employing a full-size slab gel and a discontinuous buffer system.

This protocol was contributed by Hong Ji and Robert Goode (Joint Proteomics
Laboratory of the Ludwig Institute for Cancer Research and Walter and Eliza Hall Institute
of Medical Research, Melbourne, Australia).


MATERIALS


» Reagents


CAUTION: See Appendix 3 for appropriate handling of materials marked with <!>.
IMPORTANT: All reagents should be electrophoresis grade or better.


Buffers for discontinuous buffer system (Laemmli 1970)
4x Stacking gel buffer (0.5 M Tris-HCl at pH 6.8)
Dissolve 60.5 g of Tris base in 850 ml of H2O and adjust pH to 6.8 with 6 M HCl <!>. Readjust
pH to 6.8 at room temperature. Add H2O to 1000 ml and store the buffer at 4°C.
4x Resolving gel buffer (1.5 M Tris-HCl at pH 8.8)
Dissolve 181.5 g of Tris base in 850 ml of H2O and adjust pH to 8.8 with 6 M HCl. Cool the
solution to room temperature and readjust pH to 8.8. Add H2O to 1000 ml and store at 4°C.
10x Running buffer
Dissolve 30.0 g of Tris base, 144.0 g of glycine <!>, and 10.0 g of SDS in 1000 ml of H2O. The pH
of the buffer should be 8.3 and no pH adjustment is required. Store the running buffer at room
temperature and dilute to 1x before use.
2x SDS-PAGE sample buffer 
Mix: 
2.0 ml of 4x stacking gel buffer 
1.6 ml of glycerol 
3.2 m] of 1096 SDS 
0.8 ml of 2-mercapteethanol <!> 
0.4 ml of 1.0% bromophenol blue «t» 
n-butanol (H,Q-saturated) <!> 
Combine 50 ml of n-butanol or isobutanol «t» (or isopropanol <!>) and 50 ml of H,O in a glass 
bottle and shake. Use the top phase to overlay gels. Store at room temperature indefinitely. 
Methanol <!> 
Polymerization initiator solutions 
The three polymerization systems are photopolymerization, methylene blue polymerization, and,
the most commonly used, chemical polymerization. The two chemical polymerization initiators in
this system are ammonium persulfate and TEMED (N,N,N',N'-tetramethylethylenediamine).
Ammonium persulfate solution (10%) <!>
Dissolve 1.0 g of ammonium persulfate in 10 ml of H2O.
Because ammonium persulfate is very hygroscopic and begins to degrade immediately after being
dissolved in H2O, this solution should be made fresh daily.
TEMED <!>
TEMED can be used as supplied.
Although this chemical is subject to oxidation and is very hygroscopic, it can be stored in a
tightly closed, dark glass bottle for at least 6 months at room temperature.
SDS (10%) stock solution <!>
Dissolve 10 g of SDS in 80 ml of H2O, and then add H2O to 100 ml. This stock solution is stable for
6 months at room temperature.
30%T (2.6%C) Acrylamide stock solution <!>
Add 29.22 g of acrylamide and 0.78 g of bisacrylamide <!> to 100 ml of H2O. Filter the stock
solution through Whatman filter paper and store at 4°C. Prepare fresh stock acrylamide solution every
few weeks.
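
The nominal concentrations in the reagent names can be checked against the weighed amounts. The following is a minimal sketch, not part of the original protocol: the molar masses are standard values that the protocol does not state, the function names are illustrative, and the acrylamide stock is assumed to have a final volume of exactly 100 ml (the recipe adds the solids to 100 ml of H2O, so the true volume is slightly larger).

```python
# Standard molar masses (g/mol); these values are assumptions not given in the protocol.
TRIS_BASE_MW = 121.14
GLYCINE_MW = 75.07


def molarity(grams, molar_mass, litres):
    """Return the molar concentration of a solute dissolved to a final volume."""
    return grams / molar_mass / litres


def percent_t_c(acrylamide_g, bis_g, volume_ml):
    """Return (%T, %C): total monomer in g per 100 ml, and cross-linker as % of monomer."""
    total_g = acrylamide_g + bis_g
    return 100.0 * total_g / volume_ml, 100.0 * bis_g / total_g


# 4x stacking and resolving gel buffers, each made up to 1000 ml.
stacking_m = molarity(60.5, TRIS_BASE_MW, 1.0)
resolving_m = molarity(181.5, TRIS_BASE_MW, 1.0)

# 10x running buffer diluted to 1x, expressed in mM.
running_tris_1x_mm = molarity(30.0, TRIS_BASE_MW, 1.0) / 10 * 1000
running_glycine_1x_mm = molarity(144.0, GLYCINE_MW, 1.0) / 10 * 1000

# Acrylamide stock: 29.22 g acrylamide + 0.78 g bisacrylamide, assumed 100 ml final volume.
stock_t, stock_c = percent_t_c(29.22, 0.78, 100.0)

print(f"stacking buffer  ~{stacking_m:.2f} M Tris")
print(f"resolving buffer ~{resolving_m:.2f} M Tris")
print(f"1x running buffer ~{running_tris_1x_mm:.0f} mM Tris, ~{running_glycine_1x_mm:.0f} mM glycine")
print(f"acrylamide stock {stock_t:.1f}%T, {stock_c:.1f}%C")
```

Under these assumptions the weighed amounts reproduce the 0.5 M and 1.5 M Tris labels and the 30%T (2.6%C) designation of the stock.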


» Equipment 
Centrifuge 
Electrophoresis apparatus 
For a list of commercially available slab gel systems and supplier details, see Table 2.3.
Gel-loading pipette tips
Glass plates
Heat block or water bath preset to 95°C
PCC-54 detergent (Pierce) or equivalent 
This detergent is a cleaning solution designed for washing glass plates. 


Power supply 
A power supply capable of providing ~200 V and 500 mA (either constant voltage or constant cur- 
rent) is sufficient for SDS-PAGE. A number of power supplies are available commercially, e.g., Bio- 
Rad Power Pac 3000 (3000 V and 400 mA). 


» Biological Sample


Protein solution(s) or pellets of cellular proteins 
For instance, as prepared in Chapter 3. 
TABLE 2.3. Some commercially available slab gel systems for performing SDS-PAGE

System name                                             Supplier              Gel size (cm x cm)
Xcell II Mini-Cell                                      Novex                 10 x 10
Hoefer SE 600 Ruby Standard Dual Cooled Vertical Unit   Amersham Bioscience   18 x 16
Hoefer SE 600 Standard Dual Cooled Vertical Unit        Amersham Bioscience   18 x 16
Hoefer SE 660 Standard Dual Cooled Vertical Unit        Amersham Bioscience   18 x 24
Hoefer miniVE Vertical Electrophoresis System           Amersham Bioscience   10 x 8 to 10 x 10.5
Hoefer SE 250 Mighty Small II Mini-Vertical Unit        Amersham Bioscience   10 x 8
Hoefer SE 260 Mighty Small II Mini-Vertical Unit        Amersham Bioscience   10 x 10.5
Protean II xi Cell (16 cm)                              Bio-Rad               16 x 16
Protean II xi Cell (20 cm)                              Bio-Rad               16 x 20
Protean II XL Cell                                      Bio-Rad               18.5 x 20
Criterion Mini Vertical Cell                            Bio-Rad               13.3 x 8.7
Mini-PROTEAN 3 Electrophoresis Module                   Bio-Rad               8 x 7

URL addresses: Amersham Bioscience (http://www.apbiotech.com), Bio-Rad (http://www.bio-rad.com), Novex
(http://www.invitrogen.com).


Additional Reagents


The final step of this protocol requires the reagents listed in any one of the protein
visualization protocols, Protocols 2-7.


METHOD


Pouring a Slab Gel 


1. Clean the glass plates. 


a. Soak the glass plates in 2% PCC-54 cleaning solution for 3-24 hours.


b. Rinse the plates with tap water thoroughly and then once with distilled H2O.


c. Dry the glass plates with clean tissue paper and then clean them with Kimwipes
soaked in methanol. Dry the plates in the air.


2. Assemble the gel-casting unit.


a. Form the gel sandwich by assembling the spacers and two glass plates in the clamps.


b. Align the bottom part of the spacers and two glass plates at the same level and then
tighten the clamp.


c. Place the gel sandwich onto the casting stand.


d. Insert the Teflon sample application comb and mark the glass plate at a level ~1.0-1.5
cm below the bottom of the comb teeth.


The Protean II xi electrophoresis unit (Bio-Rad) is used as an example to illustrate the
assembling of the gel casting unit. For photographs of the assembled casting unit, see
Figure 2.6a-f.
3. Pour the resolving gel.


a. Using Table 2.2 (see introduction to this chapter) as a guide, decide on an acrylamide
concentration for the gel.


b. Prepare the appropriate resolving gel mixture using the recipes in Table 2.4. Make
sure that the solution is well mixed before adding the TEMED.


c. Use a 10-ml pipette to transfer the mixture to the glass-plate sandwich up to the
marked level (marked in Step 2d) (see Figure 2.6).


d. Carefully overlay the gel with a ~2-mm-deep layer of H2O or H2O-saturated
n-butanol or isopropanol solution.


The overlay keeps air, which inhibits polymerization of the acrylamide, from reaching
the gel and ensures that the gel surface is flat.


e. After polymerization is complete (~30 minutes), pour off the overlaying H2O and
carefully remove any remaining liquid with filter paper without damaging the gel
surface. If the gel is overlaid with n-butanol (or isopropanol), drain the overlay liquid,
and then wash the gel surface with H2O.


Polymerization of the gel is evidenced by a clear refractive index change that can be 
seen between the gel and the overlay liquid. 


4. Pour the stacking gel. 


a. Select an acrylamide concentration for the stacking gel and make the appropriate
mixture, using the recipes in Table 2.5. Make sure that the solution is well mixed. 


b. Carefully overlay the resolving gel with the stacking gel solution until the height of the 
stacking gel is ~2.0-3.0 cm. 


TABLE 2.4. Resolving gel recipes

For making 100 ml of resolving gel mixture(a)

        Acrylamide     Resolving gel   H2O      10% SDS   10% Ammonium         TEMED(c)
%T      (30.0%) (ml)   buffer (ml)     (ml)     (ml)      persulfate(b) (ml)   (ml)
 8.0    26.67          25.0            46.53    1.0       0.75                 0.05
 9.0    30.00          25.0            43.20    1.0       0.75                 0.05
10.0    33.33          25.0            39.87    1.0       0.75                 0.05
11.0    36.67          25.0            36.53    1.0       0.75                 0.05
12.0    40.00          25.0            33.20    1.0       0.75                 0.05
13.0    43.33          25.0            29.87    1.0       0.75                 0.05
14.0    46.67          25.0            26.53    1.0       0.75                 0.05
15.0    50.00          25.0            23.20    1.0       0.75                 0.05
16.0    53.33          25.0            19.87    1.0       0.75                 0.05
17.0    56.67          25.0            16.53    1.0       0.75                 0.05
18.0    60.00          25.0            13.20    1.0       0.75                 0.05
19.0    63.33          25.0             9.87    1.0       0.75                 0.05
20.0    66.67          25.0             6.53    1.0       0.75                 0.05

(a) Enough for four 1.0-mm thick Bio-Rad Protean II xi gels (160 x 130 x 1 mm).
(b) The 10% ammonium persulfate solution must be made fresh.
(c) Add TEMED just before pouring the gel.
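
The rows of Table 2.4 follow a simple proportion: the acrylamide stock volume scales with the desired %T, the buffer, SDS, persulfate, and TEMED volumes are fixed, and H2O makes up the remainder. The sketch below reproduces that arithmetic for intermediate percentages or other batch sizes. It is an illustration only; the function name and the scaling to volumes other than 100 ml are assumptions, not part of the published table.

```python
def resolving_gel_recipe(percent_t, total_ml=100.0, stock_percent_t=30.0):
    """Return component volumes (ml) for a resolving gel of the given %T.

    Fixed volumes per 100 ml follow Table 2.4: 25.0 ml resolving gel buffer,
    1.0 ml 10% SDS, 0.75 ml 10% ammonium persulfate, 0.05 ml TEMED.
    """
    scale = total_ml / 100.0
    acrylamide = percent_t / stock_percent_t * total_ml
    buffer_ml = 25.0 * scale
    sds = 1.0 * scale
    aps = 0.75 * scale
    temed = 0.05 * scale
    water = total_ml - (acrylamide + buffer_ml + sds + aps + temed)
    if water < 0:
        raise ValueError("requested %T is too high for this stock concentration")
    return {
        "acrylamide_stock_ml": round(acrylamide, 2),
        "resolving_buffer_ml": round(buffer_ml, 2),
        "water_ml": round(water, 2),
        "sds_10pct_ml": round(sds, 2),
        "aps_10pct_ml": round(aps, 2),
        "temed_ml": round(temed, 3),
    }


if __name__ == "__main__":
    for pct in (8.0, 12.0, 20.0):
        print(pct, resolving_gel_recipe(pct))
```

For 8%, 12%, and 20% gels the computed acrylamide and H2O volumes agree with the corresponding table rows (26.67/46.53, 40.00/33.20, and 66.67/6.53 ml).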
FIGURE 2.6. Preparation of a vertical SDS-polyacrylamide gel for the separation of proteins.

TABLE 2.5. Stacking gel mixture

For making 40 ml of stacking gel mixture(a)

        Acrylamide 30.0%   Stacking gel   H2O      10% SDS      10% Ammonium         TEMED(c)
%T      stock (ml)         buffer (ml)    (ml)     stock (ml)   persulfate(b) (ml)   (ml)
3.0     4.00               10.0           25.36    0.4          0.2                  0.04
4.0     5.36               10.0           24.00    0.4          0.2                  0.04
5.0     6.66               10.0           22.70    0.4          0.2                  0.04

(a) Sufficient for four 1.0-mm thick Bio-Rad Protean II xi gels (160 x 150 x 1 mm).
(b) The 10% ammonium persulfate solution must be made fresh.
(c) Add TEMED just before pouring the gel.


c. Insert the Teflon comb into this solution, leaving 1.0-1.5 cm between the top of the 
resolving gel and the bottom of the comb. Make sure that no air bubbles are trapped 
beneath the teeth of the comb. 


Insert the comb into the stacking gel at an angle to reduce the chance of trapping air 
bubbles under the comb’s teeth. Trapped bubbles can be released by gently tapping on 
the glass plate near the trapped bubbles. 


d. Allow the stacking gel mixture to polymerize for ~2 hours. Refractive index changes
around the comb indicate that the gel has set. It is useful at this stage to mark the
positions of the bottoms of the sample wells on the glass plates with a marker pen.


5. Carefully remove the sample comb from the stacking gel, and assemble the cassette in the 
electrophoresis apparatus according to the manufacturer's instructions. 


6. Fill the top reservoir with running buffer, ensuring that the buffer fully fills the sample
loading wells, and look for any leaks from the top tank. If there are no leaks, fill the
bottom tank with running buffer, then tilt the apparatus to dispel any bubbles caught under
the gel.


Alternatively, remove trapped air bubbles by squirting running buffer across the bottom 
edge of the gel through a long bent needle or hooked Pasteur pipette. The gel is now ready 
to receive protein samples. 


Preparation of Samples 


Carry out either Step 7 or Step 8. 


7. Prepare protein solutions: 


a. Mix the protein solution with 2x SDS-PAGE sample buffer in a 1:1 ratio. Although
under ideal conditions the binding ratio for SDS and polypeptide is 1.4 g of SDS per
gram of polypeptide, to ensure enough SDS is present, the concentration of protein
in the final solution should not be higher than 10 µg/µl.


EXPERIMENTAL TIP: To load the entire protein sample onto the gel, bear in mind
that the sample volume (i.e., protein solution and sample buffer) should not exceed
the volume of the wells.


b. Heat the samples in a heat block or water bath for 2 minutes at 95°C to denature the 
proteins and ensure the maximum amount of SDS binding to the proteins. Allow the 
samples to cool to room temperature. Remove insoluble materials by centrifugation. 
8. Prepare total cellular protein samples:


a. Loosen the prepared cell pellet by vortexing the pellet briefly.
b. Add 2x SDS-PAGE sample buffer directly to the cell pellet and vortex.


The resultant cellular protein lysate is highly viscous. The amount of sample buffer to 
add varies depending on the cell line under study. However, a good amount to use is 
100 ul of 2x sample buffer per 1 x 10° cells. The optimal amount must be determined 
empirically. 

EXPERIMENTAL TIP: To load the entire protein sample onto the gel, bear in mind
that the sample volume (i.e., protein pellet and sample buffer) should not exceed the
volume of the wells.


c. Centrifuge the cellular protein lysate at 100,000g for 20 minutes and collect the
supernatant.


Protein concentration for a certain number of cells differs slightly for different cell
lines. If the protein concentration exceeds 10 µg/µl, add SDS powder. Addition of SDS
to 5% final concentration will not interfere with the electrophoretic separation.


d. Heat the samples in a heat block or water bath for 2 minutes at 95°C to denature the
proteins and ensure the maximum amount of SDS binding to the proteins. Allow the
samples to cool to room temperature. Remove insoluble materials by centrifugation.
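

The 10 µg/µl protein limit in Steps 7a and 8c can be related to the 1.4 g/g SDS binding ratio with a short calculation. The sketch below derives the SDS concentration of the 2x sample buffer from its recipe and then the SDS:protein mass ratio after 1:1 mixing. It assumes that the component volumes are additive and that the protein solution itself contains no SDS; the function names are illustrative only.

```python
SDS_BINDING_RATIO = 1.4  # g of SDS bound per g of polypeptide (Step 7a)

# 2x SDS-PAGE sample buffer recipe, in ml (from the Reagents list).
SAMPLE_BUFFER_ML = {
    "4x stacking gel buffer": 2.0,
    "glycerol": 1.6,
    "10% SDS": 3.2,
    "2-mercaptoethanol": 0.8,
    "1.0% bromophenol blue": 0.4,
}


def sds_in_2x_buffer_ug_per_ul():
    """SDS concentration of the 2x buffer; 10% (w/v) SDS is 0.10 g/ml."""
    total_ml = sum(SAMPLE_BUFFER_ML.values())
    sds_g = SAMPLE_BUFFER_ML["10% SDS"] * 0.10
    return sds_g / total_ml * 1000.0  # g/ml equals 1000 µg/µl


def sds_to_protein_ratio(final_protein_ug_per_ul):
    """SDS:protein mass ratio after mixing sample and 2x buffer 1:1."""
    final_sds = sds_in_2x_buffer_ug_per_ul() / 2.0
    return final_sds / final_protein_ug_per_ul


if __name__ == "__main__":
    ratio = sds_to_protein_ratio(10.0)
    print(f"SDS in 2x buffer: {sds_in_2x_buffer_ug_per_ul():.0f} µg/µl")
    print(f"SDS:protein at 10 µg/µl protein: {ratio:.1f} g/g")
    print("sufficient" if ratio >= SDS_BINDING_RATIO else "add more SDS")
```

Under these assumptions the 2x buffer contains about 40 µg/µl SDS, so at the 10 µg/µl protein limit the final mixture holds roughly 2 g of SDS per gram of protein, a modest excess over the 1.4 g/g binding ratio.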


Running a Discontinuous Slab Gel 


9. Use a pipette and gel-loading pipette tips to load the samples into the sample well. 


10. Connect the power supply to the electrophoresis apparatus with the anode (+) linked 
with the bottom reservoir and the cathode (-) connected to the upper reservoir. 


11. Apply a constant voltage of 200 V at 8°C (with cooling system connected), or 90 V at room
temperature, until the bromophenol blue dye front reaches the bottom of the gel. This
will take about 6-8 hours at constant 200 V and ~16-18 hours at constant 90 V.


12. Turn off the power supply and disconnect the electrodes. Remove the gel plates from the 
apparatus and carefully remove a spacer. Use the spacer to gently pry the gel plates apart, 
leaving the gel stuck to one plate. 


13. Visualize proteins using an appropriately sensitive staining method. 


For commonly used protocols that are compatible with mass-spectrometry-based protein 
identification procedures, see Protocols 2 (Coomassie Blue) and 7 (silver stain). 
ALTERNATIVE PROTOCOL: POURING LINEAR GRADIENT GELS WITH THE BIO-RAD 385 GRADIENT FORMER

The preparation of fixed-concentration SDS-PAGE gels is described in Protocol 1. However, the use of linear
gradient gels, which are polyacrylamide gels having a gradient of increasing acrylamide concentration (and
decreasing pore size), can have advantages over fixed-concentration gels. First, a much greater range of protein
Mr values can be separated on a linear gradient gel than on a fixed-concentration gel. Second, there is a greater
likelihood of resolving proteins with very similar Mr values on gradient gels than on fixed-concentration gels. The
most common gradient gel contains 4-20% acrylamide [...]. The range of acrylamide concentration should
be chosen on the basis of the size of the proteins [...].


Additional Materials

CAUTION: See Appendix 3 for appropriate handling of materials marked with <!>.

Gel casting chamber with <100 ml capacity
Gradient former (Model 385, Bio-Rad)
Isobutanol <!>
Magnetic stirrer
Peristaltic pump capable of delivering 16 ml/min
Stir bar (1-inch magnetic flea)
30.8%T (2.6%C) Acrylamide stock solution <!>
Add 60 g of acrylamide and 1.6 g of bisacrylamide <!> to 200 ml of H2O. Filter the stock solution through Whatman filter
paper and store at 4°C. Prepare fresh stock acrylamide solution every few weeks.


Method

1. Calculate the volume of solution [...].
a. [...] gel spacer thickness by [...]

2. Deliver the gel solution to the casting chamber.

For pouring the gel from the top:
a. Set up the gel sandwich as in Figure 2.6.
b. Cut the tubing to minimize the distance between the stopcock opening and the pump and from the pump to
the gel sandwich. Attach the gradient former and the peristaltic [...] tubing to the center of the glass plate,
above the final level of the gel.
c. Place the stirring bar into the mixing chamber.
d. Make up appropriate light and heavy acrylamide solutions for suitable sample separation, without the
polymerization initiator solutions (see Tables 2.2 and 2.4). De-gas these if de[...].
e. Add the polymerization initiators to the light solution, quickly mix, and pour it into the reservoir
chamber. Make sure that the valve stem is closed.
f. Add the polymerization initiators to the heavy solution, mix, and add to the mixing chamber.
g. Start the magnetic stirring motor and adjust the speed so that the bottom of the vortex [...] by the
stirring motion is of equal [...] to the light s[...].
h. Open the valve stem and stopcock and turn on the peristaltic pump with [...] to pump the
entire gradient solution into the casting chamber in <10 minutes (e.g., 10 ml/minute).
i. Run the pump until all of the solution is pumped into the sandwich.
j. Remove the tubing from the gel casting chamber and place it in a waste receptacle.
k. Overlay the gel with H2O-saturated n-butanol.
l. Fill each chamber with H2O [...] maximum speed to flush out the [...].

For pouring the gel from the bottom:
a. If using a gel casting chamber with a gasket at the bottom, attach the tubing from the pump to this gasket.
b. Follow the directions above up to Step h, except place the light solution in the mixing chamber and the
heavy solution in the reservoir.
c. When all of the acrylamide solution has passed into the casting chamber, stop the pump, [...]
not to allow air bubbles to enter the casting chamber.
d. Complete Steps [...].
INTRODUCTION TO PROTOCOLS 2 THROUGH 7


Protein Visualization Procedures 


THE MOST WIDELY USED METHODS FOR VISUALIZING PROTEINS in polyacrylamide gels and on
electroblot membranes use labels that are fixed to the protein molecules after gel
electrophoresis, but rely on noncovalent interactions. Although the contrast of these methods is
theoretically lower than could be achieved with covalent binding, this is readily
compensated by the availability of a much higher number of interaction sites (for
a review, see Rabilloud 2000). Because noncovalent interactions are generally reversible, these
staining methods are compatible with a variety of proteomic applications, including chemi- 
cal amino- and carboxy-terminal sequencing methods (see Chapter 6) and mass spectro- 
metry (see Chapter 8). Detection of proteins after gel electrophoresis is typically carried out 
directly in the polyacrylamide gel or on the electroblot membrane. The visualization labels 
commonly used for noncovalent binding to proteins can be categorized as follows: 


* Organic dyes. Examples are Coomassie Brilliant Blue and Amido Black. Detection is by 
light absorption. 


* Fluorescent probes. Examples are Sypro dyes and Nile red. Detection is by fluorescence.


* Metal ion binding. This category can be further divided into those that are visualized by
differential salt binding (e.g., negative or reverse staining with zinc-imidazole) and those 
detected by metal ion reduction, such as silver stains. 


DETECTION WITH ORGANIC DYES


Conventional Coomassie Brilliant Blue R250 staining of polyacrylamide gels in an aqueous
solution containing methanol and acetic acid remains the most popular approach for detect- 
ing proteins separated by acrylamide gels (Protocol 2). Due to the high extinction coefficient 
and high affinity of Coomassie Blue dyes for proteins, the detection limits are ~30-300 ng. A 
number of improvements in the Coomassie Blue staining procedure have led to an increased 
sensitivity of detection of proteins in polyacrylamide gels. Perhaps the most significant 
improvements are the rapid Coomassie Blue staining method described in Protocol 3 and the 
development of colloidal Coomassie Blue staining for background-free detection of proteins 
in polyacrylamide gels (Neuhoff et al. 1988, 1990). 


FLUORESCENT PROBES 


Organic fluorophore probes are virtually nonfluorescent in aqueous solutions, but they 
become highly fluorescent in nonpolar solvents or upon association with SDS-protein
complexes, such as those encountered in SDS-PAGE. Typical fluorophore probes of this type,
which are gaining popularity in integrated proteomics platforms for global analysis of pro- 
tein expression (Patton 2000), include the SYPRO probes SYPRO Ruby (Protocol 5) and 
SYPRO Orange (Protocol 6). Both of these dyes are commercially available electrophoresis 
stains that can detect proteins in SDS-PAGE gels using a simple, one-step staining procedure. 
The limits of sensitivity of the SYPRO dyes are in the 2-10 ng range, rivaling the sensitivity 
of standard silver staining methods (Patton 2001). 


DETECTION BY METAL ION BINDING 


Metal ions permit protein visualization either by differential salt binding or by metal ion 
reduction. The zinc-imidazole reverse staining method is based on the slower precipitation 
of protein-salt complexes as compared to the precipitation of zinc-dodecyl salts free within 
the gel. The result is translucent protein bands within a darker background (see Chapter 7,
Protocol 10). The most common protein detection method that utilizes metal ion reduction
is silver staining. The sensitivity of silver staining with current protocols (see Chapter 4,
Protocol 11 and Chapter 7, Protocol 11) is in the low-nanogram range, which is ~50-100
times more sensitive than conventional Coomassie Brilliant Blue staining, 10 times more
sensitive than colloidal Coomassie Brilliant Blue staining, and approximately twice as sensitive
as zinc-imidazole staining (Rabilloud 2000). It should be emphasized that conventional
silver staining procedures are not compatible with proteomic identification methods involving
spectrometry procedures, owing to protein cross-linking by formaldehyde during the protein 
fixation process. Although the silver staining method can be modified to omit aldehydes that 
preclude further analysis (see Protocol 7 and Chapter 4, Protocol 12), the modifications sig- 
nificantly reduce the sensitivity compared to the conventional silver staining methods. For 
additional information on the visualization of gel-separated proteins, see Chapters 4 and 7. 


PROTOCOL 2


Conventional Coomassie Blue Staining


COOMASSIE BRILLIANT BLUE R250 (CBR-250) AND SILVER STAINING are the most widely used
methods for the routine visualization of proteins separated by SDS-PAGE. Proteins stained
with CBR-250 are compatible with subsequent amino- and carboxy-terminal sequence
determination (Chapter 6) and mass spectrometric analysis (Chapter 8). CBR-250 is an
organic dye that complexes with basic amino acids, such as arginine, lysine, and histidine, as
well as tyrosine (de Moreno et al. 1986; for review of CBR-250 staining and other staining
procedures, see Rabilloud [2000] and Patton [2000, 2001]). Conventional CBR-250 staining
is capable of detecting as little as 30-100 ng of protein, but sensitivity can be improved by
performing the staining and destaining at elevated temperatures (Protocol 3) or by using
colloidal Coomassie Blue staining (Chapter 4, Protocol 10). For additional information
about Coomassie Blue, see the panel on COOMASSIE BRILLIANT BLUE in the introduction to this
chapter.


MATERIALS


CAUTION: See Appendix 3 for appropriate handling of materials marked with <!>.


» Reagents


Acetic acid (5%) <!>
See Step 5.

Coomassie Brilliant Blue (CBR-250) stain
To 400 ml of H2O, add 500 ml of methanol <!>, 100 ml of acetic acid, and 1 g of CBR-250. Stir this
stock for 4 hours to overnight to dissolve the Coomassie powder <!>, and then filter with Whatman
filter paper.

Destain
To 810 ml of H2O, add 120 ml of methanol and 70 ml of acetic acid.


» Equipment


Plastic container (standard Tupperware sized to fit gel)


» Biological Sample


Protein samples separated on one- and two-dimensional gels by electrophoresis
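

The stain and destain recipes above are written for 1-liter batches, whereas the method uses about 100 ml per minigel. The sketch below expresses each recipe as volume percentages and scales it to a smaller batch. It assumes that the component volumes are simply additive (not strictly true for methanol and water), and the helper names are illustrative rather than part of the protocol.

```python
def percent_composition(volumes_ml):
    """Return each component as a percentage (v/v) of the summed volume."""
    total = sum(volumes_ml.values())
    return {name: round(100.0 * v / total, 1) for name, v in volumes_ml.items()}


def scale_recipe(volumes_ml, target_ml):
    """Scale a recipe proportionally so that the volumes sum to target_ml."""
    factor = target_ml / sum(volumes_ml.values())
    return {name: round(v * factor, 1) for name, v in volumes_ml.items()}


# 1-liter recipes from the Reagents list.
STAIN_ML = {"H2O": 400.0, "methanol": 500.0, "acetic acid": 100.0}
STAIN_DYE_G = 1.0  # CBR-250 per liter of stain
DESTAIN_ML = {"H2O": 810.0, "methanol": 120.0, "acetic acid": 70.0}

stain_pct = percent_composition(STAIN_ML)
destain_pct = percent_composition(DESTAIN_ML)
stain_100ml = scale_recipe(STAIN_ML, 100.0)
dye_100ml_g = STAIN_DYE_G * 100.0 / sum(STAIN_ML.values())

print("stain:", stain_pct, f"+ {dye_100ml_g:.1f} g CBR-250 per 100 ml")
print("destain:", destain_pct)
print("100-ml stain batch:", stain_100ml)
```

Expressed this way, the stain is 50% methanol and 10% acetic acid with 0.1% (w/v) CBR-250, and the destain is 12% methanol and 7% acetic acid.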
METHOD


EXPERIMENTAL TIP: Reuse of the reagents is not recommended because evaporation and contami- 
nation can reduce their effectiveness. 


1. After electrophoresis is complete, place the gel in the plastic container containing enough 
CBR-250 stain to cover the gel. For example, use 100 ml of CBR-250 stain for a typical 
minigel (8 x 10 cm or 8 x 8 cm). 


2. Agitate the gel for 5-20 minutes at room temperature. 
3. Discard the used CBR-250 stain. 


4. Add an appropriate amount of destain and agitate with a piece of Kimwipe in the solu- 
tion to absorb excess dye until a suitable background is achieved. 


5. Store the gel in 5% acetic acid or dry in the air. To air-dry the gel, place it between two
wetted sheets of cellophane, pull the cellophane taut, and allow the gel to dry undis- 
turbed. Air-dried gels can be stored indefinitely. 


PROTOCOL 3


Rapid Coomassie Blue Staining


THIS METHOD, WHICH IS A MODIFIED VERSION OF THE CONVENTIONAL COOMASSIE protocol
(Protocol 2), speeds up the destaining process for faster results with increased sensitivity and
is compatible with mass-spectrometry-based methods for identifying proteins. This protocol
is adapted from Wong et al. (2000).


MATERIALS


CAUTION: See Appendix 3 for appropriate handling of materials marked with <!>.


» Reagents


Reagent A
Dissolve 0.5 g of CBR-250 <!> in 250 ml of isopropanol <!>, 100 ml of acetic acid <!>, and 650 ml
of H2O.


Reagent B
Dissolve 0.05 g of CBR-250 in 100 ml of isopropanol, 100 ml of acetic acid, and 800 ml of H2O.


Reagent C
Dissolve 0.02 g of CBR-250 in 100 ml of acetic acid and 900 ml of H2O.


Reagent D
Mix 100 ml of acetic acid with 900 ml of H2O.


» Equipment


Microwave oven with 1000 W output
Microwave-safe plastic container (standard Tupperware with a hole punched in the lid to
vent gases)


» Biological Sample


Protein samples separated on a one-dimensional gel by electrophoresis
METHOD


EXPERIMENTAL TIP: Reuse of the reagents is not recommended because evaporation and contami- 
nation can reduce their effectiveness. 


Staining the Gel 


1. After electrophoresis is complete, place the gel in a microwave-safe plastic container
containing enough Reagent A to cover the gel. For example, use 100 ml of Reagent A for a
typical minigel (8 x 10 cm or 8 x 8 cm).


2. Heat the gel in a microwave oven on full power until the solution is boiling (~2 minutes).
3. Cool the gel for 5 minutes at room temperature with gentle shaking.


4. Discard the used Reagent A and rinse the gel briefly with H2O. At this stage, bands
containing more than 100 ng of proteins can be visualized despite the blue background.


5. Add 100 ml of Reagent B and heat the gel in a microwave oven on full power until the
solution is boiling (~80 seconds).


6. Discard the hot Reagent B and rinse the gel with H2O. After this step, bands containing
more than 50 ng of proteins can be visualized.


7. Add 100 ml of Reagent C and heat the gel in a microwave oven on full power until the
solution reaches the boiling point (~80 seconds).


8. Discard the hot Reagent C and rinse the gel with H2O. After this step, bands containing
more than 25 ng of proteins can be visualized.


Destaining the Gel 


9. Add 100 ml of Reagent D and heat the gel in a microwave oven on full power until the 
solution reaches the boiling point (~80 seconds). 


10. Place a piece of Kimwipe in the solution to absorb excess dye. 


11. Cool the gel for 5 minutes at room temperature. At this stage, bands containing 5 ng of 
proteins or more can be visualized. 


12. Repeat Steps 9-11 twice more, or shake the gel in Reagent D for 15 minutes or more at 
room temperature. At this point, some protein bands containing as little as 2.5 ng of pro- 
teins can be visualized. 
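

The detection limits quoted at successive steps of this protocol can be collected into a small lookup that reports the earliest stage at which a band of a given protein load should become visible. This is only a sketch built from the figures stated in Steps 4, 6, 8, 11, and 12; the function name and data layout are illustrative, and real visibility will depend on the protein and the gel.

```python
# (stage label, detection limit in ng, True if the limit itself counts as visible)
# Steps 4, 6 and 8 say "more than"; Step 11 says "5 ng ... or more";
# Step 12 says "as little as 2.5 ng".
STAGES = [
    ("after Reagent A (Step 4)", 100.0, False),
    ("after Reagent B (Step 6)", 50.0, False),
    ("after Reagent C (Step 8)", 25.0, False),
    ("after first destain in Reagent D (Step 11)", 5.0, True),
    ("after repeated destaining (Step 12)", 2.5, True),
]


def first_stage_showing(band_ng):
    """Return the earliest stage at which a band of band_ng is expected to be visible."""
    for label, limit_ng, inclusive in STAGES:
        visible = band_ng >= limit_ng if inclusive else band_ng > limit_ng
        if visible:
            return label
    return None  # below the sensitivity quoted for this protocol


if __name__ == "__main__":
    for ng in (150, 60, 5, 2.5, 1):
        print(ng, "ng:", first_stage_showing(ng))
```

A band that returns `None` falls below the ~2.5 ng limit quoted in Step 12 and would call for a more sensitive stain.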


PROTOCOL 4


InstaStain Blue Gel Paper Stain


THIS COMMERCIALLY AVAILABLE PROCEDURE IS YET ANOTHER MODIFICATION of the standard
Coomassie protocol. The method uses Coomassie dye dried onto paper to apply the stain to
the gel. It obviates the problems associated with handling liquid stains and can reduce
staining background. However, in our experience, the paper is prone to sticking to the gel if the
system is overheated. This staining method is adapted from the instructions included in the
Pierce InstaStain Blue Gel Stain Paper and is compatible with mass-spectrometry-based
protein identification procedures.


MATERIALS


CAUTION: See Appendix 3 for appropriate handling of materials marked with <!>.


» Reagents


Reagent A
Mix 400 ml of methanol <!> with 500 ml of H2O, and add 100 ml of acetic acid <!>.


» Equipment


Filter paper (3) cut to gel size (e.g., 3MM paper, Whatman)
Glass plates (2)
Glass rod
InstaStain Blue Gel Stain Paper (Pierce) cut to the size of the gel
Microwave oven with 1000 W output
Plastic container (standard Tupperware sized to fit gel)


» Biological Sample


Protein samples separated on a one-dimensional gel by electrophoresis


METHOD


1. After electrophoresis is complete, place the gel in a plastic container containing enough
H2O to cover the gel. For example, use 100 ml of H2O for a typical minigel (8 x 10 cm or
8 x 8 cm). Gently shake the gel for 5 minutes.


2. Repeat the H2O wash two more times (the total wash time is 15 minutes).


3. Remove the H2O and add enough Reagent A to cover the gel. Shake the gel for 10 minutes
at room temperature.


4. Remove the gel from Reagent A and place it on a glass plate. Make sure that it remains
moist.


5. Place the InstaStain Blue Gel Stain Paper onto the top of the gel, with the blue side of the
paper facing down. Roll out any air bubbles with a glass rod.


6. Moisten the three pieces of filter paper with Reagent A and place them on top of the
InstaStain Blue Gel Stain Paper. Roll out any air bubbles with a glass rod.


7. Place the second glass plate on top of the filter papers.


8. Place the gel/paper/glass sandwich into the microwave oven and heat on high until it is
lukewarm (~8 seconds). Make sure that the gel is not cooked, as it will stick to the
InstaStain Blue Gel Stain Paper.


9. Remove the top glass plate, the filter papers, and the InstaStain Blue Gel Stain Paper, and
place the gel into a plastic container in enough Reagent A to cover the gel.


EXPERIMENTAL TIP: If the gel sticks to the Stain Paper, place the gel and Stain Paper into
Reagent A and shake until the paper can be peeled off the gel, and then place the gel into
fresh Reagent A.


10. Shake the gel for 1 hour at room temperature.


11. Discard Reagent A and wash the gel in H2O for at least 1 hour at room temperature, or
until the background is as desired. This should allow visualization of bands containing
~20 ng of protein.


The gel may be stored overnight in H2O without loss of sensitivity or band intensity.


PROTOCOL 5 


SYPRO Ruby Fluorescent Staining 


BY FAR, SYPRO RUBY IS PERHAPS THE MOST SENSITIVE FLUORESCENT staining technique in
common use for detecting polyacrylamide-gel-resolved proteins (1-2 ng protein/band).
Unfortunately, at this low level of detection, stained protein bands are invisible to the naked 
eye, and a fluorescence scanner is required for their detection (the excitation maximum and 
emission maxima for SYPRO Ruby are 280 nm and 450/610 nm, respectively). Bands con- 
taining ~50 ng of protein can be seen with the naked eye with the aid of a UV transillumi- 
nator. This protocol has been adapted from the SYPRO Ruby staining protocol provided by 
Genomic Solutions. The method is also mass-spectrometer-compatible. For a review of flu- 
orescent detection technologies, including imaging devices and detection instrumentation, 
see Patton (2000).


MATERIALS


CAUTION: See Appendix 3 for appropriate handling of materials marked with <!>.


» Reagents


Glycerol (2% v/v)
Mix 20 ml of glycerol into 980 ml of H2O.


Reagent A
Mix 400 ml of methanol <!> with 500 ml of H2O, and add 100 ml of acetic acid <!>.


Reagent B
Mix 100 ml of methanol with 840 ml of H2O, and add 60 ml of acetic acid.


SYPRO Ruby protein gel stain <!>


» Equipment
Aluminum foil 
Fluorescence scanner (Typhoon 8600, Molecular Dynamics or equivalent) 
Plastic container (standard Tupperware sized to fit gel) 


» Biological Sample 
Protein samples separated on a one-dimensional gel by electrophoresis 
METHOD


EXPERIMENTAL TIP: Reuse of the reagents is not recommended because evaporation and
contamination can reduce their effectiveness.


IMPORTANT: At all stages of this protocol, when handling the gel, use powder-free gloves because
powder from the gloves fluoresces.


1. After electrophoresis is complete, place the gel in a plastic container containing enough
Reagent A to cover the gel. For example, use 100 ml of Reagent A for a typical minigel (8
x 10 cm or 8 x 8 cm).


2. Shake the gel for 30 minutes at room temperature.


3. Discard the used Reagent A, add enough SYPRO Ruby stain to fully cover the gel,
cover it with aluminum foil, and shake the gel for between 90 minutes and overnight at
room temperature.


New SYPRO Ruby stain requires ~2 hours staining to detect 1-ng bands on a fluorescence
scanner (Molecular Dynamics Typhoon 8600 or equivalent). Completing Step 3 with used
stain for 2 hours reduces the detection sensitivity to bands containing ~8 ng of protein.


4. Discard the SYPRO Ruby stain (or store for later use, see note to Step 3).


5. Add an appropriate amount of Reagent B to the gel and shake it for 30-60 minutes at
room temperature.


6. Use a fluorescence scanner set at 532 nm to scan the gel, or store the gel covered in an
appropriate amount of 2% glycerol.


The gel may be stored in 2% glycerol for up to 6 months without any fading of the
fluorescent signal.


7. For longer-term storage, preserve the gel by drying using a gel dryer or dry in the air. To
air-dry the gel, place it between two wetted sheets of cellophane, pull the cellophane taut,
and allow the gel to dry undisturbed. Air-dried gels can be stored indefinitely.


PROTOCOL 6 


SYPRO Orange Fluorescent Staining 


THIS METHOD USES A FLUORESCENT DYE that is very sensitive to protein (4-10 ng
protein/band). SYPRO Orange cannot be visualized with the naked eye and thus a
fluorescence scanner is required for detection of protein bands (e.g., Molecular Dynamics Typhoon
8600 or equivalent; the excitation maximum [Ex] and emission maxima [Em] for SYPRO
Orange are 300 nm and 470/570 nm, respectively). This same protocol can be used for SYPRO
Red stain (Ex and Em = 300 nm and 550/630 nm). For a review of fluorescent detection
technologies, including imaging devices and detection instrumentation, see Patton (2000). The
method was adapted from the SYPRO Orange staining protocol provided by Amersham
Biosciences. This staining method is mass-spectrometer-compatible.


MATERIALS 


CAUTION: See Appendix 3 for appropriate handling of materials marked with <!>. 


» Reagents 


Reagent A (100-ml stock)
Mix 92.5 ml of H2O with 7.5 ml of acetic acid <!>. Add 20 µl of SYPRO Orange gel stain
concentrate <!>.
SYPRO Red fluorescent protein stain uses the same binding principle as SYPRO Orange, and hence 
this protocol can be used with SYPRO Red protein gel stain. 


Reagent B (1000-ml stock)
Mix 925 ml of H2O with 75 ml of acetic acid.


» Equipment 
Aluminum foil 
Fluorescence scanner (Typhoon 8600, Molecular Dynamics or equivalent) 
Plastic container (standard Tupperware sized to fit gel) 


» Biological Sample


Protein samples separated on a one-dimensional gel by electrophoresis


METHOD


EXPERIMENTAL TIP: Reuse of the reagents is not recommended because evaporation and contam- 
ination can reduce effectiveness. 


IMPORTANT: At all stages of this protocol, when handling the gel, use powder-free gloves because
powder from the gloves fluoresces.


1. After electrophoresis is complete, place the gel in a plastic container containing enough
Reagent A to cover the gel. For example, use 50-100 ml of Reagent A for a typical minigel
(8 x 10 cm or 8 x 8 cm).


2. Cover with aluminum foil to protect the gel from bright light. 
3. Shake the gel for 60 minutes at room temperature. 


4. Remove Reagent A and set it aside for use in Step 5. Add an appropriate amount of
Reagent B to the gel and shake it for 1 minute at room temperature.


Longer rinsing times can lower the background, but they also reduce sensitivity.


5. Use a fluorescence scanner to scan the gel (SYPRO Orange has an excitation maximum of 
472 nm and an emission maximum of 570 nm). After scanning the gel, return it to 
Reagent A. 


6. The gel may be stored in Reagent A for several weeks, although signal intensity decreases
with time. Alternatively, the gel may be preserved by air-drying, although this slightly
reduces sensitivity. To air-dry the gel, place it between two wetted sheets of cellophane,
pull the cellophane taut, and allow the gel to dry undisturbed. Air-dried gels can be
stored indefinitely.


PROTOCOL 7 


Mass-spectrometry-compatible Silver Staining 


THIS ALTERNATIVE TO THE CONVENTIONAL STANDARD SILVER STAINING method uses the same
principle, but it omits some reagents to permit mass-spectrometry-based protein identification.
It is more sensitive than conventional Coomassie Blue staining procedures, allowing
visualization of as little as 1-2 ng of protein. For a detailed review of silver staining procedures
from which this protocol has been adapted, see Sinha et al. (2001).


MATERIALS 
CAUTION: See Appendix 3 for appropriate handling of materials marked with <!>. 
» Reagents
Reagent A
Mix 600 ml of ethanol <!> with 200 ml of acetic acid <!> and 1200 ml of H2O.
Reagent B
Dissolve 6 g of potassium tetrathionate and 98 g of potassium acetate in 600 ml of ethanol and 1400
ml of H2O.
Reagent C (0.2% AgNO3)
Dissolve 2 g of silver nitrate <!> in 1000 ml of H2O.
Reagent D (Developer)
Combine 60 g of potassium carbonate <!>, 600 µl of formaldehyde <!>, and 250 µl of 10% sodium
thiosulfate pentahydrate. Adjust the volume to 2000 ml with H2O.
Reagent E (Stop solution)
Combine 80 g of Tris and 40 ml of acetic acid. Adjust the volume to 2000 ml with H2O.
» Equipment


Plastic container (standard Tupperware sized to fit gel) 


» Biological Sample
Protein samples separated on one- and two-dimensional gels by electrophoresis


METHOD


EXPERIMENTAL TIP: Reuse of reagents is not recommended because evaporation and contamination
can reduce their effectiveness.


1. After electrophoresis is complete, place the gel in a plastic container containing enough
Reagent A to cover the gel. For example, use 100 ml of Reagent A for a typical minigel (8
x 10 cm or 8 x 8 cm).


2. Shake the gel for 30 minutes at room temperature, and then discard the used Reagent A.
3. Repeat Steps 1 and 2 three more times.


4. Add an appropriate amount of Reagent B to the gel, and shake it for 45 minutes at room
temperature.


5. Discard the Reagent B.
6. Wash the gel with H2O for 10 minutes at room temperature, and then discard the H2O.
7. Repeat Step 6 five more times (the total wash time is 60 minutes).


8. Add an appropriate amount (e.g., 200-250 ml for a 20 x 20-cm gel) of Reagent C to the
gel and incubate it with continuous shaking for 1-2 hours at room temperature.


9. Discard Reagent C in a suitable waste container and quickly rinse the gel with H2O for
5-15 seconds.


10. Add an appropriate amount of Reagent D to the gel. Continuously shake the gel at room
temperature.


11. Observe the gel during development of the protein bands. Once an appropriate sensitivity
is reached (i.e., clearly visible bands without a strong background), discard Reagent D.


12. Immediately cover the gel with Reagent E and shake it for 30 minutes at room temperature.


13. Discard Reagent E and store the gel for up to 6 months in H2O. Alternatively, the gel can
be air-dried and stored indefinitely.


PROTOCOL 8 


Apparent Molecular Mass Determination 


BECAUSE MIGRATION OF SDS-PROTEIN COMPLEXES in polyacrylamide gels is generally proportional
to the apparent molecular mass (M_r) of the protein, SDS-PAGE is a widely used procedure
for determining the M_r of a protein.


MATERIALS 


» Biological Molecules
Protein sample of interest 
Protein molecular-weight standards 


» Additional Reagents and Equipment 
Step 1 requires the reagents and equipment listed in Protocol 1.


Step 2 requires the reagents and equipment listed in one of the protein visualization proto- 
cols, Protocols 2-7. 


METHOD 


1. Run sample protein and protein molecular-weight standards on the same SDS-PAGE slab 
gel following the steps in Protocol 1. 


Running samples and protein standards on the same slab gel eliminates any variability due 
to acrylamide concentration and electrophoresis conditions. Although kits of protein stan- 
dards are commercially available from several sources, detailed lists of protein standards and 
their M_r values can be found in Weber and Osborn (1969), Neville (1971), and Hames
(1990). For careful M_r determinations using this method, it is advisable to run gels of more
than one acrylamide concentration (Ferguson 1964; Neville 1971). For a detailed discussion 
on the effect of both polyacrylamide and cross-linker concentrations on protein migration 
using various commercially available calibration proteins for SDS-PAGE, see Makowski and 
Ramsby (1997). 


2. After electrophoresis is complete, carry out one of the protein visualization protocols 
(Protocols 2-7). 


3. (Optional) Dry the gels prior to measurement of R_f values.


FIGURE 2.7. Semilogarithmic graph of molecular weight versus relative mobility (R_f). (Reprinted,
with permission, from Bollag et al. 1996.)


4. Measure the distance of protein migration and of the tracking dye (which is typically
bromophenol blue).


5. Calculate relative mobility (R_f) values:
R_f = distance of protein migration/distance of tracking dye migration


6. Plot the mobilities against the known M_r expressed on a semilogarithmic scale. The line
covering the middle of the gel should be almost linear (see Figure 2.7).


7. Read the M_r of the unknown protein from the graph based on its R_f value.


KEY POINT: Generate a new standard curve for each gel. 
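
As a rough illustration of Steps 5-7, the standard curve can also be fitted numerically instead of being read from graph paper. The sketch below is a minimal Python example, assuming that the calibration is close to linear in log10(M_r) versus R_f over the range of standards used. The migration distances, the set of standards, and the helper names (relative_mobility, fit_line, estimate_mr) are hypothetical and are not taken from this protocol.

```python
import math


def relative_mobility(protein_mm, dye_mm):
    """R_f = distance of protein migration / distance of tracking dye migration."""
    if dye_mm <= 0:
        raise ValueError("tracking dye distance must be positive")
    return protein_mm / dye_mm


def fit_line(xs, ys):
    """Ordinary least-squares fit of y = slope * x + intercept."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def estimate_mr(rf, slope, intercept):
    """Read M_r back from the fitted line log10(M_r) = slope * R_f + intercept."""
    return 10 ** (slope * rf + intercept)


# Hypothetical measurements from ONE gel: (M_r of standard, migration in mm).
DYE_FRONT_MM = 60.0
STANDARDS = [
    (97400, 10.0),
    (66200, 17.5),
    (45000, 26.0),
    (31000, 35.0),
    (21500, 44.0),
    (14400, 53.5),
]

rfs = [relative_mobility(dist, DYE_FRONT_MM) for _, dist in STANDARDS]
log_mrs = [math.log10(mr) for mr, _ in STANDARDS]
slope, intercept = fit_line(rfs, log_mrs)

# Hypothetical unknown band that migrated 30 mm on the same gel.
unknown_rf = relative_mobility(30.0, DYE_FRONT_MM)
unknown_mr = estimate_mr(unknown_rf, slope, intercept)
print(f"R_f = {unknown_rf:.2f}, estimated M_r = {unknown_mr:,.0f}")
```

Because the curve is only approximately linear in the middle of the gel, it is probably safest to fit only the standards that bracket the unknown band, and to refit for every gel.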


PROTOCOL 9 


CTAB-PAGE 


THIS PROTOCOL IS A VARIATION ON THE STANDARD SDS-PAGE METHOD described in Protocol 1.
Although SDS-PAGE is the method of choice for most denaturing gel electrophoresis procedures,
the anionic detergent SDS still presents some drawbacks. For example, SDS forms crystals
at low temperatures and, in some cases, causes proteins to aggregate or precipitate. In
addition, some proteins are not well resolved in SDS gels or may migrate anomalously. In
these situations, the use of a cationic detergent for PAGE offers an alternative approach. The
discontinuous, cationic detergent PAGE method described here is based on the method of
Atkins et al. (1992). This system uses the cationic detergent CTAB and includes a stacking gel
based on the zwitterion arginine (used as a stacking agent) and tricine
(N-tris[hydroxymethyl]-methylglycine) used as a counterion and buffer. Some proteins separated on the
CTAB electrophoresis system retain their native enzymatic activity, provided the samples are
prepared without boiling and without the addition of a reducing agent.


MATERIALS

CAUTION: See Appendix 3 for appropriate handling of materials marked with <!>.
IMPORTANT: All reagents should be electrophoresis grade or better.

» Reagents


Acrylamide stock solution (40%T, 2.67%C) <!>
Add 38.96 g of acrylamide and 1.04 g of bisacrylamide <!> to 100 ml of H2O. Filter the stock solution
using Whatman filter paper and store at 4°C. Prepare fresh stock acrylamide solution every few
weeks.


CTAB sample buffer 
10 mM tricine-NaOH (pH 8.8) <!>
1% CTAB
10% glycerol
10 µl/ml of a saturated aqueous solution of crystal violet <!>


2-Mercaptoethanol (2%) <!> 
Optional, see Step 3. 
Resolving gel mixture (6% acrylamide, 375 mM tricine-NaOH at pH 8) 
Mix: 
9.4 ml of H,O 
2.4 ml of acrylamide:bisacrylamide (40%T, 2.67%C)
4 ml of 1.5 M tricine-NaOH (pH 8)
0.16 ml of 10% ammonium persulfate <!>
16 µl of TEMED <!>
Add the TEMED just before pouring the gel. This makes enough gel mixture to pour two minigels, 
1.5-mm thick.

Stacking gel mixture (0.7% agarose, 0.1% CTAB, 125 mM tricine-NaOH at pH 10)
Add 35 mg of agarose to 1.25 ml of 0.5 M tricine-NaOH at pH 10 and 50 µl of 10% CTAB. Add H2O
to 5 ml. Melt the agarose by boiling or microwaving the mixture before pouring the gel. This makes
enough gel mixture to pour stacking gels for two minigels, 1.5-mm thick.
5x Tricine running buffer
Dissolve:
22.4 g of tricine
5 g of CTAB
75 ml of 1 M arginine (free base) in H2O
Add H2O to a final volume of 1 liter. The pH of the buffer should be ~8.2 and no pH adjustment is
required. Store the running buffer at room temperature and dilute to 1x before use.
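
The concentrations quoted in the reagent names above follow from simple dilution arithmetic, which is worth rechecking whenever a recipe is scaled. The sketch below is a minimal Python check, assuming that percentages are weight/volume, that volumes are additive, and that the acrylamide stock has a final volume of 100 ml. The function names (percent_t, percent_c, final_conc) are hypothetical helpers introduced only for this illustration.

```python
def percent_t(acrylamide_g, bis_g, volume_ml):
    """%T: total monomer (acrylamide + bisacrylamide) in g per 100 ml."""
    return 100.0 * (acrylamide_g + bis_g) / volume_ml


def percent_c(acrylamide_g, bis_g):
    """%C: cross-linker as a percentage of total monomer."""
    return 100.0 * bis_g / (acrylamide_g + bis_g)


def final_conc(stock_conc, stock_vol_ml, total_vol_ml):
    """C1 * V1 = C2 * V2, solved for C2 (units follow stock_conc)."""
    return stock_conc * stock_vol_ml / total_vol_ml


# Acrylamide stock: 38.96 g acrylamide + 1.04 g bisacrylamide, assumed 100 ml final.
stock_t = percent_t(38.96, 1.04, 100.0)
# By this definition %C comes out at 2.6; the 2.67% in the reagent name appears
# to match the bis:acrylamide ratio instead (1.04/38.96).
stock_c = percent_c(38.96, 1.04)

# Resolving gel: H2O + acrylamide stock + 1.5 M tricine + 10% APS + TEMED (ml).
resolving_total = 9.4 + 2.4 + 4.0 + 0.16 + 0.016
resolving_acrylamide = final_conc(stock_t, 2.4, resolving_total)   # % acrylamide
resolving_tricine_mM = final_conc(1500.0, 4.0, resolving_total)    # mM tricine

# Stacking gel, made up to 5 ml.
stacking_agarose = 100.0 * 0.035 / 5.0                 # % agarose (35 mg in 5 ml)
stacking_tricine_mM = final_conc(500.0, 1.25, 5.0)     # mM tricine
stacking_ctab = final_conc(10.0, 0.05, 5.0)            # % CTAB

# Running buffer: 75 ml of 1 M arginine per liter at 5x, then diluted to 1x.
running_arginine_1x = final_conc(1000.0, 75.0, 1000.0) / 5.0       # mM arginine

print(f"stock: {stock_t:.1f}%T, {stock_c:.2f}%C")
print(f"resolving gel: {resolving_acrylamide:.2f}% acrylamide, "
      f"{resolving_tricine_mM:.0f} mM tricine")
print(f"stacking gel: {stacking_agarose:.2f}% agarose, "
      f"{stacking_tricine_mM:.0f} mM tricine, {stacking_ctab:.2f}% CTAB")
print(f"1x running buffer: {running_arginine_1x:.0f} mM arginine")
```

The same three helpers can be reused for the acrylamide stocks in Protocols 10 and 12 by substituting the masses and volumes given there.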


» Biological Sample
Protein samples, either lyophilized or as pellets


» Additional Reagents and Equipment


Boiling water bath
Optional, see Step 3.
Step 1 requires some of the reagents and all of the equipment listed in Protocol 1.


METHOD 


1. Prepare the gels as described in Protocol 1, but substitute the resolving gel mixture and 
stacking gel mixture for those in Protocol 1. 


2. Fill the upper and lower buffer reservoirs with 1x tricine running buffer.


3. Prepare the protein samples by dissolving them in 50 µl of CTAB sample buffer at room
temperature, to a final concentration of 5 mg/ml. If retaining biological activity is not a 
concern, heat the samples for 3 minutes in a boiling water bath in the presence of 2% 2- 
mercaptoethanol. 


4. Load the samples into the sample wells. 


5. Start electrophoresis from the anode (+) to the cathode (-) at 100 V as the samples
migrate through the stacking gel. 


6. Increase the voltage to 150 V as the samples begin to migrate through the resolving gel. 


7. Turn off the power supply and stop the electrophoresis when the dye front reaches the 
bottom of the gel. 


PROTOCOL 10


Acid-Urea Continuous PAGE 


ACID-UREA CONTINUOUS PAGE IS IDEALLY SUITED for studying minor structure variants (of
slightly different charge) or modified forms of the same protein (e.g., histones). Because SDS- 
PAGE systems are usually unable to separate two proteins of similar size but different charge 
from each other, these two electrophoresis systems complement each other. This protocol is 
based on the methods of Panyim and Chalkley (1969) and Rabilloud et al. (1996). 


MATERIALS 
CAUTION: See Appendix 3 for appropriate handling of materials marked with <!>.
» Reagents
Acrylamide:bisacrylamide stock solution (50%T, 1.5%C) <!>
Mix 48.7 g of acrylamide and 1.3 g of bisacrylamide per 100 ml of H2O. Filter the stock solution
using Whatman filter paper and store at 4°C. Prepare fresh stock acrylamide solution every few
weeks.
Diphenyliodonium chloride stock (1 mM) <!>
Prepare in H2O. This solution can be stored for 1 week at 4°C in the dark.
Glacial acetic acid <!>
Methylene blue stock (2 mM) <!>
Prepare in H2O. This solution can be stored for 1 year at room temperature in the dark.
Reservoir buffer (0.9 M acetic acid)
Prepare as 5.4% glacial acetic acid in H2O.
Sample buffer (0.9 M acetic acid containing 2.5 M urea <!> and 0.1% pyronin Y <!>)
Sodium p-toluenesulfinate (20 mM) <!>
Prepare in H2O. This solution can be stored for 1 week at 4°C in the dark.
Urea <!>
» Equipment


Light source for photopolymerization 
A daylight fluorescent lamp (15 W) positioned ~10 cm from the gel can be utilized to catalyze the 
methylene blue photopolymerization. Alternatively, two light boxes (each with 12-W neon tubes) 
also positioned 10 cm from the gel on both sides can be used for photopolymerization. 


» Biological Sample


Protein samples, either lyophilized or as pellets


» Additional Equipment


Step 2 requires all of the equipment listed in Protocol 1, although a minigel apparatus will be 
sufficient. 


METHOD 


1. Prepare the gel mixture (15% acrylamide, 2.5 M urea, 0.9 M acetic acid at pH 2.7) by mixing:


acrylamide:bisacrylamide (50%T, 1.5%C)    3.0 ml
glacial acetic acid                       0.54 ml
urea                                      1.5 g
2 mM methylene blue                       0.15 ml
20 mM sodium toluenesulfinate             0.25 ml
1 mM diphenyliodonium chloride            0.2 ml
H2O                                       to 10 ml


2. Pour the gel into the minigel mold.


3. Photopolymerize for 2 hours with constant illumination. Pre-electrophoresis is not necessary.


4. Fill the buffer reservoirs with the reservoir buffer.
5. Dissolve the protein samples in sample buffer, up to 35 mg of protein/ml.
6. Load the protein samples into the sample wells of the gel.


7. Start the electrophoresis from the anode (+) to the cathode (-). Follow the manufacturer's
recommendations for the optimal voltage to use. For example, use 130 V constant
voltage if the Bio-Rad Mini-Protean II Cell minigel is used.


8. Turn off the power supply and stop the electrophoresis when the dye front reaches the
bottom of the gel.
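

The nominal composition quoted in Step 1 (15% acrylamide, 2.5 M urea, 0.9 M acetic acid) can be checked from the amounts mixed. The sketch below is a minimal Python calculation, assuming a final volume of 10 ml, glacial acetic acid treated as pure with a density of about 1.049 g/ml, and molecular weights of 60.06 for urea and 60.05 for acetic acid. These constants and the helper names are assumptions added for illustration and are not stated in the protocol.

```python
UREA_MW = 60.06              # g/mol (assumed)
ACETIC_ACID_MW = 60.05       # g/mol (assumed)
GLACIAL_DENSITY = 1.049      # g/ml, glacial acetic acid treated as 100% pure (assumed)
FINAL_VOLUME_ML = 10.0       # gel mixture is made up to 10 ml with H2O


def molarity_from_mass(mass_g, mol_weight, volume_ml):
    """Molar concentration of a solid dissolved to a known final volume."""
    return (mass_g / mol_weight) / (volume_ml / 1000.0)


def molarity_from_liquid(liquid_ml, density_g_per_ml, mol_weight, volume_ml):
    """Molar concentration of a pure liquid diluted to a known final volume."""
    return molarity_from_mass(liquid_ml * density_g_per_ml, mol_weight, volume_ml)


urea_molar = molarity_from_mass(1.5, UREA_MW, FINAL_VOLUME_ML)
acetic_molar = molarity_from_liquid(0.54, GLACIAL_DENSITY, ACETIC_ACID_MW,
                                    FINAL_VOLUME_ML)
# 3.0 ml of the 50%T stock diluted to 10 ml.
acrylamide_percent = 50.0 * 3.0 / FINAL_VOLUME_ML

print(f"urea: {urea_molar:.2f} M")
print(f"acetic acid: {acetic_molar:.2f} M")
print(f"acrylamide: {acrylamide_percent:.1f}%")
```

Under these assumptions the acetic acid comes out slightly above the nominal 0.9 M, which is within the rounding implied by "5.4% glacial acetic acid".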


PROTOCOL 11 


Electrophoresis of Peptides (Tricine-SDS-PAGE) 


THE MOST WIDELY USED PROTEOMICS METHOD for quantitatively analyzing protein mixtures is
SDS-PAGE. It is particularly useful for monitoring protein purification, and because the
method separates proteins on the basis of their size, it can also be utilized to determine the
relative molecular mass (M_r) of proteins (Protocol 8). Parameters affecting the resolution of
proteins or peptides separated by SDS-PAGE include the following:


* The ratio of acrylamide to the cross-linker, bisacrylamide.

* The percentage of acrylamide/bisacrylamide used to prepare the stacking and separating
gels.

* The pH of the stacking and separating gel buffers and the components of these buffers.

* The method by which the sample is prepared.


Typical Laemmli gel systems (Laemmli 1970), which utilize glycine in the running buffer,
are capable of resolving proteins in the molecular mass range of ~200,000 daltons down to
~3000 daltons. In this protocol, the tricine gel system of Schägger and von Jagow (1987) is
described, in which tricine is substituted for glycine. This system permits the resolution of
peptides as small as 500 daltons, making it suitable for SDS-PAGE peptide mapping and for
preparing samples for amino- and carboxy-terminal sequence analysis (see Chapters 7 and 6,
respectively).


MATERIALS

CAUTION: See Appendix 3 for appropriate handling of materials marked with <!>.

» Reagents


Lower reservoir buffer (0.2 M Tris-HCl at pH 8.9) 
Resolving gel mixture <!> 
Prepare a 10% acrylamide mixture following the recipe given in Table 2.4 (Protocol 1).
Stacking gel mixture <!> 
Prepare a 4% acrylamide stacking gel mixture following the recipe given in Table 2.5 (Protocol 1). 
Upper reservoir buffer 
0.1 M Tris 
0.1 M tricine
0.1% (w/v) SDS <!>
The pH of the buffer will be ~8.3.

» Biological Sample


Protein solution(s) or pellets of cellular proteins 
» Additional Reagents and Equipment


Steps 1 and 3 require some of the reagents and all of the equipment listed in Protocol 1.
Step 5 requires the reagents listed in either Protocol 2 or 7. 


METHOD 


1. Cast the discontinuous gel following the steps in Protocol 1. 
2. Fill the upper and the lower buffer reservoirs with the appropriate buffers.


3. Prepare the protein samples for loading onto the gel according to Protocol 1.


4. Start electrophoresis from the cathode (-) to the anode (+) at a constant current of 25
mA/gel.


5. Visualize the proteins by staining with Coomassie Blue or silver as described in Protocol 
2 or 7. 


PROTOCOL 12


Nondenaturing PAGE of Proteins 


NONDENATURING GEL ELECTROPHORESIS, also referred to as native gel electrophoresis, is commonly
run at pH 8.8. At this pH, most proteins are negatively charged and migrate toward
the anode. All stock solutions used in this system are similar to those used for the Laemmli
system (Laemmli 1970) but lack SDS (see Protocol 1). Native gels can also be run at low pH
(for a low pH, discontinuous system, where proteins are stacked at pH 5 and resolved at pH
3.8, see Shi and Jackowski 1998).


MATERIALS

CAUTION: See Appendix 3 for appropriate handling of materials marked with <!>.

» Reagents


Acrylamide stock (30%T, 2.6%C) <!> 
Add 29.22 g of acrylamide and 0.78 g of bisacrylamide <!> to 100 ml of H2O. Filter the stock solution
through Whatman filter paper and store at 4°C. Prepare fresh stock acrylamide solution every
few weeks.

Ammonium persulfate (10%) <!>


Electrophoresis buffer 


Dissolve 3.0 g of Tris base and 14.4 g of glycine <!> in H2O and adjust the volume to 1 liter. The
final pH should be 8.3. 


5x Sample buffer 

Mix:
15.5 ml of 1 M Tris-HCl (pH 6.8)
2.5 ml of a 1% solution of bromophenol blue <!>
7 ml of H2O
25 ml of glycerol
Solid samples can be dissolved directly in 1x sample buffer. Samples already in solution should be
diluted with 5x sample buffer (one volume of 5x buffer to four volumes of sample) to give a solution
that is 1x sample buffer. Do not use protein solutions that are in a strong buffer that is not near
pH 6.8, because it is important that the sample is at the correct pH. Such samples must first be
dialyzed against 1x sample buffer. The desired protein concentration will depend on the sensitivity
of the protein detection (visualization) method to be used (see the Introduction to Protein
Visualization Procedures, and Protocols 2-7).


Separating gel buffer (1.5 M Tris-HCl at pH 8.8) 
Stacking gel buffer (0.5 M Tris-HCl at pH 6.8)
TEMED <!>


» Equipment


A microsyringe, standard disposable pipette tips, or gel-loading tips for loading samples onto 
the gel 


» Biological Sample
Protein solution(s) or pellets of cellular proteins 


» Additional Reagents and Equipment 


Step 1 requires all of the equipment listed in Protocol 1. 
Step 2 requires the reagents listed in either Protocol 2 or 3 (optional). 


METHOD 


1. The procedures for setting up the gel cassette, pouring the gel, performing the elec- 
trophoresis, and handling, storing, and staining the gel are the same as those described in 
Protocol 1. 


2. If proteins are to be detected by their biological activity, run duplicate samples. Stain one 
set of samples for protein and the other set for activity. Most commonly, look for enzyme 
activity in the gel. This is achieved by washing the gel in an appropriate enzyme substrate 
solution that results in a colored product appearing in the gel at the site of the enzyme 
activity. Alternatively, visualize the proteins using CBR-250 stain, as described in Protocol 
2 or 3.


EXTRACTION OF PROTEIN FROM TISSUES AND CELLS is perhaps the most critical step in any proteomics
strategy because this step influences protein yield, biological activity, and the structural
integrity of the specific target protein. Thus, care must be taken in selecting the specific
extraction conditions employed. The principal aim must be to reproducibly achieve the
highest degree of cell breakage using minimal disruptive forces while maintaining protein
integrity. A summary of commonly used methods for homogenizing cells and tissues is given
in Table 3.1.

*This chapter includes a contribution from Melinda L. Ramsby and Gregory S. Makowski (University of
Connecticut Health Center, Farmington).

It is important to avoid altering the native structure of the target protein and, ipso facto,
its biological activity. Perturbation of native protein structure during preparation of cell
extracts can occur by exposure to extremes of pH, temperature, mechanical stress (shearing
forces), pressure, and proteolytic degradation. Although proteolytic degradation may not
alter the biological activity per se, it may influence the association of the target protein with
other cellular regulatory components. Such nonspecific alterations can result in irreproducibility
of behavior of the target protein from one preparation to the next, which makes
interpretation of biological studies extremely difficult. For discussions on avoiding proteolytic
degradation of proteins in extracts, see Beynon and Oliver (1996) and Beynon and
Bond (1989).


Key variables that determine the successful preparation of crude extracts include

* the method of cell lysis,
* the control of pH,
* temperature, and
* avoidance of proteolytic degradation.


Note: A trial-and-error approach in pilot experiments is often required to optimize cell
lysis conditions.


It is not always necessary to break open cells to extract recombinant proteins. For exam- 
ple, various high-expression mammalian cell lines (in particular, Chinese hamster ovary 
[CHO] cells) and strains of yeast (e.g., Pichia pastoris) have been readily engineered to secrete 
recombinant proteins that can be purified directly from cell-conditioned media and culture 
filtrates. Using these approaches, it is important to concentrate the large volumes of cell-con- 
ditioned media into a “protease-free” environment as quickly as possible. 

A different problem can occur with Escherichia coli expression systems, where the
expressed recombinant protein often appears in the crude extract (lysate) as insoluble aggregates,
referred to as "inclusion bodies." In this case, the purification of a target protein often
involves initial solubilization of the inclusion bodies in a strong denaturant (e.g., guanidine
hydrochloride or urea) and subsequent refolding. These procedures are described in Protocols
6 and 7.

When choosing a cell disruption strategy, it is important to consider the intended use of 
the cell lysate. For example, lysis conditions (choice of buffer, detergents, and so on) will vary 
markedly, depending on whether the lysate is to be used for 


* immunoprecipitation studies,
* western blotting,
* two-dimensional gel electrophoresis,
* native target protein isolation using conventional chromatographic purification procedures, and
* recombinant protein purification procedures that rely on the target protein expressed as a
fusion protein including a purification "handle" or "tag."


TABLE 3.1. Methods for homogenizing cells and tissues


Method                              Underlying basis of cell disruption                 Type of tissue

Gentle
Osmotic shock                       Osmotic disruption of cell membranes                bacteria, erythrocytes
Detergent lysis                     Detergent disruption of cell membranes              tissue culture cells
Enzymatic digestion                 Digestion of cell wall; contents released by        bacteria, yeast
                                    osmotic disruption
Dounce homogenizer and/or           Cells forced through a narrow gap with a            soft animal tissues and cells
Potter-Elvehjem homogenizer (a)     clearance of 0.05-0.08 mm (tight fitting) to
                                    0.1-0.3 mm (loose fitting); cell membrane
                                    disrupted by liquid shear forces. The smaller
                                    the clearance, the greater the shearing force;
                                    the clearance of a Teflon pestle (Potter-Elvehjem
                                    homogenizer) is normally 0.05-0.06 mm (b)

Moderately harsh
Homogenization (Waring Blendor)     Cells broken by rotating blades                     most animal, plant tissues
Grinding (with sand, alumina, or    Cell walls broken by abrasive action of             cell suspensions
glass beads)                        particles

Vigorous
French pressure cell                Cells forced through small orifice at high          bacteria, yeast, plant cells
                                    hydraulic pressure (100-150 MPa or
                                    15,000-20,000 psi) and disrupted by shear forces
Explosive decompression             Cells equilibrated with inert gas (e.g., N2) at     bacteria, yeast, plant cells
(nitrogen cavitation)               high pressure (typically, 5500 kPa or 800 psi);
                                    on exposure of cells to 1 atm, disruption occurs
Bead mill                           Rapid vibration with glass beads disrupts cell      cell suspensions
                                    wall
Ultrasonication                     High-pressure sound waves cause cell rupture by     cell suspensions
                                    cavitation and shear forces

(a) Sometimes called Teflon-and-glass homogenizer; it is power-driven, the pestle typically rotated at 500-1000 rpm.
(b) For soft tissue such as liver, a Potter-Elvehjem homogenizer with a clearance of ~0.09 mm is recommended. Smaller
clearances can lead to damage to released organelles (especially nuclei) as well as causing difficulty in moving the
glass vessel relative to the pestle in the early stages of homogenization (Graham 1997).

The first part of this introduction describes commonly used methods for cell lysis, 
including procedures for preparing crude cell extracts for immunoprecipitation and 
immunoblotting studies. The range of methods discussed is by no means exhaustive and does 
not include plant tissues and fungi. For more detailed reviews describing the preparation of 
crude extracts from eukaryotes, prokaryotes, and plants, see Deutscher (1990), Doonan
(1996), Graham (1997), and Spector et al. (1998). The second section deals with the problem 
of solubilizing E. coli-derived recombinant proteins from inclusion bodies. The third section 
discusses the advantages of isolating subcellular fractions and describes the differential deter- 
gent fractionation (DDF) of eukaryotic cells for the isolation and analysis of proteins. The 
protocols described in this chapter are sufficiently general and thus can be applied to a vari- 
ety of different tissues and cell types with only minor modifications. 


TISSUES AND CELLS CAN BE DISRUPTED BY MECHANICAL OR CHEMICAL MEANS 


Preparation of Protein Extracts from Mammalian Tissues 


The preparation of protein extracts from most animal tissues is relatively simple because the 
cell membranes are weak and easily disrupted by a combination of osmotic and mechanical 
forces. The first steps of a typical protein isolation procedure usually consist of washing the
tissue, disrupting the tissue in a suitable buffer using a homogenizer, and clarifying the
homogenate by centrifugation. The centrifugation step separates the soluble proteins from
the membrane fraction and insoluble cell debris. A generalized procedure that may be
applied to a number of different tissues and cell types with only minor modifications is given
in Protocol 1. Isolation of a particular membrane or subcellular organelle requires more specialized
procedures (see Graham and Rickwood 1997; Celis 1998; Spector et al. 1998).

The choice of tissue for extraction of soluble proteins depends on several criteria. Clearly,
the tissue distribution of a particular target protein varies markedly. Hence, it is always desirable
to perform small-scale pilot experiments to measure the relative content of the target
protein in various tissues. However, the final choice of starting tissue will, invariably, be determined
by the balance between target protein abundance, ready availability and cost of the tissue,
and various technical issues such as minimization of proteolytic activity. It should be
borne in mind that certain animal tissues (e.g., liver, spleen, kidney, and macrophages) are
rich in lysosomal proteases, particularly cathepsins, and should be avoided unless the goal of
the study is the proteolytic enzyme(s) itself. Fresh tissue is preferable, but in some cases,
frozen tissue may be acceptable, provided it is frozen rapidly in small pieces and not stored
for too long. It is recommended that frozen tissue be stored below -50°C, since some proteolytic
degradation of proteins can be expected due to release of proteases from lysosomes as
a result of ice-crystal formation (Dignam 1990).

In some cases, it may be desirable to gently disrupt tissues and prepare enriched populations
of intact cells, prior to disruption of these cells. For example, the separation of different
cell types from normal tissues and tumors may permit comparative information to be
obtained (e.g., protein expression profiles and mRNA profiles) that is unavailable if whole tissue
(or tumor) is studied. A number of procedures have been well described in various cell
biology manuals (e.g., see Celis 1998; Spector et al. 1998) for preparing cell suspensions from
tissues and organs using mechanical or enzymatic methods. In general, enzymatic methods
are preferred, since there is less damage to the integrity of the cells. In addition, it is usual to
add ethylenediaminetetraacetic acid (EDTA) to chelate Ca2+ ions that are frequently involved
in cell-cell adhesion. Purification of cells obtained by these means is usually accomplished by
the following:


* Differential centrifugation (based on cell size and density). This is accomplished using
iso-osmotic density gradients generated using nontoxic/nonpermeable media, notably, Percoll,
Ficoll, and metrizamide and, more recently, OptiPrep (based on iodixanol) (Graham et al.
1994).

* Centrifugal elutriation or counterstream centrifugation. This method is based on two
opposing forces, namely, media flow and centrifugal force (Bird 1998).

* Selective immunoseparation procedures employing monoclonal antibody-bound magnetic
beads (Dynabeads). This method is based on the efficient selection of specific cells using
a simple magnet (Neurauter et al. 1998).


All existing methods have inherent advantages and limitations, the discussion of which 
is beyond the scope of this manual. For a detailed discussion of these procedures and protocols
for enriching cell populations of interest from various tissues, see Celis (1998), Spector
et al. (1998), and references therein. 


Preparation of Protein Extracts from Mammalian Cultured Cells 


Mammalian cultured cells can be lysed by several different methods, the method of choice
depending on the final use of the target protein (e.g., immunoprecipitation, immunoblotting,
two-dimensional gel electrophoresis, and conventional purification). Mammalian cells
lack a cell wall and thus are easily lysed by treatment with mild detergents. If the final preparation
of the target protein need not retain its structural integrity (three-dimensional structure)
or biological activity, the cells can be lysed under harsh denaturing conditions (e.g.,
RIPA lysis buffer). If gentler conditions are required, the Nonidet P-40 (NP-40) lysis buffer
(or variations thereof) should be used.


Many extraction conditions release proteolytic enzymes in the lysis buffer. If proteolytic
degradation of the target protein becomes a problem, two approaches can be employed to
lessen its effect:

* Keep the sample cold (temperature has a profound effect on the catalytic activity of most
proteases).

* Add protease inhibitors to the lysis buffer (for a list of some of the commonly used
protease inhibitors, see Table 3.3).


For immunoprecipitation studies, the conditions used for lysis should be as gentle as
possible to maintain the structural integrity of the target protein and to minimize the
solubilization of irrelevant proteins. This is best accomplished by using nonionic detergents
instead of ionic ones, lower rather than higher concentrations, and single detergents rather
than mixtures. The two most widely used lysis buffers for the extraction of proteins from
cultured mammalian cells are NP-40 lysis buffer and RIPA lysis buffer. The former buffer
releases cytoplasmic and nuclear proteins without releasing chromosomal DNA, which, because
of its viscous nature, can cause numerous problems during protein purification and analysis.

The lysis conditions can be easily tailored to suit the target protein. Variables that can 
affect the release of a target protein from a cell include salt concentration, type of detergent, 
presence of divalent cations, and pH. To determine the optimum conditions for extracting a 
target protein, the variables listed below should be monitored in pilot experiments (Harlow 
and Lane 1999): 


* Salt concentration should be varied from 0 to 1 M.

* Nonionic detergent concentrations between 0.1% and 2%.

* Ionic detergent concentrations between 0.01% and 0.5%.

* Divalent cation concentrations between 0 and 10 mM.

* EDTA concentrations between 0 and 5 mM.

* pH values between 6 and 9.
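
A pilot experiment of this kind amounts to testing a small grid of conditions. The following Python sketch enumerates such a grid. The ranges come from the list above, but the individual test points, the choice of only three variables, and the names `PILOT_LEVELS` and `pilot_conditions` are illustrative assumptions and are not taken from Harlow and Lane (1999).

```python
from itertools import product

# Ranges follow the list above; the specific test points inside each range
# are assumptions chosen for illustration only.
PILOT_LEVELS = {
    "NaCl_M": [0.0, 0.15, 0.5, 1.0],            # salt, 0 to 1 M
    "nonionic_detergent_pct": [0.1, 0.5, 2.0],  # 0.1% to 2%
    "pH": [6.0, 7.5, 9.0],                      # pH 6 to 9
}

def pilot_conditions(levels):
    """Return every combination of the chosen levels as a list of dicts."""
    names = list(levels)
    return [dict(zip(names, combo)) for combo in product(*levels.values())]

conditions = pilot_conditions(PILOT_LEVELS)
print(len(conditions), "lysis conditions to test")
for condition in conditions[:3]:
    print(condition)
```

A full grid grows quickly with each added variable, so in practice it may be more economical to vary one or two variables at a time around a standard buffer.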

A generalized procedure for lysing tissue culture cells for the purpose of performing 
immunoprecipitation is given in Protocol 2. 

For immunoblotting studies, the conditions used for lysis can be harsher than those used
for immunoprecipitation studies. The most widely used buffer system is the Laemmli sample
buffer (Laemmli 1970) containing 2% SDS. A procedure for lysing tissue culture cells for the
purpose of performing immunoblotting is given in Protocol 3. An alternative procedure for
lysing any tissue culture cells (and microorganisms such as bacteria) that relies on gaseous
shear to disrupt cells is nitrogen cavitation (see Protocol 4).


Disruption of Bacterial Cells 


Due to the advent of recombinant DNA technology, bacteria are now a particularly convenient
vehicle for generating large quantities of recombinant protein. Enormous numbers of
bacteria can be grown under defined conditions and are relatively easy to break open for
extraction purposes. A number of methods, based on mechanical and enzymatic means, are
available for lysing bacteria, and to a large extent, the choice will depend on the scale of the
process (for a review, see Cull and McHenry 1990).

The mechanical procedures for lysing bacteria (e.g., French pressure cell, nitrogen
cavitation, ultrasonication, grinding with abrasive agents, and vortexing with glass beads) rely
on shearing forces to disrupt the tough outer cell wall (for a review of these procedures, see
Graham 1997). However, mechanical methods are more likely to damage the cellular contents
(compared with enzymatic methods) and are not easily scaled up.

The French Press lyses cells by applying hydraulic pressure to the cell suspension
(typically, 8,000-20,000 psi, or 550-1400 kg/cm²), followed by a sudden release to atmospheric
pressure. In this device, the cell suspension is propelled by a piston through a narrow orifice,
often the annulus around a ball bearing. The rapid change in pressure causes a liquid shear
and consequent bursting of cells. In some cases, two to three passes of the cells through the
French Press are required to obtain adequate lysis (Cull and McHenry 1990). The French
Press method works well for cell suspension volumes of 10-30 ml (ratio of cell wet weight to
lysis buffer volume can range from 1:1 to 1:4 g/ml), but it is considered too time-consuming
for larger volumes and can be technically difficult for smaller volumes.

In nitrogen cavitation devices, the pressure cell consists of a robust stainless steel cylinder
with an inlet port for delivery of nitrogen gas from a cylinder and an outlet tube with a
needle valve. During pressurization (i.e., 5500 kPa [800 psi] for 10-30 minutes), nitrogen
dissolves in the cell suspension buffer as well as in the cytosol of the cells. When the needle
valve is opened, the suspension is forced through the outlet tube, and at 1 atm, it experiences a
rapid decompression that causes cell disruption due to the sudden formation of bubbles of
nitrogen gas. This method can be used with bacteria or any tissue culture cell in volumes
ranging from 1 to 1000 ml. Nitrogen cavitation (see Protocol 4) eliminates the heat buildup
associated with mechanical and ultrasonic disruption because the cells are actually cooled by
the expanding gas.
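
The operating pressures quoted for the French Press and for nitrogen cavitation are given in mixed units. The short Python sketch below converts between them and also works out the lysis buffer volume implied by the 1:1 to 1:4 (g/ml) ratio of cell wet weight to buffer. The function names and the 10-g example are assumptions introduced for illustration; the conversion factors are the standard ones, rounded.

```python
# Standard conversion factors, rounded to the digits shown.
KPA_PER_PSI = 6.894757
KGF_CM2_PER_PSI = 0.0703070

def psi_to_kpa(psi):
    """Convert pounds per square inch to kilopascals."""
    return psi * KPA_PER_PSI

def psi_to_kgf_cm2(psi):
    """Convert pounds per square inch to kilograms-force per cm^2."""
    return psi * KGF_CM2_PER_PSI

def lysis_buffer_range_ml(wet_weight_g, min_ml_per_g=1.0, max_ml_per_g=4.0):
    """Buffer volumes (ml) spanning the 1:1 to 1:4 g/ml cell-to-buffer ratios."""
    return wet_weight_g * min_ml_per_g, wet_weight_g * max_ml_per_g

for psi in (8000, 20000):
    print(f"French Press {psi} psi = {psi_to_kgf_cm2(psi):.0f} kg/cm^2")
print(f"Nitrogen cavitation 800 psi = {psi_to_kpa(800):.0f} kPa")
print("Buffer for 10 g wet cells (ml):", lysis_buffer_range_ml(10))
```

The converted values come out slightly above the rounded figures quoted in the text (550-1400 kg/cm² and 5500 kPa), which is the expected effect of rounding.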

Sonication disrupts cells by creating vibrations that cause mechanical shearing of the cell
wall. This method is suitable for small-scale purifications (up to 1 g of cells or tissue can be
lysed at a time). Generation of heat during sonication can be a problem and may result in
protein denaturation (as evidenced by foaming); this problem can be overcome by sonicating the
cell suspension in short bursts and allowing the sample to cool on ice between treatments.

Like the French Press method, the grinding of cells with abrasive materials such as
alumina or sand is an efficient method of lysing unicellular organisms, as well as plant cells.
Unlike the French Press, this grinding method requires inexpensive materials (e.g., mortar,
pestle, and either sand or alumina) and is effective for moderate quantities of cells (up to
30 g wet weight of cells) (Fahnestock 1979; Sebald et al. 1979). Abrasives can also be added to
sonication mixtures.

An extension of the grinding method is the glass bead vortexing procedure, which has been 
described for lysing unicellular organisms, particularly yeast (Schatz 1979). Glass bead vor- 
texing is suitable for small samples (~3 g wet weight of cells) that can withstand being repeat- 
edly vortexed with glass beads. A number of instruments are now commercially available for 
lysing larger quantities of cells (e.g., Manton-Gaulin homogenizer, Braun MSK Glass Bead 
Mill). 

Methods based on enzymatic breakdown of the cell wall use the activity of lysozyme,
which cleaves the glycosidic linkages in the bacterial cell-wall polysaccharide, to cause
disruption (the inner cytoplasmic membrane can be readily disrupted by detergents, osmotic
pressure, or mechanical methods). Enzymatic methods are much gentler than mechanical
methods and are more easily applied to large-scale processing. Two generic methods for
preparing bacterial extracts are given in Protocols 5 and 6.


Disruption of Yeast Cells


A number of procedures exist for preparing yeast extracts, including autolysis (e.g., addition
of toluene to yeast suspension), French pressure cell, abrasives (e.g., glass beads), and enzy- 
matic lysis (for an overview, see Jazwinski 1990; Bridge 1996). Of these, the most widely used 
procedure is the use of abrasives. The abrasive action of well-agitated glass beads (typically, 
0.5-mm diameter) on yeast cells yields up to 95% cell breakage (Jazwinski 1990). Although 
this method is useful for preparing enzymes and some cell organelles, it is considered too 
harsh to preserve the integrity of nuclei as well as protein complexes (enzymatic methods are 
recommended for this purpose). Major differences between abrasive procedures reside in the 
method employed to agitate the glass beads. One of the simplest methods for agitating glass 
beads involves the use of a vortex mixer (Baker et al. 1988). A generic method for lysing small 
quantities of yeast cells employing the glass bead vortexing approach is given in Protocol 8. 
For a discussion on commercially available equipment for lysing yeast, see Jazwinski (1990). 


RECOMBINANT PROTEIN CAN BE RECOVERED FROM INCLUSION BODIES BY 
DENATURATION AND RENATURATION 


A significant impediment to overexpressing recombinant proteins in E. coli is the tendency
for the targeted protein to form inclusion bodies, which are in vivo agglomerates of proteins 
that appear in the cytoplasm as large, dense bodies in scanning electron micrographs 
(Williams et al. 1982). Inclusion bodies are relatively rare in nature (most proteins are 
expressed in soluble form), with sickle cell anemia and other related blood diseases being the 
notable exception (Carrell et al. 1966). Overexpression of recombinant proteins from strong 
promoters on multiple-copy plasmids—with expression levels up to 40% of total cell pro- 
tein—is thought to be the underlying reason for inclusion body formation in E. coli. For a 
review of heterologous protein production in E. coli and the purification of recombinant pro- 
teins from inclusion bodies, see Hockney (1995) and Marston (1986). 

It is thought that inclusion body protein is partially or incorrectly folded, especially a
protein containing disulfide bonds. An important difference between the expression of
eukaryotic proteins in E. coli and their native environment is the inability of E. coli to form
disulfide linkages, due to the reducing environment of its cytoplasm. Inclusion body
formation of overexpressed recombinant protein in E. coli is found not only for foreign
eukaryotic proteins, but also for overexpressed bacterial proteins that are normally soluble
(Gribskov and Burgess 1983). Temperature-sensitive denaturation can be overcome (by reducing
the frequency of inclusion body formation, and therefore increasing the soluble fraction of the
target protein) by lowering the expression temperature from 37°C to 30°C (Schein and
Noteborn 1988).

In addition to growth temperature, a number of other growth parameters have been 
manipulated to prevent inclusion body formation and increase the soluble fraction of the tar- 
get protein. Most notable of these are the following: 


* Varying the media composition and using different host strains (Schein and Noteborn
1988).

* Coexpression of molecular chaperones. For example, coexpression of DnaK increased the
percentage of soluble human growth hormone in E. coli by ~87% (Blum et al. 1992), and
coexpression of GroES and GroEL facilitated the purification of milligram quantities of
recombinant p50csk (Amrein et al. 1995).

* Fusion of the target protein with a highly soluble protein such as glutathione-S-transferase
(Smith and Johnson 1988) or thioredoxin (La Vallie et al. 1993).

* Growing cells in the presence of sorbitol and glycyl betaine. Sorbitol facilitates the cellular
uptake of the "protein stabilizer" glycyl betaine (Blackwell and Horgan 1991).


It should be emphasized that none of the above-mentioned approaches for increasing the
soluble fraction of the target protein are general for all applications (Hockney 1995), and
pilot experiments are recommended to ascertain the best approach for a particular protein.
The following are two major advantages of inclusion bodies:


* By sequestering recombinant protein, these bodies permit the cell to express the protein at
high levels.

* The inclusion bodies can be readily purified away from bacterial cytoplasmic proteins by
centrifugation, yielding an effective purification step.


The major disadvantage of inclusion bodies is that extraction of the target protein requires
the use of detergents. This problem is exacerbated where natively folded protein is required,
particularly if the target protein contains more than three disulfide bonds. The protein
contained within inclusion bodies is generally insoluble in nonionic detergents and salts. The
most widely used solubilizing agents are the water-soluble chaotropic agents, such as urea
(8 M) and guanidine hydrochloride (6 M), that cause complete denaturation and are more
compatible with protein refolding. Thus, the target protein can be renatured by simply removing
the denaturant under conditions that favor complete refolding of the target protein over the
formation of aggregates due to intermolecular protein-protein interactions. Such refolding
conditions include low temperature and very low protein concentration (<1 µg protein/ml).

For cysteine-containing target proteins that contain disulfide bonds in their native
structure, solubilization of inclusion bodies is usually performed in the presence of a reducing
agent, such as dithiothreitol or β-mercaptoethanol, in order to fully reduce the target protein
and to prevent the formation of disulfide-bonded aggregates (i.e., to favor intramolecular
versus intermolecular disulfide bonds) during the refolding process. The most widely used
methods for the correct refolding of recombinant proteins (Marston 1986; Kohno et al. 1990;
Marston and Hartley 1990) use dilution and dialysis to reduce the urea or guanidine
hydrochloride concentration gradually. In some cases, redox conditions (e.g., reduced/oxidized
glutathione) are maintained during the refolding process to accelerate the correct pairing
of disulfides and formation of the native structure (Light 1985). This may be critical in
some cases (e.g., interleukin-2 [Tsuji et al. 1987; Weir and Sparks 1987]), but not others (e.g.,
interleukin-6 [Zhang et al. 1992]). In both cases, the target protein is in a fully reduced form
in inclusion bodies. Interestingly, the best yields of recombinant prochymosin (which
contains three disulfide bonds in the native structure) were obtained by omitting redox reagents
at both the solubilization and refolding stages (Marston and Hartley 1990); in fact, little
activity was recovered when the molecule was fully reduced and then solubilized in 8 M urea.
A generic procedure for the isolation and solubilization of inclusion bodies and renaturation
of target protein is given in Protocol 7.


PREPARATION OF SUBCELLULAR EXTRACTS ENRICHES FOR TARGET PROTEINS 


Why Isolate Subcellular Fractions?


Subcellular fractionation has been widely practiced by biologists during the past 50 years to
gain better insight into the composition and function of cellular organelles and
macromolecules. Understanding the biological function of proteins requires knowledge of their
subcellular localization and their movement from one compartment in the cell to another in
response to external biological stimuli. Proteins must be localized in the same cellular
compartment if they are to cooperate toward the execution of common physiological functions,
such as metabolism (metabolic pathways), cell signaling (signal transduction pathways),
programmed cell death (apoptosis), and maintenance of cell structure (cytoskeleton). For
instance, the reversible localization of signal transducing proteins to both the plasma and the
internal membranes of cells is critical for the selective activation of downstream functions
and depends on both protein/protein and protein/membrane-lipid interactions (Johnson
and Cornell 1999; Hurley and Meyer 2001). In this example, in addition to identifying all
protein/lipid-mediated subcellular localization mechanisms, it is also important to establish
how these targeting mechanisms organize subcellular signaling events in time and space. This
latter question represents one of the major challenges of the post-genome era.

Now that the template of the human genome has been fully described, the quest to
understand the complete protein complement (the proteome) of cells in various biological
states and in response to assorted biological stimuli is gaining greater impetus. However, this
is a problem with formidable obstacles, because the number of proteins in a cell is large
and the protein complement within a cell is dynamic. Attempts to analyze an entire cellular
proteome without fractionation often fail because the complete protein complement simply
cannot be resolved by current techniques, such as two-dimensional gel electrophoresis. Thus,
there is a need to tackle the problem of defining a cell's total proteome by breaking it up into
"bite size" components and fully characterizing the proteomes of the individual components.
A natural choice is to separate the cell into subcellular organelles and compartments and then
probe the proteomes of these individual cellular compartments. Already, a systematic
identification of many of the various cell organelles has been initiated to understand their
function and biogenesis (see Table 3.2) (for a review, see Jung et al. 2000a,b).


2D or Not 2D? 


It is apparent that two-dimensional (2D) gel electrophoresis (see Chapter 4), the classical
method for studying the global expression of cellular proteins, has major limitations (Gygi et
al. 2000):


* 2D electrophoresis gels are unable to resolve all proteins (although improvements in the
technology, such as isoelectric focusing [IEF] gels covering only a single pH unit, have
increased the resolution of the method) (Corthals et al. 2000; Görg et al. 2000; Hanash
2000; Hoving et al. 2000; Wildgruber et al. 2000).

* Some protein classes (e.g., the membrane proteins) are underrepresented on 2D
electrophoresis because of their poor solubility. Efforts to overcome this problem have been
made by using novel detergents (see Santoni et al. 2000) and differential solubilization
methods (see Molloy et al. 1998).

* The limited sample capacity and limited detection sensitivity of 2D electrophoresis reduce
the ability to detect low-abundance proteins. For example, if a total yeast cell lysate is
fractionated by 2D electrophoresis and silver-stained, proteins present at <1000 copies per cell
(a large portion of the yeast proteome) are not detected (Gygi et al. 2000).


TABLE 3.2. Proteomic analyses of subcellular compartments and organelles

Subcellular compartment (organelle)                         References
Human placental mitochondria                                Rabilloud et al. (1998)
Rat liver mitochondria                                      Lopez et al. (2000)
Human placental lysosomes                                   Chataway et al. (1998)
Rat hepatocyte Golgi complex                                Taylor et al. (1997, 2000)
Trans-Golgi network-derived carrier vesicles                Fiedler et al. (1997); Morel et al. (2000)
MDCK II cell endocytic vesicles (apical and basolateral)    Fialka et al. (1999)
Burkitt lymphoma BL60 cell line, nuclear fraction           Müller et al. (1999)
Phagosomes                                                  Garin et al. (2001)
Rat liver mitochondrial ribosomes                           Cahill and Cunningham (2000)
Human monocytic lysosomes                                   Journet et al. (2000)
Rat liver caveolae-enriched plasma membranes                Calvo and Enrich (2000)
Epithelial cell plasma membranes                            Simpson et al. (2000)
Human myeloid leukemia HL-60 cells microsomal fraction      Han et al. (2001)


2D electrophoresis, like most other protein separation methods, is capable of revealing
only a "narrow slice" of a cell's total proteome. To address this limitation, some measure of
subcellular fractionation prior to 2D electrophoresis is required to bring low-abundance
proteins into view. For example, comparative 2D electrophoresis analysis of phagosomes
containing living mycobacteria and those containing dead bacilli facilitated the identification
of a protein, termed TACO, that allows this intracellular pathogen to evade host defense
strategies and survive within macrophages (Ferrari et al. 1999). Such an analysis on intact
macrophages would have been nigh on impossible. Another approach to reveal more
protein information on 2D electrophoresis involves a combination of subcellular fractionation
and the use of narrow pH range gels (Cordwell et al. 2000). Other groups advocate
prefractionating complex protein mixtures by solution-based IEF prior to 2D electrophoresis (Zuo
and Speicher 2000; Righetti et al. 2001) or free-flow electrophoresis (FFE)/SDS-PAGE
(Hoffman et al. 2001). Other non-2D electrophoresis proteomic approaches for identifying
low-abundance proteins include a combination of ion-exchange chromatography and
reversed-phase high-performance liquid chromatography (RP-HPLC) (see Chapter 8 and
Appendix 2).


Two Main Steps in the Subcellular Fractionation Process 


Although the aim of subcellular fractionation is to separate cellular compartments with
minimal damage to them, it is evident that it is never possible with the use of current
fractionation techniques to recover cellular organelles in a completely undamaged state. Indeed,
it may never be possible to separate cell organelles in a natural state. At present, there is no
single best way to fractionate tissue that applies to all tissues (it cannot be assumed that the
subcellular fractionation methods applied to one tissue can be applied to another). To complicate
the problem further, methods to determine the subcellular localization of proteins (e.g.,
fluorescence microscopy, in situ hybridization, and electron microscopy) (Spector et al. 1998)
differ from methods used to isolate a particular organelle for proteomic analysis. There are
two main steps in the subcellular fractionation process: homogenization of tissues and cells
followed by separation of cellular organelles. The principal aims of any homogenization
method are to


* achieve maximal cell breakage in a reproducible manner,

* use disruptive forces that minimize damage to the organelles of interest (e.g., protein
denaturation and proteolytic degradation), and

* retain the original structure and functional integrity of the organelles of interest.


The principal methods for disrupting cells (osmotic shock, ultrasonic vibration,
mechanical grinding or shearing, and nitrogen cavitation) are detailed in earlier parts of this
chapter and in the protocols that follow (see also Graham and Rickwood 1997; Celis 1998;
Spector et al. 1998). The components of the homogenate are then separated by procedures based
on the variations in physical properties of the subcellular components, for example,


* centrifugal methods that separate organelles by size and/or buoyant density in gradient
media,

* immunoisolation methods that use antibodies, which bind to specific surface proteins, and

* electrophoresis methods that separate proteins on the basis of surface charge distribution.


For both overviews on subcellular fractionation and isolation of specific cell organelles, 
see Spector et al. (1998), Graham and Rickwood (1997), and Celis (1998). 


Detergents Can Be Used in Combination to Fractionate Eukaryotic Cells 


Differential detergent extraction is an established method for cell fractionation, which
partitions subcellular constituents into functionally and structurally distinct compartments
(Lenk et al. 1977; Lenstra and Bloemendal 1983; Fey et al. 1984; Ramsby et al. 1994; Ramsby and
Makowski 1999). The use of differential detergent fractionation (DDF) to obtain cell
fractions enriched in cytosolic, membrane, nuclear, and cytoskeletal proteins for direct analysis
by 2D electrophoresis has been described recently (Ramsby et al. 1994; Ramsby and
Makowski 1999). DDF is applicable for the fractionation of cells grown in suspension or
monolayer culture, as well as for fractionation of whole tissue, and can be modified to achieve
specific fractionation goals, including further subfractionation.

DDF preserves the structural and functional integrity of cellular proteins, including the 
cytoskeleton, and permits direct biochemical analysis of detergent extracts by a variety of 
methods, including enzymatic assays, autoradiography, immunoblotting, immunoprecipita- 
tion, 2D electrophoresis, and mass spectrometry (Lenk et al. 1977; Lenstra and Bloemendal 
1983; Fey et al. 1984; Ramsby and Kreutzer 1993; Ramsby et al. 1994; Patton 1999; Ramsby 
and Makowski 1999). Thus, DDF is useful in a variety of proteome research applications, and 
has been used to 


* determine the subcellular localization of biologics,

* semipurify compartment-specific macromolecules,

* enrich for low-abundance proteins,

* investigate dynamic interactions between cytosolic and structural entities (i.e., membranes
and the cytoskeleton), and

* monitor treatment-induced compartmental redistributions of macromolecules.


Protocol 9 describes a procedure for the fractionation of proteins from eukaryotic cells, 
and the preparation of those proteins for analysis by 2D electrophoresis. Additional protocols 
detail the use of DDF for the concomitant isolation of total cellular RNA and the isolation of 
tubulins and microtubule-associated proteins present in cytosolic extracts. 


PROTOCOL 1


Homogenization of Mammalian Tissue


TO PURIFY OR CHARACTERIZE AN INTRACELLULAR PROTEIN, it is important to choose an efficient
method for disrupting the cell or tissue that rapidly releases the protein from its intracellular
compartment into a buffer that is not harmful to the biological activity of the protein of
interest. One of the most widely used methods for disrupting soft tissues is homogenization.
In this protocol, three processes for tissue homogenization using mechanical shear are
discussed: chopping the tissue in a Potter-Elvehjem glass-Teflon homogenizer (see Figure 3.1),
a Dounce hand homogenizer, or a hand-held Waring Blendor. These methods are rapid and
pose little risk to proteins other than the release of proteases from other cellular
compartments. Proteolytic degradation can be minimized by the inclusion of protease inhibitors in
the homogenization buffers.


MATERIALS


CAUTION: See Appendix 3 for appropriate handling of materials marked with <!>.


Reagents


Dithiothreitol (DTT) (0.5 M) <!>
  Prepare a 0.5 M stock solution in cold H2O and store frozen. Add the reagent to cold buffers
  at the indicated concentration just prior to use.

Homogenization buffer A
  50 mM Tris-Cl (pH 7.5)
  2 mM EDTA
  150 mM NaCl
  0.5 mM DTT

Homogenization buffer B
  50 mM Tris-Cl (pH 7.5) <!>
  10% (v/v) glycerol (or 0.25 M sucrose)
  5 mM magnesium acetate <!>
  0.2 mM EDTA
  0.5 mM DTT
  1.0 mM phenylmethylsulfonyl fluoride (PMSF) <!>


FIGURE 3.1. (Left) A Potter-Elvehjem homogenizer (inset: Dounce homogenizer). The device
holding the Potter-Elvehjem homogenizer allows the glass vessel to be cooled in an ice-water
mixture and the pestle can be advanced using a remote handle. (Reprinted, with permission,
from Evans 1992, as in Graham and Rickwood 1997.) (Right) A Polytron homogenizer. (Inset)
(Inverted) The rotating teeth of the work head. (Reprinted, with permission, from Evans 1992
[©Oxford University Press], as in Graham and Rickwood 1997.)


The choice of homogenization buffer will depend on the nature of the extract required. Generally,
use a buffer of moderate ionic strength at neutral pH (e.g., 0.05-1.0 M phosphate or Tris, pH
7.0-7.5). The appropriate buffer ionic strength should be chosen by trial and error to optimize the
yield of the target protein. For example, the addition of 0.1 M NaCl or KCl will increase the yield of
those proteins that have a tendency to attach electrostatically to cell debris/membrane fragments.
On the other hand, the association-dissociation behavior of some proteins is influenced markedly
by ionic strength. For example, the globular (G) form of actin can be extracted from muscle using
low-ionic-strength solutions (0.01 M KCl), but actin aggregates to form fibers (F-actin) if the ionic
strength is raised (e.g., to 0.1 M KCl) (Price 1996). If the purpose of the extraction is to isolate
organelles, it is important to use low-ionic-strength buffers (e.g., 5-20 mM Tris, HEPES, or TES at
pH 7.4) containing iso-osmotic sucrose or mannitol (0.25 M). Avoidance of proteolytic degradation
of the target protein in a crude extract is a primary concern. In many cases, it may not be essential
to add protease inhibitors to the homogenization buffer (due to the protective effect of bulk
protein on a target protein), but some proteins are more susceptible to proteolysis than others, and
some tissues (e.g., liver and pancreas) have much higher levels of proteases than others (e.g., heart).
The use of protease inhibitor cocktails can be expensive if the extract volumes are large. Hence,
carry out pilot experiments over a period of a few hours to ascertain whether there are measurable
losses of the target protein activity. If proteolytic degradation is a problem, then include protease
inhibitors in the homogenization buffers (for a list of protease inhibitors, see Table 3.3 and for the
preparation of protease inhibitor cocktails, see Table 3.4). If the target protein is susceptible to
oxidation or its activity is inhibited by heavy metals, then add DTT (1 mM) (or 0.1 M
β-mercaptoethanol) and EDTA (0.1 M), respectively, to the extraction buffer.
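
The ionic strengths mentioned in this note can be compared numerically using the standard definition I = 1/2 Σ c_i z_i², where c_i is the molar concentration and z_i the charge of each ion. The Python sketch below applies it to the salts named in the note and in homogenization buffer B. It assumes complete dissociation of each salt and ignores the contribution of the buffer species (Tris, phosphate), so the numbers are rough guides only; the function names are hypothetical.

```python
def ionic_strength(ions):
    """Ionic strength (M) from (molar concentration, charge) pairs."""
    return 0.5 * sum(conc * charge ** 2 for conc, charge in ions)

def salt(conc_m, cation_charge, n_cations, anion_charge, n_anions):
    """Ions released by a fully dissociated salt (an assumption)."""
    return [
        (conc_m * n_cations, cation_charge),
        (conc_m * n_anions, anion_charge),
    ]

kcl_low = ionic_strength(salt(0.01, +1, 1, -1, 1))       # 0.01 M KCl
kcl_high = ionic_strength(salt(0.1, +1, 1, -1, 1))       # 0.1 M KCl
mg_acetate = ionic_strength(salt(0.005, +2, 1, -1, 2))   # 5 mM magnesium acetate

print(f"0.01 M KCl: I = {kcl_low:.3f} M")
print(f"0.1 M KCl: I = {kcl_high:.3f} M")
print(f"5 mM magnesium acetate: I = {mg_acetate:.3f} M")
```

For a 1:1 salt such as KCl or NaCl the ionic strength equals the molar concentration, whereas a salt with a divalent ion contributes three times its molar concentration.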


Phenylmethylsulfonyl fluoride (PMSF) (0.2 M) 
Prepare a 0.2 M stock solution in 2-propanol, and add the reagent to cold buffers with adequate stir- 
ring just prior to use; the reagent will crystallize from 2-propanol when stored at —20°C. 
Aminoethylbenzenesulfonyl fluoride (AEBSF) is a water-soluble alternative to PMSF that can be 
used at the same molar concentration (0.1-1.0 mm} for most applications. 
TABLE 3.3. Inhibitor cocktails used to control proteolysis during protein isolation

Tissue type        Protease inhibitors (working concentration)          Target protease typeᵃ   Stock solutionᵇ
Animal tissues     AEBSF (0.2 mM) (or DCI [0.1 mM] or PMSF [0.2 mM])    serine                  20 mM in methanol (DCI: 10 mM in DMSO; PMSF: 200 mM in ethanol or isopropanol)
                   benzamidine (1 mM)                                   serine                  100 mM
                   leupeptin (10 µg/ml)                                 serine/cysteine         1 mg/ml
                   pepstatin (10 µg/ml)                                 aspartic                5 mg/ml in methanol
                   aprotinin (Trasylol) (1 µg/ml)                       serine                  0.1 mg/ml
                   EDTA or EGTAᶜ (1 mM)                                 metallo                 100 mM
Plant tissues      AEBSF (0.2 mM) (or DCI [0.1 mM] or PMSF [0.2 mM])    serine                  20 mM in methanol (DCI: 10 mM in DMSO; PMSF: 200 mM in ethanol or isopropanol)
                   chymostatin (20 µg/ml)                               serine/cysteine         1 mg/ml in DMSO
                   EDTA or EGTAᶜ (1 mM)                                 metallo                 100 mM
Yeasts and fungi   AEBSF (0.2 mM) (or DCI [0.1 mM] or PMSF [0.2 mM])    serine                  20 mM in methanol (DCI: 10 mM in DMSO; PMSF: 200 mM in ethanol or isopropanol)
                   pepstatin (15 µg/ml)                                 aspartic                5 mg/ml in methanol
                   1,10-phenanthroline (5 mM)                           metallo                 1 M in ethanol
Bacteria           AEBSF (0.2 mM) (or DCI [0.1 mM] or PMSF [0.2 mM])    serine                  20 mM in methanol (DCI: 10 mM in DMSO; PMSF: 200 mM in ethanol or isopropanol)
                   EDTA or EGTAᶜ (1 mM)                                 metallo                 100 mM

Adapted from North (1989).

Abbreviations: (AEBSF) 4-(2-aminoethyl)-benzenesulfonylfluoride; (DCI) 3,4-dichloroisocoumarin; (DMSO) dimethylsulfoxide; (EDTA) ethylenediamine
tetraacetic acid; (EGTA) ethylene glycol bis(β-aminoethyl ether) N,N,N',N'-tetraacetic acid; (PMSF) phenylmethylsulfonyl fluoride.

Mr values of inhibitors: AEBSF (240); PMSF (174); DCI (215); EDTA (disodium salt, dihydrate) (372); benzamidine (hydrochloride) (157); leupeptin
(427); pepstatin (686); aprotinin (6500); chymostatin (605); 1,10-phenanthroline (198).

ᵃFor a review of proteolytic enzymes, see Neurath (1989) and Perlmann and Lorand (1970).

ᵇAqueous solution unless otherwise indicated.

ᶜAn efficient chelator of divalent metal cations other than Mg²⁺ (for which it has a 10"-fold lower affinity) (Gegenheimer 1990).
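
The working concentrations in Table 3.3 mix molar units (mM) with mass units (µg/ml). The short sketch below converts the mass-based entries to micromolar values using the Mr values given in the table footnote. The function and variable names are illustrative only and are not part of the original protocol.

```python
# Mr values (g/mol) taken from the footnote to Table 3.3.
MR = {"leupeptin": 427, "pepstatin": 686, "aprotinin": 6500, "chymostatin": 605}

# Working concentrations (ug/ml) taken from Table 3.3.
WORKING_UG_PER_ML = {"leupeptin": 10, "pepstatin": 10, "aprotinin": 1, "chymostatin": 20}


def ug_per_ml_to_um(conc_ug_per_ml, mr):
    """Convert a mass concentration in ug/ml to a molar concentration in uM."""
    # ug/ml is the same as mg/liter; mg/liter divided by g/mol gives mmol/liter,
    # and multiplying by 1000 converts mmol/liter to umol/liter.
    return conc_ug_per_ml / mr * 1000.0


if __name__ == "__main__":
    for name, conc in WORKING_UG_PER_ML.items():
        print(f"{name}: {conc} ug/ml is about {ug_per_ml_to_um(conc, MR[name]):.2f} uM")
```

Expressing every inhibitor in the same molar units makes it easier to compare a cocktail assembled from this table with recipes that are quoted in µM.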


TABLE 3.4. Preparation of a general protease inhibitor cocktail

                 Trial working    Stock (100x)
Inhibitor        concentration    concentration    Recipe                           Useᵃ    Target protease type
AEBSF            0.2 mM           20 mM            238.5 mg/10 ml H₂O (100 mM)      4 ml    serine proteases
EDTA             1 mM             100 mM           19 mg/100 ml H₂O (0.5 M)         4 ml    metallo proteases
Leupeptin <!>    20 µM            2 mM             17 mg/2 ml H₂O (20 mM)           2 ml    cysteine/serine proteases
Pepstatin <!>    1 µM             100 µM           0.86 mg/10 ml methanol (1 mM)    2 ml    aspartic proteases

Adapted from Calbiochem Technical Bulletin C.B0578-1294.
ᵃMix the inhibitor solutions and bring to a final volume of 20 ml with H₂O or appropriate aqueous buffer. The resulting solution, 20 ml of a stock 100x
protease inhibitor cocktail, can be aliquoted into microfuge tubes and stored at -20°C until required.
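
The arithmetic behind Table 3.4 can be checked with a short calculation. The sketch below is an illustration only: it takes the Mr values from the footnote to Table 3.3 and the molarities, volumes, and aliquots from Table 3.4, and the function names are hypothetical. It computes the mass needed for each intermediate solution and the concentration reached in the 20-ml 100x cocktail. With these inputs the computed masses for EDTA (about 18.6 g) and pepstatin (about 6.9 mg) differ from the figures in the Recipe column, so those two entries are worth confirming against the supplier's bulletin before use.

```python
# Mr values (g/mol) from the footnote to Table 3.3.
MR = {"AEBSF": 240, "EDTA": 372, "leupeptin": 427, "pepstatin": 686}

# (inhibitor, intermediate solution in mM, volume prepared in ml, aliquot used in ml)
RECIPE = [
    ("AEBSF", 100.0, 10.0, 4.0),
    ("EDTA", 500.0, 100.0, 4.0),
    ("leupeptin", 20.0, 2.0, 2.0),
    ("pepstatin", 1.0, 10.0, 2.0),
]

FINAL_VOLUME_ML = 20.0  # final volume of the 100x cocktail (Table 3.4 footnote)


def mass_mg(molarity_mm, volume_ml, mr):
    """Mass of solute (mg) needed for volume_ml of a molarity_mm (mM) solution."""
    # mM x ml = umol; umol x g/mol = ug; divide by 1000 to obtain mg.
    return molarity_mm * volume_ml * mr / 1000.0


def cocktail_mm(stock_mm, aliquot_ml, final_ml=FINAL_VOLUME_ML):
    """Concentration (mM) in the 100x cocktail after diluting the aliquot."""
    return stock_mm * aliquot_ml / final_ml


if __name__ == "__main__":
    for name, stock_mm, vol_ml, aliquot_ml in RECIPE:
        m = mass_mg(stock_mm, vol_ml, MR[name])
        c100 = cocktail_mm(stock_mm, aliquot_ml)
        # The working (1x) concentration is 1/100 of the cocktail concentration.
        print(f"{name}: weigh {m:.2f} mg for {vol_ml} ml; "
              f"100x cocktail = {c100} mM; working = {c100 * 10:.1f} uM")
```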
» Cells and Tissues

Appropriate animal tissue
Working with human tissue presents a unique set of hazards. <!>

» Equipment
Centrifuge
Cheesecloth and filter funnel
Knife or meat grinder
Homogenizer (see note to Step 2)
Power-driven Potter-Elvehjem glass-Teflon homogenizer
Clearance range: 0.05-0.6 mm (see Figure 3.1).
Dounce hand homogenizer
Loose fitting: 0.1-0.3-mm clearance. Tight fitting: 0.05-0.08-mm clearance (see Figure 3.1).
Waring Blendor
A mechanical shear homogenizer that uses rotating metal blades or teeth to disrupt the material.
There are many variations of the traditional domestic food liquidizer in which the material is
placed in a glass reservoir with the blades driven by a motor beneath it. Other models resemble
the modern hand-held blenders in which the motor is overhead. The Waring Blendor is typically
used to macerate large amounts (100-1000 ml) of hard animal tissue and plant tissue. For
smaller volumes (~1-5 ml), the Ultra-Turrax and its successor the Polytron homogenizer are
widely used (see Figure 3.1).
METHOD 


IMPORTANT: Carry out all procedures at 0-4°C.


1. After the tissue is excised from the animal, trim away and discard fat and connective
tissue. Place the tissue in cold Homogenization buffer A.


2. Dice the washed tissue into small pieces (i.e., 1-cm cubes) with a knife or, alternatively,
pass the tissue through a meat grinder twice.


Tissues such as liver, brain, kidney, and heart are readily homogenized in a Waring Blendor,
but tissues such as skeletal muscle and lung are tougher, and it is advisable to grind them in
a domestic meat grinder prior to homogenization. Very fibrous tissues such as mammary
glands must be frozen prior to homogenization to facilitate disruption (the Ultra-Turrax
homogenizer is widely used for this purpose). Cultured mammalian cells and small amounts
(1-5 g) of soft tissue such as brain can be homogenized conveniently using a Dounce hand
homogenizer (Dignam 1990). In all cases, prechill the homogenizers and glassware to 4°C,
and work in a cold room while using the blender.


3. Add 3-4 volumes of Homogenization buffer B per volume of tissue, and transfer the mix- 
ture to the homogenizer. 


4. Prepare the homogenate using one of the following methods: 


Potter-Elvehjem homogenizer: Homogenize the tissue with the apparatus set at 500-1500 
rpm, allowing 5-10 seconds per stroke. 


Dounce hand homogenizer: Homogenize the tissue with 10-20 strokes of the pestle.


Waring Blendor: Homogenize the tissue three to four times for 20-30 seconds each (no
longer), pausing for 10-15 seconds between each homogenization.

5. Pour the homogenate into a glass beaker, place the beaker on ice, and stir the homogenate
gently for 30-60 minutes at 4°C to allow further extraction of proteins.


EXPERIMENTAL TIP: Do not allow the homogenate to foam. 


6. Remove cell debris and other particulate matter from the homogenate by centrifugation
at 10,000g for 10-20 minutes at 4°C.


7. Filter the supernatant through two layers of cheesecloth (or a plug of glass wool) in a fil- 
ter funnel to remove any fatty material that has floated to the surface. Carefully squeeze 
the cloth to obtain the maximum amount of filtrate (referred to as the “crude extract"). 


8. Proceed with the appropriate fractionation or analysis strategy as quickly as possible (see
Chapter 1).
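
The centrifugation steps in this protocol are given as relative centrifugal force (g), whereas many centrifuges are set in rpm. The sketch below applies the standard relation RCF = 1.118 x 10^-5 x r x rpm^2, with r in centimeters. The 10-cm rotor radius is a purely hypothetical example and must be replaced with the radius of the rotor actually in use; the function names are illustrative.

```python
import math

RCF_CONSTANT = 1.118e-5  # standard factor for radius in cm and speed in rpm


def rcf_from_rpm(rpm, radius_cm):
    """Relative centrifugal force (x g) at a given speed and rotor radius."""
    return RCF_CONSTANT * radius_cm * rpm ** 2


def rpm_from_rcf(rcf, radius_cm):
    """Rotor speed (rpm) needed to reach a given relative centrifugal force."""
    return math.sqrt(rcf / (RCF_CONSTANT * radius_cm))


if __name__ == "__main__":
    radius_cm = 10.0  # hypothetical rotor radius; check the rotor manual
    for rcf in (10_000, 20_000):
        print(f"{rcf} g at r = {radius_cm} cm needs about {rpm_from_rcf(rcf, radius_cm):.0f} rpm")
```

Because the force scales with the square of the speed, doubling the rpm quadruples the g force, so a setting should be recomputed whenever the rotor is changed.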


ADDITIONAL PROTOCOL: REMOVAL OF MUCIN FROM TISSUE HOMOGENATES


The presence of mucin in bulk biological extracts (e.g., colonic mucosa) complicates subsequent
purification because the mucin binds to most chromatographic supports, causing blockage of the
column. Mucin can be selectively removed from tissue homogenates using this protocol, which was
developed for the isolation and characterization of novel growth factors in colonic mucosa (Nice et
al. 1991).


Additional Materials 


CAUTION: See Appendix 3 for appropriate handling of materials marked with <!>.


Ammonium carbonate buffer (7.85 g/liter, pH 9.2, ice-cold) containing the proteolytic inhibitors 
pepstatin (3 mg/liter) and PMSF (22 mg/liter) <!> 


Method 


1. Homogenize mucosa scraped from the descending colon, using an Ultra-Turrax homogenizer,
in 5 volumes of ice-cold carbonate buffer (pH 9.2) containing pepstatin and PMSF.


2. Remove the tissue debris by centrifugation at 10,000g for 20 minutes. 


3. Slowly adjust the pH of the resulting supernatant to 4.5 (>2 hours) by the dropwise addition
of 1 M HCl (~100 ml required) with continuous stirring at 4°C.

This results in a flocculent precipitate of mucin (Glass 1964).


4. Remove the precipitate by centrifugation at 10,000g for 20 minutes at 4°C. 
The supernatant is the crude extract, which can be used as starting material in the desired purifi- 
cation strategy. 


PROTOCOL 2 


Lysis of Cultured Cells for Immunoprecipitation 


CELL LYSIS WITH MILD DETERGENT IS COMMONLY USED with cultured animal cells. If low detergent
concentrations are sufficient to cause cell lysis (e.g., 1% NP-40 or 1% Triton X-100), this
method may be more gentle to the protein of interest than the homogenization methods
discussed in Protocol 1. The choice of detergent must be tailored to the nature of the epitope
recognized by the immunoprecipitating antibody. If the antibody recognizes a linear peptide
epitope (e.g., a synthetic peptide), then use a harsh denaturing lysis buffer (e.g., RIPA buffer).
On the other hand, if the antibody is directed toward a conformational epitope, use NP-40
lysis buffer (or 1% Triton X-100) (for lysis buffer details, see Table 3.5). This protocol was
contributed by Hong Ji (Joint ProteomicS Laboratory of the Ludwig Institute for Cancer
Research, and Walter and Eliza Hall Institute of Medical Research, Melbourne, Australia).


MATERIALS 


CAUTION: See Appendix 3 for appropriate handling of materials marked with <!>.


» Reagents 


Lysis buffer 
Good first choices are NP-40 lysis buffer (Triton X-100 may be substituted for NP-40) and RIPA
lysis buffer. For additional information on choosing a lysis buffer, see Table 3.5. Chill the lysis buffer
to 4°C prior to use.


TABLE 3.5. Commonly used lysis buffers for lysing cultured cells

Buffer                          Comments

NP-40 lysis                     Probably the most widely used lysis buffer. It relies on the nonionic
  150 mM NaCl                   detergent NP-40 as the major solubilizing agent, which can be
  1.0% NP-40                    replaced by Triton X-100 with similar results.
  50 mM Tris-Cl (pH 7.4)        Variations include lowering the detergent concentration or using
                                alternate detergents such as digitonin, saponin, or CHAPS.

RIPA lysis                      A much harsher denaturing lysis buffer than NP-40, due to the
  150 mM NaCl                   inclusion of two ionic detergents (SDS and sodium deoxycholate). In
  1% NP-40                      addition to releasing most proteins from cultured cells, RIPA lysis
  0.5% sodium deoxycholate      buffer disrupts most weak noncovalent protein-protein interactions.
  0.1% SDS <!>
  50 mM Tris-Cl (pH 7.4)

When studying the modification of proteins by phosphorylation, phosphatase inhibitors (e.g., 25 mM NaF, 40 mM β-glycerol
phosphate, 100 µM Na₃VO₄, or 1 µM microcystin) should be included. If proteolytic degradation of the target protein is a problem,
protease inhibitors should be included in the lysis buffer (some commonly used inhibitors include aprotinin [1 µg/ml],
leupeptin [1 µg/ml], pepstatin [1 µg/ml], and PMSF [50 µg/ml]). Alternatively, commercially available protease cocktail tablets
(e.g., from Boehringer) can be included. Also see Tables 3.3 and 3.4.

Phosphate-buffered saline (PBS)
8.0 g of NaCl (final concentration 137 mM)
0.2 g of KCl (final concentration 2.7 mM) <!>
1.44 g of Na₂HPO₄ (final concentration 10.1 mM) <!>
0.24 g of KH₂PO₄ (final concentration 1.8 mM) <!>
in 800 ml of H₂O
Adjust pH to 7.4, and then adjust volume to 1 liter with H₂O.


» Cells

Suspension or monolayer cell cultures


» Equipment

Centrifuge


METHODS 


Method A: Lysing Cells Grown as Monolayer Cultures 


1. Discard the culture medium, and wash the cells twice with ice-cold PBS.

2. Place the culture dishes on ice.

3. Add 1.0 ml of lysis buffer (chilled to 4°C) per 100-mm dish. For culture dishes of other
sizes, adjust the volume of lysis buffer accordingly.

4. Incubate the cells for 10-30 minutes (depending on the cell line being studied) on ice with
occasional rocking of the dishes.

5. Tilt a dish on the bed of ice and allow the buffer to drain to one side; remove the lysate
with a pipette and transfer it to a microfuge tube or other suitable centrifuge tube. Repeat
with all of the remaining dishes.

Although some researchers prefer to scrape the cells from the tissue-culture dish, this does
cause some stress to the cells and is only required in unusual cases.

6. Centrifuge the lysate at 20,000g for 10 minutes at 4°C.

7. Carefully remove the supernatant to a fresh tube, making sure not to disturb the pellet.
Store the lysate on ice until it is needed for the preclearing and immunoprecipitation (see
Harlow and Lane 1999).

The cell lysate can be snap-frozen using a dry ice/ethanol mixture and then stored at -70°C
for long-term storage. However, for the analysis of protein complexes by immunoprecipitation,
the use of a freshly prepared cell lysate is recommended.


Method B: Lysing Cells Grown in Suspension


1. Harvest the cells by centrifugation at 480g for 10 minutes. Pour off the supernatant and
discard.

2. Carefully wash the cell pellet twice with ice-cold PBS, and then place the washed cell pellet
on ice.

3. Resuspend the pellet in 1.0 ml of lysis buffer (chilled to 4°C) per 1 x 10 to 5 x 10 cells.

4. Incubate the cells for 15 minutes on ice with occasional vortexing of the tube.

5. Centrifuge the lysate at 20,000g for 10 minutes at 4°C.

6. Carefully remove the supernatant to a fresh tube, making sure not to disturb the pellet.
Store the lysate on ice until it is needed for the preclearing and immunoprecipitation (see
Harlow and Lane 1999).

The cell lysate can be snap-frozen using a dry ice/ethanol mixture and then stored at -70°C
for long-term storage. However, for the analysis of protein complexes by immunoprecipitation,
the use of a freshly prepared cell lysate is recommended.


PROTOCOL 3 


Lysis of Cultured Animal Cells, Yeast, and Bacteria for 
Immunoblotting 


CELL LYSIS WITH DETERGENTS IS COMMONLY USED with cultured animal cells. Typically, the
ionic detergent SDS (e.g., 2% SDS) is sufficient for lysing cells for the purpose of immunoblotting
studies. Both cultured animal cells and bacteria such as E. coli may be lysed in this
manner. If the antigenic determinant recognized by the antibody being studied is dependent
on the native spatial conformation and sensitive to reducing conditions, then dithiothreitol
should be omitted from the lysis buffer and nonreducing/non-urea gels may need to be
employed (Ji et al. 1997; Ji and Simpson 1999). This protocol was contributed by Hong Ji
(Joint ProteomicS Laboratory of the Ludwig Institute for Cancer Research, and Walter and
Eliza Hall Institute of Medical Research, Melbourne, Australia).


MATERIALS 


CAUTION: See Appendix 3 for appropriate handling of materials marked with <!>.


» Reagents


Dithiothreitol (DTT) (1 M) (10x stock solution) <!>
Store DTT in aliquots at -20°C.


1x Laemmli sample buffer
2% SDS <!>
10% glycerol
60 mM Tris-Cl (pH 6.8)
0.01% bromophenol blue <!>
It is often convenient to prepare Laemmli sample buffer as a 2x or 5x stock. Just prior to use, add
DTT to a final concentration of 100 mM.
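
As a worked example of the dilution described above, the following sketch calculates how much concentrated Laemmli stock (2x or 5x), 1 M DTT, and water are needed to make a given volume of 1x sample buffer containing 100 mM DTT. It is a minimal illustration: the function name and the 1000-µl batch size are assumptions, not part of the original protocol.

```python
def laemmli_working_mix(final_ul, stock_strength, dtt_stock_mm=1000.0, dtt_final_mm=100.0):
    """Volumes (ul) of concentrated buffer, DTT stock, and water for 1x buffer."""
    buffer_ul = final_ul / stock_strength            # e.g., 1/5 of the volume for a 5x stock
    dtt_ul = final_ul * dtt_final_mm / dtt_stock_mm  # C1 x V1 = C2 x V2
    water_ul = final_ul - buffer_ul - dtt_ul
    if water_ul < 0:
        raise ValueError("stock too dilute to reach 1x with this amount of DTT")
    return {"buffer_ul": buffer_ul, "dtt_ul": dtt_ul, "water_ul": water_ul}


if __name__ == "__main__":
    for strength in (2, 5):
        print(f"{strength}x stock:", laemmli_working_mix(1000, strength))
```

The same proportions apply when the concentrated stock is mixed directly with an aqueous sample: the sample simply takes the place of some or all of the water.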


» Cells
Suspension or monolayer cell cultures, bacteria, or yeast cells
10? tissue culture cells is ~1 ml or ~1 g.


» Equipment
Centrifuge
Water bath or heating block preset to 95°C


» Additional Reagents


Step 6 of this protocol requires the reagents listed in Chapter 2, Protocol 1.

METHOD


1. Add 1 ml of Laemmli sample buffer containing 100 mM DTT to 1 x 10? to 5 x 10 cells.

2. The cell lysate becomes highly viscous in Laemmli buffer due to the presence of released
DNA. Two options are available for surmounting this viscosity problem.

* Centrifuge the lysate at 100,000g for 20 minutes to remove DNA.

* Lyse the cells by sonicating the mixture using four bursts of 15-30 seconds each.
Transfer the samples to ice for 15 seconds between each sonication step.

3. Heat the collected supernatant or sonicated sample for 5 minutes at 95°C.

4. Centrifuge at 20,000g for 5 minutes.

5. Transfer the supernatant to a fresh tube.

6. The samples (supernatant) are now ready for electrophoresis (see Chapter 2, Protocol 1)
and immunoblotting (see Harlow and Lane 1999). When preparing cell culture extracts
for immunoblotting, the protein sample must be

* in a solution that is compatible with the gel electrophoresis system (e.g., the pH of the
solution should be ~7.0 and the salt concentration ~200 mM) and

* at a protein concentration that does not exceed the loading capacity for a particular
gel system (for a discussion of gel electrophoresis variables, see Chapter 2). As a rule
of thumb, do not load >150 µg of total protein per lane of a minigel.


PROTOCOL 4 


Disruption of Cultured Cells by Nitrogen Cavitation 




CELL DISRUPTION BY NITROGEN DECOMPRESSION from a pressurized vessel is a rapid and
effective way to homogenize cells and tissues, to release intact organelles, and to prepare cell
membranes (Hunter and Commerford 1961). The principle of the method is simple: Cells
are placed in a pressure vessel, and large quantities of oxygen-free nitrogen are first dissolved
in the cells under high pressure (~5500 kPa, which is equivalent to 800 psi). When the gas
pressure is suddenly released, the nitrogen comes out of solution as bubbles that expand and
stretch the cell membrane, rupturing it and releasing the contents of the cell. Nitrogen cavitation
is well suited for mammalian and plant cells and fragile bacteria (i.e., bacteria treated
to weaken the cell wall), but it is less effective at lysing yeast, other fungi, spores, or other
cell types with tough cell walls. Features of the nitrogen cavitation method include the following:

* It is a gentle method for homogenizing or fractionating cells because the chemical and
physical stresses that it imposes upon enzymes and subcellular compartments are minimized
compared with ultrasonic and mechanical homogenizing methods. For example,
functional intact nuclei and mitochondria can be released from most cell types.

* Unlike many cell lysis methods relying on shear stresses and friction, no heat is generated
with nitrogen cavitation because this method is accompanied by an adiabatic expansion
that cools the sample instead of heating it. Hence, there is no heat damage to proteins
and organelles.

* Any labile cell components are protected from oxidation by the use of an inert gas, nitrogen.
Furthermore, nitrogen does not alter the pH of the suspending medium.

* The process is fast and uniform because the same disruptive forces are applied within each
cell and throughout the sample, ensuring reproducible cell-free homogenates.

* Variable sample sizes (e.g., from ~1 ml to 1 liter or more) can be accommodated with most
commercial systems.


This protocol, adapted from the Kontes Glass Company "instructions for users," is
designed for small volumes (1-15 ml) of tissue culture cells using the Kontes Mini-Bomb cell
disruption chamber (see Figure 3.2).

FIGURE 3.2. Nitrogen cavitation vessel. (a) Disassembled parts of the homogenizer; (b) diagrammatic
cross-section. The version shown in a is manufactured by Baskervilles (Manchester, U.K.). Artisan Industries
(Waltham, Massachusetts) makes a similar version that uses the pressure gauge on the gas cylinder to monitor
the pressure. (Reprinted, with permission, from Evans 1992 [©Oxford University Press], as in Graham
and Rickwood 1997.)


MATERIALS


CAUTION: See Appendix 3 for appropriate handling of materials marked with <!>.


» Reagents


Homogenization medium 
0.25 M sucrose 
20 mM Tris-Cl (pH 7.4) 


For alternative homogenization buffers, see Protocol 1. For nitrogen cavitation, choose any buffer
compatible with subsequent purification steps. Isotonic solutions are suitable for most applications.
Hypertonic solutions tend to stabilize organelles. Low concentrations of MgCl₂, magnesium acetate,
or CaCl₂ have been reported to stabilize nuclei.


Phosphate-buffered saline
8.0 g of NaCl (final concentration 137 mM)
0.2 g of KCl (final concentration 2.7 mM) <!>
1.44 g of Na₂HPO₄ (final concentration 10.1 mM) <!>
0.24 g of KH₂PO₄ (final concentration 1.8 mM) <!>
in 800 ml of H₂O


Adjust pH to 7.4, and then adjust volume to 1 liter with H₂O.
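
The final concentrations quoted in the PBS recipe follow directly from the masses weighed out per liter. The sketch below reproduces that calculation. The anhydrous formula weights are standard values supplied here as an assumption (they are not given in the protocol), and the function name is illustrative.

```python
# Anhydrous formula weights (g/mol); standard values, not given in the protocol.
FORMULA_WEIGHT = {"NaCl": 58.44, "KCl": 74.55, "Na2HPO4": 141.96, "KH2PO4": 136.09}

# Masses (g) dissolved per liter of final solution, from the recipe.
GRAMS_PER_LITER = {"NaCl": 8.0, "KCl": 0.2, "Na2HPO4": 1.44, "KH2PO4": 0.24}


def molarity_mm(grams, formula_weight, volume_liters=1.0):
    """Concentration in mM for a given mass of solute in a given final volume."""
    return grams / formula_weight / volume_liters * 1000.0


if __name__ == "__main__":
    for salt, grams in GRAMS_PER_LITER.items():
        print(f"{salt}: {molarity_mm(grams, FORMULA_WEIGHT[salt]):.2f} mM")
```

If a hydrated salt is used instead of the anhydrous form, its formula weight is higher, and the mass weighed out must be increased in proportion to reach the same molarity.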

» Cells

Cultured animal cells or animal tissue


» Equipment

Cell disruption chamber
Commercially available cell disruption chambers (also called nitrogen pressure vessels) include:
* Mini-Bomb cell disruption chamber (Kontes Glass Company, Vineland, New Jersey; www.kimble-kontes.com).
* Parr Cell Disruption Bomb (Parr Instrument Company, Moline, Illinois; www.parrinst.com).
* Baskervilles (Manchester, U.K.) or Artisan Industries (Waltham, Massachusetts).

Magnetic stirrer
Nitrogen (oxygen-free) <!>
Rubber policeman


METHOD


CAUTION: Because high pressures are generated, carry out this procedure behind a shield.


1. Ensure that the outlet port is closed (do not overtighten), and cool the cell disruption
chamber (if necessary) by immersing the unit in an ice bath.


2. Prepare the cells or tissue for disruption.


For adherent cultured cells: Wash the cells gently two or three times with phosphate-buffered
saline (PBS). Use a rubber cell scraper to scrape the cells from the dish into
PBS.


For nonadherent cells: Collect the cells by centrifugation at 400g for 10 minutes. Wash the
pellet of cells two or three times with PBS.


For animal tissue: Mince the tissue using a mechanical homogenizer or by passing through
a screen or sieve.


3. Collect the cells by centrifugation at 480g for 5 minutes at 4°C.


4. Discard the supernatant and resuspend the cells in 10 volumes (~15 ml) of homogenization
medium. Transfer the suspension to a plastic beaker (containing a magnetic stir bar)
and gently mix to obtain a uniform suspension.


5. Remove the filter holder from the chilled cell disruption chamber by using a turning and
lifting motion.


6. Add up to 15 ml of cell suspension to the cell disruption chamber.


For larger volumes, an extension is available for the Mini-Bomb. Alternatively, other nitrogen
pressure vessels can be used.


7. Replace the filter holder.


8. Attach the unit to the source of oxygen-free nitrogen by screwing on the cap of the pressure
vessel, which should already be connected to the source of nitrogen. HAND TIGHTENING
is all that is required.


9. Replace the pressure vessel in the cooling bath if desired.


10. Pressurize the bomb to the desired pressure by allowing nitrogen to flow from the cylinder
into the disruption chamber.


There must be a gauge on the filling connection that shows the pressure inside the chamber. 


The amount of pressure required depends on the type of cells being disrupted. Typical pres- 
sures used include: 


* 500 psi for KB cells
* 500 psi for rat liver
* 1000 psi for chicken red blood cells


Step 10 must be optimized for each application. In general, lower pressures lead to a greater 
recovery of intact organelles, whereas higher pressures can result in total homogenization. 
Cells that are difficult to lyse can be treated more than once for more complete homoge- 
nization. For additional information on effective gas pressures to use to disrupt various cells, 
see Table 3.6. 
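
Pressures in this protocol are quoted in psi, whereas many gauges and regulators are graduated in kPa. The following sketch converts the pressures mentioned in this protocol using the standard factor of about 6.895 kPa per psi; the function name is illustrative and not part of the original protocol.

```python
KPA_PER_PSI = 6.894757  # standard conversion factor


def psi_to_kpa(psi):
    """Convert a gauge pressure from pounds per square inch to kilopascals."""
    return psi * KPA_PER_PSI


if __name__ == "__main__":
    # Pressures quoted in this protocol and in Table 3.6.
    for psi in (250, 500, 800, 1000, 1500):
        print(f"{psi} psi is about {psi_to_kpa(psi):.0f} kPa")
```

The 800-psi figure converts to roughly 5500 kPa, in agreement with the value quoted in the introduction to this protocol.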


CAUTION: Inexperienced workers handling high-pressure gases must always seek expert help 
before using a gas cylinder. 


11. Allow the pressurized apparatus to equilibrate for at least 30 minutes to allow the nitrogen
to dissolve and come to equilibrium within the cells. Gentle agitation once or twice
during this time period with a magnetic stirrer or periodic gentle shaking keeps the cells
in a uniform suspension.


12. Place a collection container (e.g., beaker) in an ice bath by the outlet port. 


TABLE 3.6. Suggested working parameters for disruption of cells using nitrogen cavitation 


Cell type   Cell suspension   psi   No. cells used/ml   No. times through   No. cells remaining/ml   No. nuclei remaining/ml   % cells totally lysed
Cultured cells KB 500 3.4 x HE 1 2.8 X 1 n.d. 93 
KB 500 2 2.0 x 10 n.d, 99.95 
KB 250 3.3 x 105 l 5,5 x 10° 4.62 x 10 69.4 
KB 250 2 5.4 x 10t 2.2 x 10 97,7 
KB 250 3 0 0 100 
KB 0 1.8 x HF a 1.08 x 10* 7.6 x 10^ - 
KB 250 1 24X16 6.4 x 10* 52.2 
KB 250 2 3.4 x 104 5.0 x 104 95.4 
KB 250 3 1.6 x 10* 4.6 x 101 96.6 
Tissue rat liver 500 9,2 x 10? 1 8.5 x 105 nd, 7.8 
500 2 [t 100 
Blood chicken red 1000 5.6 x 10? 1 9.6 x 105 nd, 83 
blood cells 
Bacteria E. coli 1500 1.3x 104 L 1.2 x 10” n.d. 8 
1500 2 6.6 x 10? n.d. 50 
E. coli 1500 6.5 x 10° ] 3.6 x 108 n.d. 45 
1500 2 33x 10 n.d. 50 


These values were determined with the Mini-Bomb cell disruption chamber. (KB cells) Human oral epidermoid carcinoma. n.d. indicates not determined.
(Reprinted, with permission, from Kimble/Kontes [www.kimble-kontes.com/html/pg-881455.htm].)

13. With the pressurization unit still activated, slowly open the outlet port until the cell suspension
begins to flow into the chilled collection container. Make sure that the pressure
is maintained until all of the suspension is out.


A reasonable flow rate is 3-10 drops per second. An indication that the unit is almost empty
is the appearance of a "foamier" flow of fluid. The end of the flow is accompanied by a gentle
hissing.

Cell lysis occurs upon release of pressure, but passage through the narrow orifice of the outlet
port often aids cell disruption.


CAUTION: The homogenate is expelled with great force, so this operation must be carried 
out very carefully, particularly if the sample contains hazardous materials. Because of the 
potential for creating an aerosol, this procedure must always be performed in a safety hood. 


14. Gently stir the "foam" to allow it to subside before any centrifugation or subsequent workup
of the homogenate is carried out.


15. Close the outlet port. Do not overtighten it.


16. Turn off the flow of nitrogen at its source.


17. Discharge the gas in the unit by opening the outlet port and venting the unit. Once the
pressure inside the bomb has returned to atmospheric pressure, the chamber can be
opened for recovery of any remaining sample and thorough cleaning.


CAUTION: If the lysed cell material is potentially hazardous, vent the gas through a suitable 
trap. 


PROTOCOL 5 


Small-scale Extraction of Recombinant Proteins
from Bacteria 


BACTERIA ARE PARTICULARLY CONVENIENT FOR PRODUCING RECOMBINANT PROTEINS for purification
purposes. To monitor induction as well as the levels of recombinant protein expression,
it is important to have a rapid, simple method for estimating bacterial protein expression.
This protocol describes the preparation of small-scale bacterial extracts using cell lysis with
0.5% Triton X-100.


MATERIALS 


CAUTION: See Appendix 3 for appropriate handling of materials marked with <!>. 


» Reagents

Dithiothreitol (DTT) (1 M) (10x stock solution) <!>
Store DTT in aliquots at -20°C.

1x Laemmli sample buffer
2% SDS <!>
10% glycerol
60 mM Tris-Cl (pH 6.8)
0.01% bromophenol blue <!>
It is often convenient to prepare Laemmli sample buffer as a 2x or 5x stock. Just prior to use, add
DTT to a final concentration of 100 mM.


Triton X-100 (0.5% v/v) <!>
Cool to 4°C before use.


» Cells


Bacterial cells expressing target recombinant protein 


» Equipment 
Boiling-water bath 
Sonicator <!> 


» Additional Reagents 
Step 5 of this protocol requires the reagents listed in Chapter 2, Protocol 1. 

METHOD


1. Harvest the bacterial cells from 1 ml of culture by centrifugation in a microfuge for 1
minute at room temperature.


2. Pour off the supernatant and resuspend the bacterial cell pellet in 1 ml of chilled aqueous
0.5% (v/v) Triton X-100.


3. Sonicate the suspension for three cycles of 20 seconds each, cooling the cells on ice
between treatments.


4. Centrifuge the suspension in a microfuge for 1 minute.


5. Add Laemmli sample buffer containing 100 mM DTT to the supernatant, boil the mixture
for 2 minutes, and analyze for the target protein on analytical SDS-PAGE (see Chapter 2,
Protocol 1).


PROTOCOL 6 


Large-scale Extraction of Recombinant Proteins 
from Bacteria 


BACTERIA ARE PARTICULARLY CONVENIENT FOR PRODUCING RECOMBINANT PROTEINS for purification
purposes. Suitable extraction methods for bacterial cells include sonication, glass bead
milling, grinding with alumina or sand, high-pressure shearing using the French pressure cell
(French Press), and lysozyme treatment. These procedures are applicable for preparing
extracts from a variety of gram-negative bacteria such as E. coli and Klebsiella pneumoniae
and gram-positive bacteria such as Bacillus subtilis. Disruption of bacterial cells by enzymatic
means is commonly used because a relatively uniform treatment is obtained when cells are
in suspension. A protocol for enzymatic disruption of E. coli follows.


MATERIALS 


CAUTION: See Appendix 3 for appropriate handling of materials marked with <!>. 


» Reagents

Dithiothreitol (DTT) (1 M) (10x stock solution) <!>
Store DTT in aliquots at -20°C.

DNase I

1x Laemmli sample buffer
2% SDS <!>
10% glycerol
60 mM Tris-Cl (pH 6.8)
0.01% bromophenol blue <!>
It is often convenient to prepare Laemmli sample buffer as a 2x or 5x stock. Just prior to use, add
DTT to a final concentration of 100 mM.

Lysis buffer
50 mM Tris-Cl (pH 7.4)
25% (w/v) sucrose
EDTA can be included (to a final concentration of 10 mM) to minimize proteolytic degradation
due to metalloproteases. Although the inclusion of EDTA and protease inhibitors such as 1 mM
PMSF <!> (or AEBSF), 1 µg/ml aprotinin <!>, or 10 µg/ml leupeptin <!> in the lysis buffer may
not be necessary when the target protein is packaged in inclusion bodies, they may be important
in the solubilization/refolding steps (unfolded proteins are more susceptible to proteolytic
degradation) or when extracting soluble proteins.

Lysozyme (hen egg)

MgCl₂ (1 M) <!>

NP-40 (10% v/v)


» Cells 


Bacterial cells expressing target recombinant protein 


» Equipment


Centrifuge 
Waring Blendor (see Step 4) 


» Additional Reagents 


Step 9 of this protocol requires the reagents listed in Chapter 2, Protocol 1. 


METHOD 


1. Harvest the bacterial cells by centrifugation at 3000g for 15 minutes at 4°C.


2. Wash the cells with lysis buffer to remove the residual culture medium and harvest the
washed cells by centrifugation as in Step 1.


3. Pour off the supernatant and weigh the wet pellet.


4. Resuspend the washed E. coli cells in ~3 ml of lysis buffer per gram of cell pellet and stir
the suspension for 30 minutes at 4°C. If the pellet is not fully resuspended after 30 minutes,
mix the suspension in a Waring Blendor at low speed for ~1 minute.


5. Add lysozyme to a concentration of 0.1% (w/v) and incubate for 35 minutes at 4°C, shaking
gently.


A faster rate of lysis may be obtained by increasing the lysozyme concentration to 1.0%
(w/v) (10 mg/ml). Under these conditions, satisfactory lysis can be accomplished in as little
as 5 minutes at temperatures as low as 4°C (Bollag et al. 1996).


6. Add in the following order:


NP-40 to a final concentration of 0.5% (v/v)
MgCl₂ to a final concentration of 5 mM
DNase I to a final concentration of 40 µg/ml


Stir the suspension for 30 minutes at 4°C to remove the viscous nucleic acid. 


Bacterial extracts are 40-70% protein, 10-30% nucleic acid, 2-10% polysaccharide, and 
10-15% lipid (Worrall 1996). The release of DNA upon cell lysis often results in a highly vis- 
cous extract that can cause serious problems in subsequent chromatographic purification 
steps. In addition to DNase I treatment, DNA can be removed from the cell extract (along 
with other nucleic acids, and in some cases, highly acidic proteins) by the addition of a neu- 
tralized solution of positively charged compounds such as protamine sulfate (up to 5 mg/g 
wet weight of cell pellet) (Scopes 1994) or polyethyleneimine (Burgess and Jendrisak 1975).


EXPERIMENTAL TIP: Methods for DNA removal involving positively charged compounds 
should not be used with inclusion body preparations, since the precipitated DNA will cocen- 
trifuge with the inclusion bodies. 


7. Centrifuge the suspension at 23,000g for 30 minutes at 4°C.


8. Resuspend a small portion of the pellet in Laemmli buffer containing DTT.


9. Analyze aliquots of both the soluble protein fraction (supernatant) and pellet fraction for
the presence of target protein using analytical SDS-PAGE (see Chapter 2, Protocol 1).


If the bulk of the target protein is found in the insoluble pellet fraction, then inclusion bod- 
ies have likely formed, and the target protein will need to be solubilized and purified accord- 
ing to Protocol 7. If the target protein is found in the supernatant, this material should be 
stored at 4°C in readiness for the next purification protocol(s). 
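
The quantities used in Steps 4 to 6 scale with the wet weight of the cell pellet. The sketch below is a rough planning aid only: it uses the ratios and stock concentrations stated in this protocol (about 3 ml of lysis buffer per gram, 0.1% w/v lysozyme, 10% NP-40 stock, 1 M MgCl₂ stock) and a simple C1 x V1 = C2 x V2 approximation that ignores the volume of the pellet and of the added reagents. The function name and the 10-g example pellet are hypothetical. DNase I is omitted because the protocol does not specify a stock concentration.

```python
def lysis_plan(pellet_g, ml_per_g=3.0, lysozyme_pct=0.1,
               np40_stock_pct=10.0, np40_final_pct=0.5,
               mgcl2_stock_mm=1000.0, mgcl2_final_mm=5.0):
    """Approximate reagent amounts for enzymatic lysis of a wet cell pellet."""
    buffer_ml = pellet_g * ml_per_g
    # 1% (w/v) corresponds to 10 mg/ml, so 0.1% (w/v) is 1 mg/ml.
    lysozyme_mg = buffer_ml * lysozyme_pct * 10.0
    # Approximate dilutions; the small added volumes are neglected.
    np40_ml = buffer_ml * np40_final_pct / np40_stock_pct
    mgcl2_ml = buffer_ml * mgcl2_final_mm / mgcl2_stock_mm
    return {
        "buffer_ml": buffer_ml,
        "lysozyme_mg": lysozyme_mg,
        "np40_ml": np40_ml,
        "mgcl2_ml": mgcl2_ml,
    }


if __name__ == "__main__":
    # Hypothetical 10-g wet pellet.
    for item, amount in lysis_plan(10.0).items():
        print(f"{item}: {amount:.2f}")
```

For the faster lysis option noted under Step 5, passing lysozyme_pct=1.0 gives the mass needed for 10 mg/ml.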


PROTOCOL 7 


Solubilization of E. coli Recombinant Proteins 
from Inclusion Bodies 


Because molecular cloning techniques allow high levels of expression in bacteria, this 
is a particularly convenient system for producing recombinant proteins. Regrettably, these 
proteins are often difficult to purify due to their tendency to aggregate and precipitate 
within the bacteria to form insoluble inclusion bodies. The formation of inclusion bodies is 
especially common for nonbacterial proteins. Although no single method can be applied to every 
protein, a number of strategies are available to solubilize inclusion body proteins. One of 
these strategies is described in this protocol. A number of steps must be considered in 
solubilizing inclusion body proteins: 


* cell lysis 

* isolation of inclusion bodies 

* washing of inclusion bodies 

* solubilization of inclusion bodies 

* renaturation (if required) of recombinant protein 


Washing the inclusion bodies prior to solubilization is an important step for 
removing nontarget proteins, as well as nonproteinaceous material. For example, washing 
inclusion bodies with 1 M guanidine hydrochloride was used to remove contaminating proteins 
from recombinant IL-2 (Weir et al. 1987), and 4.0 M urea was used in the case of IL-6 
inclusion bodies (Zhang et al. 1992). It is important to remove nonproteinaceous (lipid-like) 
material in order to prevent the clogging of RP-HPLC columns and therefore prolong column 
lifetimes. This can be accomplished by extensive washing of inclusion bodies with 4.0 M 
urea (Zhang et al. 1992) or extraction of the inclusion bodies with butan-1-ol/10 mM EDTA 
(Weir et al. 1987). Whereas some target proteins will refold due to air oxidation (e.g., IL-6 
[Zhang et al. 1992]), the most common methods to induce proper refolding of proteins use 
extensive dilution and dialysis in the presence of thiol reagents (e.g., 1 mM DTT) to 
gradually reduce the concentration of denaturant and promote correct disulfide bond formation. 


MATERIALS 


CAUTION: See Appendix 3 for appropriate handling of materials marked with <!>. 


» Reagents 


Renaturation buffers 
See Step 9 for examples. 

Solubilization buffer (8 M guanidine hydrochloride in 50 mM Tris-Cl [pH 7.5]) <!> 


The ideal solubilization buffer for a particular protein may have to be determined empirically. See 
notes to Steps 8 and 9. 


Washing buffer 
100 mM Tris-Cl (pH 8.0) 
2-4.0 M urea <!> 
1% (v/v) Triton X-100 <!> 


» Cells 


Bacterial cells expressing the target recombinant protein 


» Equipment 
Centrifuge 


» Additional Reagents 


Step 1 of this protocol requires the reagents listed in Protocol 6 of this chapter. 
Step 7 of this protocol requires the reagents listed in Chapter 2, Protocol 1. 


METHOD 


1. Lyse the cells as described in Protocol 6. 
2. Centrifuge the cell lysate at 23,000g for 30 minutes at 4°C. 


3. Pour off the supernatant and measure the mass of the pellet (wet mass). 


4. Resuspend the pellet in ~10 volumes of washing buffer. Stir the suspension for 1 hour at 
room temperature. 


5. Centrifuge the mixture at 23,000g for 30 minutes at 4°C. 
6. Pour off the supernatant and recover the pellet. 


7. Repeat Steps 4-6 three more times (or until the recombinant protein is >80% pure, as 
judged by analytical SDS-PAGE as described in Chapter 2). 


8. Dissolve the pellet from Step 7 in ~8 ml of solubilization buffer per gram wet mass pellet 
(determined in Step 3), and gently stir the mixture for 16-20 hours at 4°C. 
EXPERIMENTAL TIP: Do not exceed a protein concentration of 2-3 mg/ml. 
Alternative solubilization buffers: 
* 8 M guanidine hydrochloride, 10 mM DTT, 50 mM Tris-Cl (pH 8.5). 


Use 15 ml per 12-15 g wet weight of harvested cells. Incubate the mixture for 1 hour at 
37°C (Weir et al. 1987). Renature the protein as described in Step 9a. 


* 8 M urea, 2 mM reduced glutathione/0.2 mM oxidized glutathione. 


Use 9 volumes of buffer per gram wet weight of inclusion body pellet. Incubate the mixture 
for 1 hour at room temperature. Do not exceed a protein concentration of 2.5 mg/ml 
(Bollag et al. 1996). Renature the protein as described in Step 9b. 

9. Refolding conditions for a specific target protein must be optimized in pilot-scale 
experiments. Examples of renaturation conditions include the following: 


a. Dilute the solubilized pellet 25-fold with 50 mM Tris-Cl (pH 8.5) containing 1.5 µM 
copper sulfate to give final concentrations of 0.04 M DTT, 0.24 M guanidine 
hydrochloride, and ~1.4 µg/ml target protein (Weir et al. 1987). Incubate the mixture 
for 2 hours at 20°C. 


b. Slowly add 9 volumes of 50 mM phosphate, 50 mM NaCl, and 1 mM EDTA containing 
2 mM reduced glutathione/0.2 mM oxidized glutathione to the solubilized pellet. 
Incubate the mixture for 2-4 hours at 25°C. 


10. Recover the folded protein using a concentration step that is appropriate for the target 
protein (e.g., RP-HPLC, lyophilization, or ultrafiltration). 
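
The buffer volumes in Steps 4-8 all scale with the wet mass recorded in Step 3, so they can be worked out before starting. The following is a minimal sketch, assuming that "1 volume" means 1 ml per gram of wet pellet (the protocol does not define it) and that the wash is run four times in total (Step 4 once, then three repeats). The function and key names are illustrative only.

```python
def inclusion_body_volumes(wet_mass_g: float,
                           wash_volumes: float = 10.0,
                           solubilization_ml_per_g: float = 8.0,
                           wash_rounds: int = 4) -> dict:
    """Buffer volumes (ml) for washing and solubilizing an inclusion body pellet.

    Assumes 1 volume == 1 ml per gram wet mass, which is an interpretation,
    not something the protocol states.
    """
    wash_per_round = wash_volumes * wet_mass_g
    return {
        "wash_ml_per_round": wash_per_round,
        "wash_ml_total": wash_per_round * wash_rounds,
        "solubilization_ml": solubilization_ml_per_g * wet_mass_g,
    }


def exceeds_protein_limit(protein_mg: float, volume_ml: float,
                          limit_mg_per_ml: float = 3.0) -> bool:
    """True if the solubilized protein is above the 2-3 mg/ml guideline."""
    return protein_mg / volume_ml > limit_mg_per_ml


if __name__ == "__main__":
    plan = inclusion_body_volumes(2.5)  # 2.5 g wet pellet, example value
    for name, ml in plan.items():
        print(f"{name}: {ml:.1f} ml")
```

If `exceeds_protein_limit` returns True for the estimated protein load, the solubilization volume should be increased rather than the stirring time.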


PROTOCOL 8 


Preparation of Extracts from Yeast 


Because yeast is exceptionally well suited to genetic analysis, both classical and 
molecular, it is an attractive system for expressing recombinant animal proteins for 
purification purposes. For a discussion of cell growth and harvesting of yeast, especially 
the budding yeast genus Saccharomyces, see Jazwinski (1990). Methods available for lysing 
yeast cells include autolysis, pressure cells (e.g., French pressure cell), abrasives (glass bead 
vortexing), and enzymatic lysis (e.g., zymolyase). One of the simplest methods, discussed in 
this protocol, involves the abrasive action of well-agitated glass beads. This is a very effective 
method both for low volumes (e.g., <1 ml using a microfuge tube) and for many liters using a 
specialized DynoMill apparatus. Cell breakage is typically >95%, as assessed by 
phase-contrast microscopy. 


MATERIALS 


CAUTION: See Appendix 3 for appropriate handling of materials marked with <!>. 


» Reagents 
Lysis buffer (RIPA buffer) 
150 mM NaCl 
1% NP-40 
0.5% sodium deoxycholate 
0.1% SDS <!> 
50 mM Tris-Cl (pH 7.2) 
Just prior to use, add protease inhibitors to the following final concentrations: 1 µg/ml aprotinin, 
1 µg/ml leupeptin, 1 µg/ml pepstatin, and 50 µg/ml PMSF. 
Phosphate-buffered saline (PBS) 
8.0 g of NaCl (final concentration 137 mM) 
0.2 g of KCl (final concentration 2.7 mM) <!> 
1.44 g of Na2HPO4 (final concentration 10.1 mM) <!> 
0.24 g of KH2PO4 (final concentration 1.8 mM) <!> 
in 800 ml of H2O 
Adjust pH to 7.4, and then adjust volume to 1 liter with H2O. 
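
The final concentrations quoted for PBS follow directly from the masses and the 1-liter final volume. As a quick check, the sketch below recomputes them. The molar masses are standard values supplied here, not taken from the manual, and the anhydrous form of Na2HPO4 is assumed.

```python
# Molar masses in g/mol (standard values; Na2HPO4 assumed anhydrous).
MOLAR_MASS = {
    "NaCl": 58.44,
    "KCl": 74.55,
    "Na2HPO4": 141.96,
    "KH2PO4": 136.09,
}

# Grams per liter of finished PBS, as listed in the recipe.
PBS_RECIPE_G_PER_L = {"NaCl": 8.0, "KCl": 0.2, "Na2HPO4": 1.44, "KH2PO4": 0.24}


def millimolar(grams: float, molar_mass: float, liters: float = 1.0) -> float:
    """Concentration in mM for a mass dissolved to a given final volume."""
    return grams / molar_mass / liters * 1000.0


def pbs_concentrations() -> dict:
    """Final concentration (mM) of each salt in 1 liter of PBS."""
    return {salt: millimolar(g, MOLAR_MASS[salt])
            for salt, g in PBS_RECIPE_G_PER_L.items()}


if __name__ == "__main__":
    for salt, mm in pbs_concentrations().items():
        print(f"{salt}: {mm:.1f} mM")
```

If a hydrated phosphate salt is used instead, the mass weighed out has to be raised in proportion to its higher molar mass.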


» Cells 


Yeast cells, from a freshly grown culture 


» Equipment 


Glass beads (500 µm), chilled 
Prepare the beads by washing twice in 1 N HCl <!> and twice in lysis buffer. Store the washed 
beads at 4°C in a small volume of lysis buffer. 


METHOD 


1. Harvest the cells by centrifugation in a microfuge for 3 minutes. 

2. Discard the supernatant, resuspend the cells in PBS, and centrifuge again for 3 minutes. 

3. Discard the supernatant. Estimate the volume of the cell pellet. Add 3 volumes of ice-cold 
lysis buffer per volume of cell pellet. Keep the suspension on ice. 

4. Add a volume of chilled glass beads equal to the total volume of the resuspended yeast 
cells. 

5. Vortex the suspension vigorously for 30 seconds. 

6. Repeat Step 5 until the bulk of the yeast cells are lysed, as determined by phase-contrast 
microscopy. 

7. Centrifuge the suspension in a microfuge for 5 minutes at 4°C. 

8. Carefully pour off the supernatant and transfer to a fresh tube. Keep it on ice until use. 
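
Steps 3 and 4 fix the lysis buffer and glass bead volumes relative to the estimated pellet volume. The sketch below works through that arithmetic, assuming the pellet and buffer volumes are simply additive when the cells are resuspended; the function name is illustrative only.

```python
def glass_bead_lysis_volumes(pellet_ml: float) -> dict:
    """Volumes (ml) for glass bead lysis of a yeast pellet.

    Lysis buffer is 3 volumes per volume of pellet (Step 3); beads equal the
    total volume of the resuspended cells (Step 4). Additive volumes are an
    assumption made for this sketch.
    """
    lysis_buffer_ml = 3.0 * pellet_ml
    suspension_ml = pellet_ml + lysis_buffer_ml
    return {
        "lysis_buffer_ml": lysis_buffer_ml,
        "suspension_ml": suspension_ml,
        "glass_beads_ml": suspension_ml,
    }


if __name__ == "__main__":
    for name, ml in glass_bead_lysis_volumes(0.2).items():
        print(f"{name}: {ml:.2f} ml")
```

For a 0.2-ml pellet the tube must therefore hold roughly twice the suspension volume, with headspace left for vortexing.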


PROTOCOL 9 


Differential Detergent Fractionation of Eukaryotic Cells 


Differential detergent fractionation (DDF) involves the sequential extraction of cells 
with PIPES buffers containing first digitonin, then Triton, and finally Tween/deoxycholate. 
The procedure yields four biochemically and electrophoretically distinct fractions (see Figure 
3.3 and Table 3.7) composed of the following: 


* Cytosolic proteins and extractable cytoskeletal elements: Microtubule components can be 
semipurified from this fraction by magnesium precipitation (see ADDITIONAL PROTOCOL: 
PRECIPITATION OF TUBULINS AND MAPS USING MAGNESIUM at the end of this protocol). 

* Membrane and organelle proteins: Integral membrane proteins can be enriched by using 
Triton X-114 (Bordier 1981; Pryde and Phillips 1986). 

* Nuclear membrane proteins and extractable nuclear proteins. 

* Detergent-resistant cytoskeletal filaments and nuclear matrix proteins: Cytoskeleton-associated 
proteins and interactions can be investigated further by differential urea extraction. 


[Figure 3.3 flowchart: Cell culture -> digitonin extraction -> cytosolic compartment (CYTO) 
+ cell residue -> Triton X-100 extraction -> membrane/organelle compartment (MO) + cell 
residue -> Tween/DOC extraction -> nuclear compartment (NUC) + cytoskeletal/matrix 
compartment (CSK/MAT)] 

FIGURE 3.3. Schematic for differential detergent fractionation (DDF). (Courtesy of Melinda L. Ramsby 
and Gregory S. Makowski, University of Connecticut Health Center.) 


TABLE 3.7. Protein distribution in detergent fractions of hepatocytes 

Cell fraction                        Protein distribution in hepatocytes 

Digitonin/EDTA extracts              * ~35% of total hepatocytic cellular protein 
(CYTO fraction)                      * enriched in cytosolic markers (90% LDH activity, 
                                       100% carbonic anhydrase immunoreactivity) 
                                       (Ramsby et al. 1994) 

Triton extracts                      * constitutes the bulk of hepatocytic cellular proteins 
(MO fraction)                          (~50% of total protein) 
                                     * enriched in markers for membrane and organelle 
                                       proteins (Ramsby et al. 1994) 

Tween/DOC                            * ~5% of total hepatocytic cell protein 
(NUC fraction)                       * contains, exclusively, immunoreactivity for the 
                                       nuclear protein p38 (Ramsby et al. 1994) 

Detergent-resistant fraction         * ~7-10% of hepatocytic cellular protein; enriched in 
(CSK/MAT fraction)                     intermediate filaments, actin, various cytoskeleton- 
                                       associating proteins, and nuclear matrix 
                                     * for monolayer cultures, this fraction also contains 
                                       extracellular matrix (Ramsby et al. 1994) 

Magnesium-precipitable fraction      * 1-2% of total hepatocytic cellular protein 
(tubulins and microtubule-           * represents ~4-5% of the protein in digitonin/EDTA 
associated proteins)                   extracts 
                                     * includes tubulins, actin, and proteins presumed to 
                                       interact with the cytoskeleton 

Hepatocytes have a high metabolic rate, which may affect the percent distributions of protein. Distributions will vary 
depending on cell type and should be determined empirically. Table courtesy of Melinda L. Ramsby and Gregory S. Makowski 
(University of Connecticut Health Center). 
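
The magnesium-precipitable fraction in Table 3.7 is quoted both as a share of the digitonin/EDTA extract and as a share of total cellular protein. The short sketch below shows how one figure converts to the other, using only the percentages given in the table; the function name is illustrative.

```python
def percent_of_total(parent_percent_of_total: float,
                     sub_percent_of_parent: float) -> float:
    """Convert a subfraction's share of its parent fraction to a share of total protein."""
    return parent_percent_of_total * sub_percent_of_parent / 100.0


if __name__ == "__main__":
    cyto_percent = 35.0  # CYTO fraction, percent of total protein (Table 3.7)
    for sub in (4.0, 5.0):  # magnesium-precipitable share of the CYTO extract
        print(f"{sub:.0f}% of CYTO = {percent_of_total(cyto_percent, sub):.2f}% of total")
```

Both results fall inside the 1-2% range listed for this fraction, so the two ways of stating the yield agree.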


EXAMPLES OF CELLULAR ANALYSIS USING DIFFERENTIAL DETERGENT FRACTIONATION 


DDF has been used to fractionate suspension-cultured hepatocytes with the goal of following 
intracellular redistribution of mobile cytoskeletal elements in response to nutritional 
deprivation as a function of autophagy (Ramsby 1989). Fractionation profiles suggest that 
DDF would also be useful for monitoring compartmental redistribution of heat shock proteins 
(Ramsby et al. 1994). DDF has been used to fractionate monolayer cultures of bovine 
corneal endothelial cells toward the goal of monitoring intracellular levels and stores of 
plasminogen activator (Ramsby and Kreutzer 1993). DDF can be used to selectively extract 
type IV collagenase (i.e., gelatinase B) from isolated human polymorphonuclear leukocytes 
and thereby provides a mechanism for differentiating primary and secondary granule content 
release (Makowski and Ramsby 1997). In rat hepatocytes, DDF has been used to study 
both regulatory and pathological protein degradation in mitochondrial (Makowski and 
Ramsby 1999) and cytoskeletal compartments (Makowski and Ramsby 1998), respectively. 
In addition, DDF can be used to assess RNA distribution in monolayer cultures of 
osteoblasts with the goal of monitoring induction, turnover, and trafficking of specific 
mRNA in response to bone cell mitogens (M. Ramsby, unpubl.). RNA isolated from DDF 
extracts is intact and differentially distributed with respect to ribosomal and transfer RNAs 
(see Figure 3.4). DDF has also been used extensively for the study of signal transduction 
pathways leading to protein subcellular translocation, including filamin, nitric oxide 
synthase, myosin, and NF-κB (Patton 1999 and references therein). 


FIGURE 3.4. Agarose gel of osteoblastic RNA isolated from differential detergent cell fractions. RNA was 
isolated from osteoblastic cell fractions obtained by DDF: digitonin/EDTA cytosolic (CYTO), Triton/EDTA 
membrane/organelle (MO), and combined nuclear/detergent-resistant cytoskeletal/matrix (NUC/CSK) (A), 
and compared to total RNA isolated from unfractionated monolayers (B). RNA (15 µg in equal volumes 
[15 µl]) was separated by gel electrophoresis on 1% agarose formaldehyde gels containing ethidium bromide 
to visualize ribosomal RNA (28S, 18S) and transfer RNA (tRNA). (Courtesy of Melinda L. Ramsby and 
Gregory S. Makowski, University of Connecticut Health Center.) 


The DDF protocol described here represents a modification of a method used for the 
fractionation of Madin-Darby canine kidney (MDCK) cells (Fey et al. 1984). Modifications 
include the addition of a digitonin extraction step, the inclusion of EDTA in digitonin and 
Triton buffers, and the elimination of a nuclease digestion step (DNA is denatured by shear 
force in the presence of SDS). Intact RNA can also be isolated from each detergent fraction 
by routine protocols and subjected to northern blot analysis for investigation of mRNA 
dynamics (see ADDITIONAL PROTOCOL: ISOLATION OF RNA FROM DETERGENT EXTRACTS at the end 
of this protocol). The main protocol and additional protocols were generously provided by 
Melinda Ramsby and Gregory Makowski (University of Connecticut Health Center). 


MATERIALS 


CAUTION: See Appendix 3 for appropriate handling of materials marked with <!>. 


» Reagents 


Acetone (90%) chilled to -20°C <!> 
Detergent extraction buffers 
In general, the detergent extraction buffer formulations are for 100 ml of working solution. Adjust 
the final volumes for specific use, or freeze aliquots for future use. Solutions should be thawed on 
ice and not refrozen. The PIPES buffer recipes are listed elsewhere in this Materials list. For 
additional information on the properties of the detergents, see Table 3.8 and the booklet, A Guide to the 
Preparation and Uses of Detergents in Biology and Biochemistry (available at http://www.calbiochem.com). 
* Digitonin extraction buffer (0.015% digitonin, pH 6.8 at 4°C) <!> 
Dissolve by heating (boiling) 18.75 mg of digitonin with 10 ml of 4x PIPES buffer in a small 
flask with a stir bar. Add 1 ml of 100 mM PMSF <!> while the buffer is still warm. Combine 
with 15 ml of 4x PIPES buffer and 5 ml of 100 mM EDTA. Cool to 4°C and adjust pH to 6.8 
with HCl <!>. Add H2O to a final volume of 100 ml and keep on ice until use. Digitonin buffer 
without EDTA is stable for 3 hours at 4°C. EDTA prolongs the stability of this buffer. 

* Triton X-100 extraction buffer (0.5% Triton X-100, pH 7.4 at 4°C) <!> 
Combine 25 ml of 4x PIPES buffer, 1 ml of 100 mM PMSF, 3 ml of 100 mM EDTA, and 5 ml of 
freshly prepared 10% (v/v) Triton X-100. Cool to 4°C, adjust pH to 7.4, and add H2O to 100 
ml (final volume). Keep on ice until use. 

* Tween-40/deoxycholate extraction buffer (1% Tween-40/0.5% DOC, pH 7.4 at 4°C) 
Dissolve 0.5 g of DOC in 25 ml of 10x PIPES buffer, and separately dissolve 1 ml of Tween-40 
in 2.5 ml of 10x PIPES buffer (warm if necessary). Combine the Tween and DOC solutions, 
and then add 5 ml of 10x PIPES buffer and 1 ml of 100 mM PMSF. Cool to 4°C, adjust pH to 
7.4, and add H2O to 100 ml (final volume). Keep on ice until use. 


EXPERIMENTAL TIP: Triton and Tween are both nonionic detergents, which decompose in 
aqueous solutions to form peroxides that oxidize protein sulfhydryl groups (Lever 1977; Chang 
and Bock 1980; Lenstra and Bloemendal 1983). Although EDTA prolongs stability, buffers 
should be used as freshly prepared or freshly thawed and not refrozen. This avoids artifacts and 
assures reproducibility between different fractionations. 


TABLE 3.8. Detergents used in differential detergent fractionation of cells 

Detergent: Digitonin 
Cell fraction: CYTO 
Volume of detergent used: 
  suspension cells: 5 volumes/g (wet weight) of cell pellet 
  adherent cells: 1 ml per T-25 flask-equivalent; 3 ml per 100-mm Petri dish 
Comments: 
  Digitonin is a steroidal compound that complexes with plasma membrane cholesterol to 
  cause membrane permeabilization and rapid release of soluble cytosolic components 
  (Mackall et al. 1979). 
  Low concentrations of digitonin (0.015%) preserve ER, mitochondrial, lysosomal, and 
  organelle membrane integrity, hence, ultrastructure, which are damaged at higher 
  concentrations (~0.1%) (Fiskum et al. 1980; Weigel et al. 1983). 
  EDTA significantly enhances the effectiveness of low concentrations of digitonin, as 
  evidenced by an increased rate of membrane permeabilization (10 minutes in the presence 
  of EDTA vs. 40 minutes in the absence of EDTA) and inhibits calcium-dependent neutral 
  proteases (calpains), typically enriched in cytosolic extracts (Ramsby et al. 1994). 
  Digitonin releases proteins larger than 200 kD (Weigel et al. 1983), as evidenced by the 
  presence of myosin (~220 kD), desmoplakin II (~250 kD), and the calpain-inhibitor 
  calpastatin (~300 kD) in cytosolic extracts (Ramsby et al. 1994). 
  The combination of ice-cold temperatures and EDTA results in microtubule 
  depolymerization, which promotes fractionation of tubulins and microtubule-associated 
  proteins in digitonin/EDTA extracts. Tubulins and microtubule-binding proteins can be 
  harvested by repolymerization at 37°C and magnesium precipitation (see ADDITIONAL 
  PROTOCOL: PRECIPITATION OF TUBULINS AND MAPS USING MAGNESIUM). 

Detergent: Triton X-100 
Cell fraction: MO 
Volume of detergent used: 
  suspension cells: 5 volumes/g (wet weight) of cell pellet 
  adherent cells: 1 ml per T-25 flask-equivalent; 3 ml per 100-mm Petri dish 
Comments: 
  Triton is a nonionic detergent, which solubilizes membrane lipids and releases organelle 
  contents. 
  It has been used in hyper- or hypotonic buffers to obtain cytoskeletal preparations 
  enriched in intermediate filaments (Franke et al. 1978). 
  Low concentrations of Triton concomitant with iso-osmolar, isotonic buffer composition, 
  as used here, preserve the integrity of the nuclear membrane as well as the microfilament 
  cytoskeleton (Fey et al. 1984). 
  Triton extraction is possible using either the X-100 or X-114 detergent series. 
  Fractionation with X-114 allows subfractionation of peripheral versus integral membrane 
  proteins (Bordier 1981; Pryde and Phillips 1986) and lends further flexibility to 
  fractionation goals. 

Detergent: Tween/DOC 
Cell fraction: NUC 
Volume of detergent used: 
  suspension cells: 2.5 volumes/g (wet weight) of cell pellet 
  adherent cells: 0.5-1 ml per T-25 flask-equivalent 
Comments: 
  DOC is a weakly ionic detergent that disrupts nuclear membrane and microfilament 
  integrity (Capco et al. 1982; Fey et al. 1984; Reiter et al. 1985). 
  Tween/DOC extracts contain proteins common to both the membrane/organelle and 
  detergent-resistant cytoskeletal fractions, perhaps representing a more labile protein 
  compartment. 

Detergent: none (detergent-resistant residue) 
Cell fraction: CSK/MAT 
Volume of detergent used: not applicable 
Comments: 
  DNA present in the detergent-resistant fraction is viscous and difficult to manage. 
  Denaturation is achieved by mechanical shear, using either Teflon/glass homogenization 
  (for suspension pellets) or trituration with a pipette (for monolayer residues). 
  The cytoskeletal filaments in this fraction are intact as evidenced by two-dimensional 
  PAGE (Ramsby et al. 1994). Staircase patterns, indicative of cytoskeletal degradation, 
  however, do arise if PMSF or EDTA is absent from extraction buffers. 
  Solubilization of detergent-resistant samples in SDS phosphate buffer, in the absence of 
  mercaptoethanol, allows direct assay of protein content by the method of Peterson (1983). 

Table courtesy of Melinda L. Ramsby and Gregory S. Makowski (University of Connecticut Health Center). 
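
The digitonin and Triton X-100 buffer recipes above are built from stock solutions diluted to a fixed final volume, so the working concentration of each component follows from C1 x V1 = C2 x V2. The sketch below applies that relation to the volumes given in the recipes; the function name is illustrative, and units are whatever the stock is expressed in (x, mM, or %).

```python
def final_concentration(stock_conc: float, stock_volume_ml: float,
                        final_volume_ml: float = 100.0) -> float:
    """Concentration after diluting a stock to the final volume (C1*V1 = C2*V2)."""
    return stock_conc * stock_volume_ml / final_volume_ml


if __name__ == "__main__":
    # Triton X-100 extraction buffer, 100 ml final volume
    print("PIPES buffer:", final_concentration(4, 25), "x")
    print("PMSF:        ", final_concentration(100, 1), "mM")
    print("EDTA:        ", final_concentration(100, 3), "mM")
    print("Triton X-100:", final_concentration(10, 5), "% (v/v)")
    # Digitonin extraction buffer: 10 ml + 15 ml of 4x PIPES, 5 ml of 100 mM EDTA
    print("PIPES buffer:", final_concentration(4, 10 + 15), "x")
    print("EDTA:        ", final_concentration(100, 5), "mM")
```

The same function can be used to rescale a recipe: pass the new final volume and solve for the stock volume that keeps the working concentration unchanged.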


Cytoskeleton solubilization buffer (5% SDS <!>, 10 mM sodium phosphate <!>, pH 7.4) 
For nonreducing buffer, dissolve 0.5 g of SDS in 5 ml of 20 mM sodium phosphate buffer, pH 7.4. 
Add H2O to 10 ml (final volume). For denaturing buffer, add 1 ml of β-mercaptoethanol <!>. 
Adjust H2O appropriately. 

1x O'Farrell lysis buffer 
Dissolve 5.7 g of ultrapure electrophoretic-grade urea <!>, 0.2 ml of NP-40, 0.2 ml of ampholines 
(0.16 ml at pH 5-7 and 0.04 ml at pH 3-10), and 0.5 ml of β-mercaptoethanol in H2O. Adjust final 
volume to 10 ml with H2O. Solution may be warmed to facilitate solubilization. Divide into 1-ml 
aliquots and store at -70°C. 


10x O'Farrell lysis buffer 
Combine 0.2 ml of NP-40, 0.2 ml of ampholines (0.16 ml at pH 5-7 and 0.04 ml at pH 3-10), and 
0.5 ml of β-mercaptoethanol in H2O. Adjust final volume to 1 ml with H2O. Solution may be 
warmed to facilitate solubilization. Divide into 100-µl aliquots and store at -70°C. 


Phosphate-buffered saline (PBS, pH 7.4), ice-cold 
8.0 g of NaCl (final concentration 137 mM) 
0.2 g of KCl (final concentration 2.7 mM) <!> 
1.44 g of Na2HPO4 (final concentration 10.1 mM) <!> 
0.24 g of KH2PO4 (final concentration 1.8 mM) <!> 
in 800 ml of H2O 
Adjust pH to 7.4, and then adjust volume to 1 liter with H2O. 


PBS (pH 7.4) containing 1.2 mM PMSF, ice-cold 
Add the PMSF just before use. 


4x PIPES buffer (piperazine-N,N'-bis[2-ethanesulfonic acid]) 
Dissolve 103 g of sucrose and 5.8 g of NaCl in 150 ml of H2O. Dissolve 3 g of PIPES in a small 
volume of 1 M NaOH <!>. Mix the PIPES solution with the sucrose/NaCl solution. Add 0.64 g of 
MgCl2·6H2O <!> to the mixture. Adjust final volume to 250 ml and filter through a 0.45-µm 
sterile filter. Store in the dark at 4°C or aliquot and freeze (stable for 2 months at 4°C). Employ 
aseptic technique when diluting the 4x PIPES buffer for preparation of 1x working solutions or 
store as single-use aliquots to avoid contamination. 

10x PIPES buffer 
Dissolve 0.58 g of NaCl in 50 ml of H2O, and separately dissolve 3 g of PIPES in a small volume of 
1 M NaOH. Mix the PIPES solution with the NaCl solution. Add 0.2 g of MgCl2·6H2O to the 
solution and adjust final volume to 100 ml with H2O. Filter and store as described for the 4x PIPES 
buffer. 
Trypan blue 
Urea (solid) 


» Cells 

Suspension or monolayer cell cultures 


» Equipment 

Centrifuge (refrigerated) 
Homogenizer, Teflon smooth-walled glass 
Lyophilizer 
Platform mixer 


» Additional Reagents 

Step 4 of Section IV of this protocol requires the reagents listed in Appendix 2, Technique 1. 
Section V of this protocol requires the reagents listed in Appendix 2. 


METHOD 


I. Preparation of Cells 


1. Cool the cell cultures on ice, and wash them two to three times in ice-cold saline, PBS, or 
other nondetergent buffer. 


Save an aliquot of the wash buffer (store it at -70°C) to use for future sample normalization 
or as control material for enzymatic, protein, or RNA analyses. 


2. Choose the appropriate method below for working with suspension or monolayer cultures: 


If working with suspension cultures: The volume of DDF solutions for suspension-cultured 
cells is based on wet weight or cell number. 


a. Transfer an aliquot of the suspension culture to a preweighed plastic tube and cen- 
trifuge briefly. 


b. Decant the culture media and determine the wet weight of the cell pellet. 
c. Proceed to Step 1 in Section II. 


If working with monolayer cultures: The volume of DDF buffers for monolayer cell cul- 
tures is based on surface area or cell number. 


a. Determine the surface area of the culture flask or dish. Alternatively, count the cells in 
a representative culture vessel. 


b. Proceed to Step 1 in Section III. 


II. Detergent Fractionation of Suspension Cell Cultures 


DDF is performed by the sequential addition and removal of DDF buffers. All extractions are 
performed on ice with gentle agitation. Attention to extraction times is important for 
reproducibility between different fractionations. Extracts are maintained on ice until used for assay 
or frozen for storage (-70°C is best). Measure the recovered volume of each detergent extract 
to calculate net yields or percent distributions. Save an aliquot of each DDF buffer for future 
sample normalization or control material for assays. 


1. Add 5 volumes of ice-cold digitonin extraction buffer per gram wet weight to the washed 
cell pellets. Gently resuspend the cell pellets by swirling. 


2. Incubate the cells on ice with gentle agitation on a platform mixer until 95-100% of the 
cells are permeabilized (~10 minutes) as assessed by Trypan blue exclusion. 


EXPERIMENTAL TIP: It is important to maintain consistent extraction times between 
different fractionations. 


3. Centrifuge the extraction mixture at 480g for 10 minutes. 


4. Transfer the supernatant to a clean tube. Record the supernatant (cytosolic fraction) 
volume, divide the cytosolic fraction into aliquots, and store them at -70°C. This is the 
CYTO fraction (see Figure 3.3 and Table 3.7). 


5. Carefully resuspend the digitonin-insoluble pellets in 5 volumes (relative to starting wet 
weight of the cell pellet [determined above in Section I, Step 2]) of ice-cold Triton X-100 
extraction buffer to obtain a homogeneous suspension. 


Triton X-114 can be substituted for Triton X-100 in DDF protocols without obvious 
difference on two-dimensional gels. Triton X-114 has a lower cloud point than Triton X-100 
(20°C vs. 60°C) and can partition into lipid-rich and lipid-poor phases at nondenaturing 
temperatures. This enables subfractionation of integral and peripheral membrane 
proteins as well as soluble organelle contents (Bordier 1981; Pryde and Phillips 1986). 


6. Incubate the cells on ice with gentle agitation on a platform mixer for 30 minutes. 

7. Centrifuge the extraction mixture at 5000g for 10 minutes. 


8. Transfer the supernatant to a clean tube. Record the supernatant (membrane and 
organelle fraction) volume, divide this fraction into aliquots, and store them at -70°C. 
This is the MO fraction (see Figure 3.3 and Table 3.7). 


9. To the Triton-insoluble pellets, add Tween-40/deoxycholate extraction buffer at one-half 
the volume used for Triton extraction (Step 5). Resuspend the pellets with five strokes at 
medium speed using a Teflon, smooth-walled glass homogenizer. 


10. Centrifuge the extraction mixture at 6780g for 10 minutes. 


11. Transfer the supernatant to a clean tube. Record the supernatant (nuclear fraction) 
volume, divide the nuclear fraction into aliquots, and store them at -70°C. This is the NUC 
fraction (see Figure 3.3 and Table 3.7). 


12. Add ice-cold PBS (pH 7.4) containing 1.2 mM PMSF to the detergent-resistant pellets. 
Resuspend the pellets using three strokes in a Teflon/glass homogenizer. Collect the 
pellets by centrifugation at 12,000g for 10 minutes. 


13. Repeat Step 12 two more times to thoroughly wash the pellets. 


14. Discard the supernatant, and wash the pellets once with 90% acetone (previously chilled 
to -20°C). 


15. Lyophilize the pellets (cytoskeletal/nuclear matrix fraction) overnight. 

16. Determine the pellet weights in tared microfuge tubes. Store the samples at -70°C. This 
is the CSK/MAT fraction (see Figure 3.3 and Table 3.7). 


Typically, the yield of the CSK/MAT fraction from cell suspensions is large. Thus, it is 
usually more convenient to store the fraction as dried pellets, and resuspend measured aliquots 
in nondenaturing cytoskeleton solubilization buffer as needed. 


17. Proceed to Section IV. 
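
For suspension cultures, every extraction volume in Section II is tied to the wet weight of the starting pellet, and each extraction ends with a spin at a different force. The sketch below lays the three detergent extractions out as a small table and computes the buffer volumes for a given pellet. It assumes that "1 volume" means 1 ml per gram wet weight; the class and function names are illustrative only.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ExtractionStep:
    fraction: str            # fraction recovered in the supernatant
    buffer: str              # extraction buffer added to the pellet
    volumes_per_gram: float  # ml per gram wet weight (1 volume taken as 1 ml/g)
    spin_g: int              # centrifugal force for the clearing spin
    spin_minutes: int


# Values taken from Section II (Steps 1-11) of the protocol.
SUSPENSION_DDF = (
    ExtractionStep("CYTO", "digitonin extraction buffer", 5.0, 480, 10),
    ExtractionStep("MO", "Triton X-100 extraction buffer", 5.0, 5000, 10),
    ExtractionStep("NUC", "Tween-40/deoxycholate extraction buffer", 2.5, 6780, 10),
)


def buffer_plan(wet_weight_g: float) -> list:
    """Return (fraction, buffer volume in ml, spin in g, minutes) for each extraction."""
    return [(s.fraction, s.volumes_per_gram * wet_weight_g, s.spin_g, s.spin_minutes)
            for s in SUSPENSION_DDF]


if __name__ == "__main__":
    for fraction, ml, g, minutes in buffer_plan(2.0):
        print(f"{fraction}: {ml:.1f} ml, then {g}g for {minutes} min")
```

The Tween/DOC entry uses 2.5 volumes because Step 9 calls for one-half the Triton volume.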


III. Detergent Fractionation of Monolayer Cell Cultures 


DDF is performed by the sequential addition and removal of DDF buffers. All extractions are 
performed on ice with gentle agitation. Attention to extraction times is important for 
reproducibility between different fractionations. Extracts are maintained on ice until used for assay 
or frozen for storage (-70°C is best). Measure the recovered volume of each detergent extract 
to calculate net yields or percent distributions. Save an aliquot of each DDF buffer for future 
sample normalization or control material for assays. 


1. Add 1 ml of ice-cold digitonin extraction buffer directly to the cell monolayer in a T-25 
flask. 


A typical T-25 culture flask contains ~5 x 10^6 cells. Use 3 ml of digitonin extraction buffer 
for a T-75 flask or 100-mm diameter Petri dish. 


2. Incubate the cells on ice with gentle agitation on a platform mixer until 95-100% of the 
cells are permeabilized (~10 minutes) as assessed by Trypan blue exclusion. 


EXPERIMENTAL TIP: It is important to maintain consistent extraction times between 
different fractionations. 


3. Tilt the culture flask and decant the liquid (the cytosolic fraction) into a culture tube. Use 
a pipette to transfer any residual liquid from the flask into the culture tube. Record the 
volume of the cytosolic fraction, divide it into aliquots, and store them at -70°C. This is 
the CYTO fraction (see Figure 3.3 and Table 3.7). 


4. Add 1 ml of Triton extraction buffer per T-25 flask equivalent (~5 x 10^6 cells). 


Triton X-114 can be substituted for Triton X-100 in DDF protocols without obvious 
difference on two-dimensional gels. Triton X-114 has a lower cloud point than Triton X-100 
(20°C vs. 60°C) and can partition into lipid-rich and lipid-poor phases at nondenaturing 
temperatures. This enables subfractionation of integral and peripheral membrane 
proteins as well as soluble organelle contents (Bordier 1981; Pryde and Phillips 1986). 


5. Incubate the cells on ice with gentle agitation on a platform mixer for 30 minutes. 


6. Tilt the culture flask and decant the liquid (the membrane and organelle fraction) into a 
culture tube. Use a pipette to transfer any residual liquid from the flask into the culture 
tube. Record the volume of this fraction, divide it into aliquots, and store them at -70°C. 
This is the MO fraction (see Figure 3.3 and Table 3.7). 


7. Extract cell monolayers with 0.5-1.0 ml of Tween-40/deoxycholate extraction buffer per 
T-25 flask equivalent (~5 x 10^6 cells). 


8. Tilt the culture flask and decant the liquid (the nuclear fraction) into a culture tube. Use 
a pipette to transfer any residual liquid from the flask into the culture tube. Record the 
volume of the nuclear fraction, divide it into aliquots, and store them at -70°C. This is 
the NUC fraction (see Figure 3.3 and Table 3.7). 

9. Rinse the monolayers in situ with PBS. 


10. Add 1-3 ml of nondenaturing cytoskeleton solubilization buffer without 
β-mercaptoethanol to the detergent-resistant residue in the culture flasks. Suspend the residue by 
trituration. Record the volume of this material (the cytoskeletal and matrix fraction), 
divide it into aliquots, and store them at -70°C. This is the CSK/MAT fraction (see Figure 
3.3 and Table 3.7). 

The volume of nondenaturing cytoskeleton solubilization buffer required will vary 
depending on the cell type and length of time in culture (i.e., the amount of extracellular matrix). 
In general, 1-3 ml is an appropriate range. 


11. Proceed to Section IV. 


IV. Determination of Protein Concentration 


1. Thaw detergent buffers and extracts on ice. 


2. Dilute the digitonin/EDTA and Triton/EDTA extracts with H2O (4 volumes for extracts
from suspension cultures, 2-3 volumes for extracts from monolayers). Use the
Tween/DOC extracts undiluted.


3. Solubilize the lyophilized CSK/MAT pellets in cytoskeleton solubilization buffer without
β-mercaptoethanol. Use 1 ml of buffer per 10 mg (dry weight) of pellet. Dilute CSK/MAT
preparations from monolayers as necessary.


4. Assay 20-50 µl of each sample in duplicate, using the BCA method (see Appendix 2).


An alternative assay is the Folin-phenol method of Peterson (1983), which is not susceptible
to interference by detergents. Assay by the standard Lowry method is not recommended
because it results in detergent-induced flocculation (M. Ramsby, unpubl.).
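
The arithmetic behind steps 2 and 4 can be sketched in a few lines of Python. This is only an illustration: the standard-curve values are hypothetical, the BCA response is treated as linear over the range used, and "n volumes of H2O" is read as 1 part extract plus n parts water. None of these assumptions comes from the protocol itself.

```python
from statistics import mean


def fit_line(xs, ys):
    """Ordinary least-squares fit of y = slope * x + intercept."""
    x_bar, y_bar = mean(xs), mean(ys)
    sxx = sum((x - x_bar) ** 2 for x in xs)
    sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, y_bar - slope * x_bar


def dilution_factor(volumes_of_water):
    """Assumed reading: 1 volume of extract + n volumes of H2O = (n + 1)-fold dilution."""
    return volumes_of_water + 1


def protein_conc(duplicate_absorbances, slope, intercept, volumes_of_water=0):
    """Concentration of the undiluted extract, in the units of the standards."""
    absorbance = mean(duplicate_absorbances)
    diluted_conc = (absorbance - intercept) / slope
    return diluted_conc * dilution_factor(volumes_of_water)


# Hypothetical protein standards (mg/ml) and their absorbance readings.
standards_mg_per_ml = [0.0, 0.25, 0.5, 1.0, 2.0]
standard_readings = [0.05, 0.30, 0.55, 1.05, 2.05]
slope, intercept = fit_line(standards_mg_per_ml, standard_readings)

# Duplicate readings for an extract from a suspension culture (diluted with 4 volumes of H2O).
conc = protein_conc([0.44, 0.46], slope, intercept, volumes_of_water=4)
print(f"Estimated protein concentration: {conc:.2f} mg/ml")
```

Averaging the duplicates before reading the curve mirrors the "in duplicate" instruction in step 4; Tween/DOC extracts, which are used undiluted, would be passed with `volumes_of_water=0`.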


V. Analysis of Protein Fractions Using Two-dimensional Gel Electrophoresis 


DDF samples obtained from a variety of cell types can be utilized for two-dimensional PAGE
under both IEF and nonequilibrium pH gradient electrophoresis (NEpHGE). For examples
of two-dimensional gels, see Ramsby et al. (1994) and Ramsby and Makowski (1999). Fresh
or thawed DDF extracts (kept on ice) are first normalized to contain protein in equal volumes
and then brought to 9.5 M urea by addition of solid urea.

* For a 100-µl sample, add 85 mg of urea and 15 µl of 10x O'Farrell lysis buffer and warm to
room temperature.

* For the dried CSK extract, solubilize directly in 1x O'Farrell lysis buffer (O'Farrell 1975;
O'Farrell et al. 1977).

* For CSK extracts in nonreducing SDS buffer, adjust to 9.5 M urea by the addition of solid
urea and add 10x O'Farrell lysis buffer.

Solid urea and 10x O'Farrell lysis buffer are added to minimize dilutional effects and 
enable detection of low-abundance proteins. Samples may be warmed slightly to facilitate 
urea solubilization, but avoid excessive warming (increased temperature and/or prolonged 
heating), which may result in carbamylation artifacts.

* DDF samples are analyzed by 2D electrophoresis using established methods (O'Farrell
1975; O'Farrell et al. 1977; also see Chapter 4).


* IEF gels (see Chapter 4) contain a total of 3.5% ampholines (2% pH 5-8, 1% pH 3-10,
0.5% pH 2-5), and samples are electrophoresed for a total of 9800 Volt-hours with
hyperfocusing at 800 V for the final hour (Duncan and Hershey 1984; Ramsby et al. 1994).


* NEpHGE gels contain 2% pH 3-10 ampholines, and samples are electrophoresed for 2400
Volt-hours (Ramsby et al. 1994).
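
Volt-hour targets such as those above translate into run times once a running voltage is chosen. The following Python sketch shows the bookkeeping; the 400 V running voltage is a hypothetical value chosen for illustration and is not specified in the text.

```python
def remaining_volt_hours(total_vh, final_volts, final_hours=1.0):
    """Volt-hours left for the main run after subtracting the final hyperfocusing step."""
    return total_vh - final_volts * final_hours


def hours_at(volt_hours, volts):
    """Time in hours needed to accumulate `volt_hours` at a constant voltage."""
    return volt_hours / volts


ASSUMED_RUN_VOLTS = 400  # hypothetical constant voltage for the main part of the run

# IEF: 9800 Vh in total, with the final hour at 800 V.
ief_main_vh = remaining_volt_hours(9800, 800)
ief_main_hours = hours_at(ief_main_vh, ASSUMED_RUN_VOLTS)

# NEpHGE: 2400 Vh in total.
nephge_hours = hours_at(2400, ASSUMED_RUN_VOLTS)

print(f"IEF main run: {ief_main_vh} Vh, {ief_main_hours} h at {ASSUMED_RUN_VOLTS} V")
print(f"NEpHGE run: {nephge_hours} h at {ASSUMED_RUN_VOLTS} V")
```

With a different power supply setting, only `ASSUMED_RUN_VOLTS` changes; the Volt-hour totals remain those given in the text.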


In general, samples containing 25-100 µg of protein in 15-60 µl are ample for visualization
by Coomassie Blue staining or autoradiography; low-abundance proteins are detected by
silver staining. In control experiments, the volume of detergent buffer contained in the
samples did not adversely affect the linear range of the pH gradient in IEF gels. However, a slight
shift to more acidic values may occur with samples containing Tween/DOC.


DEGRADATION STUDIES 


DDF extracts are an excellent source of material for protein degradation studies. Frozen
samples are stable (-70°C), and when aliquoted, they provide a reliable and reproducible
source of protein. In general, DDF extracts are of sufficient protein concentration (Ramsby
et al. 1994) that they may be diluted severalfold in the buffer and pH of choice, thereby
limiting potential interference by detergents. DDF detergents are also compatible with a
variety of potential protease activators (e.g., metals and sulfhydryl reagents) and inhibitors
(e.g., PMSF, pepstatin, leupeptin, and chelators), which may be added from stock solutions
(Makowski 1989). DDF extracts have been used successfully to study degradation of specific
proteins in various subcellular compartments in rat hepatocytes including cytosolic
(Makowski 1989), membrane/organelle (Makowski and Ramsby 1999), and cytoskeletal/matrix
fractions (Makowski and Ramsby 1998). Because of the high protein content of
DDF extracts, degradation experiments may be simply monitored by standard or modified
(Makowski and Ramsby 1993) gel electrophoretic methods (SDS-PAGE), with and without
reduction with β-mercaptoethanol, and can be combined with western blotting. In general,
the concentration of detergents in the DDF extracts does not interfere with enzymatic
and electrophoretic analysis of proteins.


ADDITIONAL PROTOCOL: ISOLATION OF RNA FROM DETERGENT EXTRACTS


Total RNA is isolated from detergent extracts by the acid guanidinium isothiocyanate/phenol/chloroform
method of Chomczynski and Sacchi (1987).


IMPORTANT: Strict attention must be paid to the preparation of glassware and reagents to avoid RNA degradation by
contaminating RNases. Bake all glassware for 24 hours at 300°C and use for RNA processing only. Formulate RNA
buffers/solutions with ultrapure H2O previously treated with DEPC and autoclaved. Keep samples on ice, and pipette
using sterile, cell culture 10-ml plastic pipettes. RNA isolation from 2.0 ml of detergent extract is convenient to work with
and affords enough sample for both protein and RNA analysis, including northern blot hybridization. In general, 3 volumes
of GIT buffer/volume of detergent extract is the minimal ratio compatible with recovery of intact RNA; lesser ratios
are associated with RNA degradation.


Additional Materials


CAUTION: See Appendix 3 for appropriate handling of materials marked with <!>.
Chloroform <!>
Ethanol
Isopropanol
Phenol (H2O-saturated) <!>
RNA extraction buffer

Prepare RNA extraction buffer according to the method of Chomczynski and Sacchi (1987). To prepare a stock
solution, use only molecular-biology-grade reagents. A stock solution contains 4 M guanidinium isothiocyanate <!> (heat
at 65°C to dissolve), 0.5 M sodium citrate, and 0.5% (v/v) N-lauroylsarcosine (Sarkosyl). The stock solution is stable
for 3 months at room temperature. Add β-mercaptoethanol fresh before use (360 µl per 50 ml of RNA extraction
buffer) to obtain a working solution.

SDS (0.1%) containing 1 mM EDTA <!>
Sodium acetate (2 M, pH 4.0)
Vacuum dryer (SpeedVac or equivalent)
Water bath preset to 65°C


Method

1. Add 3 volumes of RNA extraction buffer per volume of detergent extract, and triturate four times to denature
the proteins.

2. To each sample, add:
    0.1 volume of 2 M sodium acetate (pH 4.0)
    1 volume of H2O-saturated phenol
    0.2 volume of chloroform
Mix vigorously after each addition. Allow the samples to sit on ice for 15 minutes.

3. Centrifuge the samples at 10,000g for 20 minutes.

4. Carefully remove the upper aqueous phase with a pipette, and transfer it to a clean tube. Add an equal volume
of isopropanol, and invert the tube to mix. Store the tubes overnight at -20°C to precipitate RNA.

5. The next day, centrifuge the samples at 10,000g for 15 minutes.

6. Carefully discard the supernatant, and invert the tubes to allow them to drain.

7. Wash the RNA precipitates with 1 ml of 75% ethanol, and transfer them to clean 1.5-ml microfuge tubes.
Centrifuge them again at 10,000g for 10 minutes.

8. Dry the RNA pellet in a centrifugal vacuum dryer for 5 minutes.

9. Resuspend the RNA in 30-50 µl of 0.1% SDS containing 1 mM EDTA. Heat samples for 10 minutes at 65°C.

10. Clarify the solution by centrifugation in a microfuge for 2 minutes.

11. Transfer the clear supernatant to a fresh tube. To quantify and assess purity, dilute an aliquot of the RNA
solution 250-fold (e.g., add 4 µl of RNA to 996 µl of H2O) and measure the absorbance at 260 and 280 nm.
A260/A280 ratios are typically 1.45 ± 0.15.
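
The buffer ratio and the absorbance readings in the method above reduce to simple arithmetic, sketched here in Python. The conversion factor of 40 µg/ml per A260 unit is the commonly used approximation for RNA in a 1-cm cuvette and is an assumption of this sketch, not a value stated in the protocol; the absorbance readings are hypothetical.

```python
RNA_UG_PER_ML_PER_A260 = 40.0  # common approximation for RNA, 1-cm path length (assumption)


def extraction_buffer_volume(extract_ml, volumes=3.0):
    """RNA extraction buffer needed at the stated ratio of 3 volumes per volume of extract."""
    return extract_ml * volumes


def rna_concentration_ug_per_ml(a260, dilution=250):
    """Concentration of the undiluted RNA solution from the A260 of the diluted aliquot."""
    return a260 * RNA_UG_PER_ML_PER_A260 * dilution


def purity_ratio(a260, a280):
    """A260/A280 ratio used to assess purity."""
    return a260 / a280


# Hypothetical readings for a 250-fold diluted aliquot (4 µl RNA + 996 µl H2O).
a260, a280 = 0.200, 0.138
print(f"Buffer for 2.0 ml extract: {extraction_buffer_volume(2.0)} ml")
print(f"RNA: {rna_concentration_ug_per_ml(a260):.0f} µg/ml")
print(f"A260/A280: {purity_ratio(a260, a280):.2f}")
```

The 250-fold default corresponds to the 4 µl into 996 µl example in step 11; a different dilution would be passed explicitly.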




RESULTS OF RNA ISOLATION USING DDF



Total RNA yield (milligram of RNA per 100-mm diameter culture dish) is routinely higher for RNA isolated
during DDF (459 mg ± 18) than for control RNA (289 mg ± 42.5) prepared in parallel by direct addition
of guanidinium isothiocyanate to unfractionated cell cultures.


In osteoblastic cells, the percent distribution of total cellular RNA is ~17% for digitonin/EDTA extracts, 
~29% for Triton extracts, and ~54% for combined nuclear/detergent resistant fraction (M. Ramsby, 
unpubl.). 


RNA prepared from DDF extracts can be readily fractionated on agarose-formaldehyde gels (see Figure 3.4) 
and processed for northern blot hybridization by standard protocols. Using this method, single bands of 
intact mRNA have been visualized and found to vary in a compartmental-specific manner as a function of 
differentiation stage and treatment (M. Ramsby, unpubl.). 


Enrichment of tRNA in cytosolic extracts and rRNA in nuclear/cytoskeletal fractions is observed, consistent 
with selective fractionation relative to distinct subcellular compartments (see Figure 3.4). 


Specific mRNAs differentially distribute in DDF fractions relative to differentiation stage and treatment (M.
Ramsby, unpubl.), which suggests that RNA isolation using DDF provides a novel approach for investigating
mRNA localization and trafficking in parallel with protein analysis.


ADDITIONAL PROTOCOL: PRECIPITATION OF TUBULINS AND MAPs USING MAGNESIUM


Tubulins and microtubule-associated proteins present in cytosolic extracts can be separated from 
other cytosolic constituents by incubation at 37°C, followed by magnesium-induced precipitation 
(Sahyoun et al. 1982). 


Additional Materials 


CAUTION: See Appendix 3 for appropriate handling of materials marked with <!>. 


Imidazole buffer (pH 7.5) <!> 
MgCl2 (0.1 M)


Method 


1. Adjust an aliquot of the digitonin/EDTA extract (the CYTO fraction from the main protocol)
to 7-35 mM magnesium by addition of 0.1 M MgCl2. Incubate the samples for 20-30 minutes at
37°C.

The total volume of starting digitonin/EDTA extract used will depend on experimental goals. Final 
magnesium concentration will depend on the origin of the starting material. In general, 35 mM 
magnesium is excessive, but broadly applicable for complete recovery. 


2. Harvest tubulins and associated proteins by centrifugation in a microfuge for 8 minutes. 


3. Transfer the supernatant to a clean tube. Wash the pellet in a large volume of imidazole buffer 
(pH 7.5), or other suitable buffer, and resuspend the pellet in cytoskeleton solubilization buffer 
for determination of protein concentration and electrophoretic analysis (see Sections IV and V 
of the main protocol and Table 3.7). 
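
For step 1 of this method, the volume of 0.1 M MgCl2 stock needed to reach a target magnesium concentration follows from a mass balance, sketched below in Python. The sketch assumes the extract contains no magnesium to begin with and ignores chelation by the EDTA present in the extract, so the figures it gives are nominal added concentrations only; the 1000-µl aliquot is hypothetical.

```python
def stock_volume_to_add(sample_ul, target_mm, stock_mm=100.0):
    """Volume of stock (µl) to add so the mixture reaches `target_mm`.

    Solves stock_mm * v = target_mm * (sample_ul + v) for v.
    """
    if not 0 < target_mm < stock_mm:
        raise ValueError("target must be above zero and below the stock concentration")
    return sample_ul * target_mm / (stock_mm - target_mm)


def final_concentration(sample_ul, added_ul, stock_mm=100.0):
    """Nominal concentration (mM) after adding `added_ul` of stock to the sample."""
    return stock_mm * added_ul / (sample_ul + added_ul)


sample_ul = 1000.0  # hypothetical aliquot of digitonin/EDTA extract
for target in (7.0, 35.0):
    v = stock_volume_to_add(sample_ul, target)
    print(f"{target} mM: add {v:.1f} µl of 0.1 M MgCl2 "
          f"(check: {final_concentration(sample_ul, v):.1f} mM)")
```

Because the added stock also dilutes the sample, the required volume rises steeply toward the upper end of the 7-35 mM range.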


TWO-DIMENSIONAL (2D) ELECTROPHORESIS IS A POWERFUL PROTEOMICS METHOD for separating
complex mixtures of proteins into many more components than is possible with the classical
one-dimensional method (see Chapter 2). All proteins in an electric field migrate at a speed
that is dependent on their conformation, size, and electric charge. 2D electrophoresis uses the
latter two of these characteristics to allow high-resolution separation of proteins. As first
described by both P.H. O'Farrell and J. Klose in 1975, the first-dimension step, isoelectric
focusing (IEF), separates proteins on the basis of their charge, whereas the second-dimension
step, SDS-polyacrylamide gel electrophoresis (SDS-PAGE), separates proteins according to
their molecular weights. The result is an array of protein spots (Figure 4.1) that are assigned
x and y coordinates, rather than the protein bands seen in one-dimensional separations. Each
spot in the resulting gel represents one to a few proteins, depending on the complexity of the
sample. Thus, thousands of proteins can be separated on a single gel, permitting the determination
of the isoelectric point, apparent molecular weight, and relative abundance of many
of the proteins. In addition, 2D electrophoresis can separate proteins on the basis of their
posttranslational modifications. This technique is currently central to proteomics, where it
is applied in conjunction with mass spectrometry (see Chapter 8) to identify differential
expression of proteins (expression proteomics).
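
The gain in resolving power from adding a second dimension can be illustrated with a toy model in Python. The four proteins and their pI and molecular weight values are invented for illustration, and real gels resolve by proximity rather than by exact equality, so this is a conceptual sketch only.

```python
from collections import defaultdict

# Hypothetical proteins: (name, isoelectric point, apparent molecular weight in kDa).
PROTEINS = [
    ("A", 5.2, 45.0),
    ("B", 6.8, 45.0),  # same size as A, different charge
    ("C", 5.2, 70.0),  # same pI as A, different size
    ("D", 8.1, 22.0),
]


def one_d_bands(proteins):
    """One-dimensional SDS-PAGE: proteins of equal molecular weight share a band."""
    bands = defaultdict(list)
    for name, _pi, mw in proteins:
        bands[mw].append(name)
    return dict(bands)


def two_d_spots(proteins):
    """2D gel: each spot has an x coordinate (pI, from IEF) and a y coordinate (MW)."""
    spots = defaultdict(list)
    for name, pi, mw in proteins:
        spots[(pi, mw)].append(name)
    return dict(spots)


print("1D bands:", one_d_bands(PROTEINS))
print("2D spots:", two_d_spots(PROTEINS))
```

In this toy data set, proteins A and B fall into a single band in one dimension but occupy separate spots once charge is used as a second coordinate.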

The power of 2D electrophoresis has been recognized since its inception; however, it was
not widely applied because the original technique was difficult to master and the results
were hard to analyze. During the past several years, a number of developments have
improved the resolution and reproducibility of the technique and expanded our ability to
interpret results (Görg et al. 2000).

* The new 2D method developed by Görg and colleagues (1985, 1988, 2000) uses an
improved separation technique in the first dimension that replaces carrier ampholyte-generated
pH gradients with immobilized pH gradients within the gel.

* Less expensive and more powerful computers and software have improved our ability to
analyze the complex patterns of protein spots within 2D gels.

* Advancements in mass spectrometry techniques have reduced the amount of material
needed for identification and characterization of a protein to less than a few hundred
nanograms, approximately the amount of material in a single 2D protein spot.

* Ready access, via the World Wide Web, to data on genomes of organisms has allowed for
the rapid identification of the genes encoding proteins separated by 2D electrophoresis.

FIGURE 4.1. Separation of human cerebrospinal fluid proteins by 2D gel electrophoresis. pH 3-10 nonlinear
first dimension, 14%T SDS-PAGE second dimension.


This chapter provides techniques for separating, visualizing, and analyzing complex mix- 
tures of proteins by 2D electrophoresis (for troubleshooting 2D gel electrophoresis, see 
www.proteinsandproteomics.org). The following topics are covered: 

* Sample preparation, including examples of methods for preparing tissue, cell, and body
fluid samples.

* Isoelectric focusing of proteins using immobilized pH gradients (IPG) for first-dimension
separations.

* Methods for using mini-, standard-, and large-format SDS-PAGE gels for second-dimension
separations.

* Visualization of proteins on gels, including five different staining techniques that can be
used on analytical and/or preparative gels.

* Methods for analyzing 2D gels by blotting.

* Instrumentation used to obtain images of 2D gels and programs for computer analysis of
2D separations.


PREPARATION OF SAMPLES FOR 2D GEL ELECTROPHORESIS


General Strategy 


A useful, interpretable, and reproducible 2D electrophoresis separation depends on the
appropriate preparation of the sample to be analyzed. Electrophoretic separation requires
that the molecules to be separated remain in solution throughout the entire procedure.
Various agents are therefore employed in the sample preparation and electrophoresis solutions
in order to assure protein solubility. The IEF technique used for the first dimension of
2D electrophoresis imposes a number of constraints on the solubilizing agents that can be
used. These agents can neither modify the pI of the proteins to be separated nor result in an
excessively conductive solution. Only solubilizing agents that remain substantially uncharged
over the pH range of the separation can be used. 2D electrophoresis contrasts in this regard
with one-dimensional SDS-PAGE, which utilizes the strongly solubilizing detergent
SDS both during sample preparation and during the procedure itself. SDS is exceptionally
effective at rendering proteins soluble, but it does so by binding to proteins and
imparting a strong charge to the resulting complex. It is therefore unsuitable as a
solubilizing agent for IEF, which depends on the unmasked, endogenous charge of the protein.
Protein-SDS complexes also remain highly charged, and therefore soluble, throughout
electrophoresis. In contrast, proteins approach their isoelectric, uncharged state during IEF,
where they are minimally soluble. For these reasons, there are many proteins that are not
amenable to separation by 2D electrophoresis using current methods, as they are not soluble
under conditions compatible with IEF. Large proteins and hydrophobic proteins tend to be poorly
represented on 2D gels. However, advances continue to be made in identifying reagents and
conditions that allow optimal solubility of proteins under IEF conditions.

Sample preparation usually begins with cell lysis or tissue disruption, which should be
done rapidly and at as low a temperature as possible in order to minimize proteolysis and
other modes of protein degradation. It is often advantageous to lyse cells or disrupt tissue
directly in a strongly denaturing solution, which will prevent protein degradation or
modification. Protease inhibitors are frequently used as well. Protocols 1-4 describe methods for
preparing various mammalian and bacterial samples for 2D electrophoresis.

Complex biological samples generally contain more proteins than can be seen on a single
2D gel. At most, a few thousand of the most abundant proteins are available for analysis.
It is therefore advisable to use some kind of prefractionation to lower the complexity of the
sample to be analyzed on a single 2D gel. If it is possible to restrict the investigation to a
subset of proteins most likely to be of interest, these proteins should be separated from the bulk
of irrelevant proteins and analyzed separately. For example, if one is studying a phenomenon
known to be restricted to the cell nucleus, then nuclei should be purified and 2D analysis
carried out on a nuclear lysate. Alternatively, a complex sample can be fractionated to yield
several samples of lower complexity, each of which can then be analyzed separately by 2D
electrophoresis (for examples of fractionation schemes utilized in this approach, see Lenstra and
Bloemendal 1983; Ramsby et al. 1994; Corthals et al. 1997; Molloy et al. 1998; Taylor et al.
2000) (for preparation of cellular and subcellular extracts, see Chapter 3).

Also present in the sample may be substances that either interfere with the separation,
render the proteins less soluble, or interfere with visualization of the result. Steps should be
taken in such cases to reduce the amount of these interfering substances, or reduce their
effect. The removal of interfering substances requires either additional processing steps (e.g.,
trichloroacetic acid [TCA] precipitation), which can result in selective losses, or the addition
of enzymes (e.g., nucleases), which can result in additional spots in the final result. The
investigator must therefore consider the trade-off between improved sample quality and accurate
representation of the proteins in the sample.

In general, it is advisable to keep 2D sample preparation as simple as possible. If the
sample contains a sufficiently high concentration of protein, and interfering substances are not
present in high amounts (as is the case with most animal tissues and cells), it may be
sufficient to disrupt the sample directly into a solution containing solubilizing agents, clarify by
centrifugation, and apply the supernatant directly to the first-dimension IEF. However, if the
sample is dilute, and contains relatively high levels of interfering substances (e.g., plant
extracts or urine), proteins in the sample can be precipitated and resuspended in a smaller
volume of solubilizing solution. This simultaneously concentrates the sample and removes
many of the interfering substances. Flengsrud and Kobro (1989), Guy et al. (1994), Usuda and
Shimogawara (1995), Tsugita et al. (1996), and Taylor et al. (2000) provide examples of
methods to precipitate proteins for 2D analysis. If the sample source has a high ratio of nucleic
acids to protein (as is the case with most bacterial cells), then a nuclease treatment should be
employed. Other general guidelines include the following:

* Reagents used in the preparation of samples for 2D electrophoresis should be of the
highest quality available. Solutions used in sample preparation should be freshly prepared or
stored as frozen aliquots.

* Preparing the sample immediately prior to first-dimension IEF best preserves sample
quality. If the sample must be stored, it should be divided into aliquots and kept at -80°C.
Samples can be stored this way for up to 1 year.

* All particulate material in the sample should be removed by centrifugation prior to
application of the sample to first-dimension IEF.


Protein Solubilization 


Proteins in the sample must be fully solubilized, disaggregated, denatured, and reduced to be
effectively separated in first-dimension IEF. To achieve this, samples for 2D electrophoresis
must always contain:

* A neutral chaotrope, which is always either urea or a mixture of urea and urea derivatives.

* A neutral or zwitterionic detergent.

* A reductant capable of reducing protein disulfide bonds to sulfhydryls and maintaining
their reduced state.

These reagents are also present during first-dimension IEF. Buffers or ampholytes may be
present as well. A number of choices exist for each of these components, which are
considered separately below.


Chaotropes 


Traditionally, urea is used at a concentration of at least 8 M. More recently, it has been shown 
that many additional proteins can be solubilized using mixtures of urea and thiourea 
(Rabilloud et al. 1997). Using these reagents together, rather than urea alone,
can significantly increase the number of proteins visible in the final 2D gel or membrane. Urea
and thiourea are used most commonly at concentrations of 7 M and 2 M, respectively, but the 
optimal ratio may differ among samples (Musante et al. 1997). The use of thiourea can result 
in disturbances and loss of resolution in the acidic range (pH 3-5) of IEF separations. This 
could be an intrinsic effect of thiourea or a result of impurities in commercial preparations. 
Full solubilization and denaturation into urea-containing solutions can be slow. The 
sample should be allowed to remain in this solution for at least 1 hour at room temperature
prior to centrifugation and application. Heating urea-containing solutions above 30°C must
be avoided because this causes hydrolysis to cyanate, which can result in protein charge
modifications.
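
As a worked example of the concentrations quoted above, the following Python sketch computes the mass of each chaotrope needed for a given final volume. The molecular weights are standard values (urea 60.06 g/mol, thiourea 76.12 g/mol) rather than figures from the text, the 10-ml batch size is hypothetical, and the sketch assumes the solids are dissolved and then brought up to the final volume.

```python
UREA_MW = 60.06      # g/mol
THIOUREA_MW = 76.12  # g/mol


def grams_needed(molarity, final_volume_ml, molecular_weight):
    """Mass of solid (g) required for `molarity` (mol/l) in `final_volume_ml`."""
    return molarity * (final_volume_ml / 1000.0) * molecular_weight


final_volume_ml = 10.0  # hypothetical batch size

urea_g = grams_needed(7.0, final_volume_ml, UREA_MW)
thiourea_g = grams_needed(2.0, final_volume_ml, THIOUREA_MW)
urea_only_g = grams_needed(8.0, final_volume_ml, UREA_MW)

print(f"7 M urea / 2 M thiourea: {urea_g:.2f} g urea, {thiourea_g:.2f} g thiourea")
print(f"8 M urea alone: {urea_only_g:.2f} g urea")
```

Because the solids occupy a large share of the final volume at these concentrations, the liquid is added last, up to the mark, rather than measured out in advance.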


Detergents 


Originally, either of two similar nonionic detergents was used: Triton X-100 or Nonidet P-40
(NP-40). More recently, the use of the zwitterionic cholamidosulfobetaine detergents CHAPS
(3-[(3-cholamidopropyl)dimethylammonio]-1-propanesulfonate) and CHAPSO has become
prominent (Perdew et al. 1983). CHAPS is available in high purity and can be used at
concentrations up to 4%. Alkylsulfobetaine detergents such as Zwittergent or SB3-10 have also
been used (Adessi et al. 1997; Molloy et al. 1998), but these reagents are relatively insoluble in
urea solutions and often require a lower concentration of urea, resulting in potentially
less-effective solubilization. Amidoalkylsulfobetaine detergents have been developed that are
highly soluble in urea and thiourea solutions and allow solubilization of otherwise
undetectable proteins (Rabilloud et al. 1990; Chevallet et al. 1998; Santoni et al. 2000). However,
many of these latter detergents are not available commercially.

SDS is often used in sample preparation for 2D electrophoresis, but it must be considered 
a special case, as it cannot be used as the sole detergent in an IEF separation. SDS solutions are 
used for their ability to rapidly solubilize and extract proteins while simultaneously inactivat- 
ing modifying enzymes. The protein sample may also contain SDS because it was originally 
prepared for SDS-PAGE analysis. In these cases, the interfering effect of SDS must be reduced 
by diluting the sample into a solution containing an excess of a neutral or zwitterionic deter- 
gent prior to application to first-dimension IEF. The final concentration of SDS should be 
0.25% (w/v) or lower, and the ratio of the excess detergent to SDS should be at least 8:1.
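
The two limits stated above (final SDS at or below 0.25% and at least an 8:1 ratio of other detergent to SDS) can be checked before a sample is applied. The Python sketch below does this for a simple dilution; the sample and diluent volumes and concentrations are hypothetical, and the function names are illustrative only.

```python
def after_dilution(sample_ul, sample_sds_pct, diluent_ul, diluent_detergent_pct):
    """Final SDS and neutral/zwitterionic detergent percentages after mixing."""
    total_ul = sample_ul + diluent_ul
    sds_pct = sample_sds_pct * sample_ul / total_ul
    detergent_pct = diluent_detergent_pct * diluent_ul / total_ul
    return sds_pct, detergent_pct


def ief_compatible(sds_pct, detergent_pct, max_sds_pct=0.25, min_ratio=8.0):
    """True if the mixture meets both limits given in the text."""
    if sds_pct == 0:
        return True  # no SDS present, nothing to dilute out
    return sds_pct <= max_sds_pct and detergent_pct / sds_pct >= min_ratio


# Hypothetical case: 10 µl of sample in 2% SDS diluted into 90 µl of 4% CHAPS solution.
sds, chaps = after_dilution(10, 2.0, 90, 4.0)
print(f"SDS {sds:.2f}%, CHAPS {chaps:.2f}%, acceptable: {ief_compatible(sds, chaps)}")
```

Both conditions must hold at once: a large dilution into a weak detergent solution can satisfy the SDS limit while still failing the 8:1 ratio.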


Reductants 


Originally, 2-mercaptoethanol was used to reduce samples for 2D electrophoresis. Its use has 
been largely superseded by the sulfhydryl reductants dithiothreitol (DTT) and dithioerythritol
(DTE), which are effective at lower concentrations and available in higher purity. DTT
and DTE are used at concentrations ranging from 20 mM to 100 mM. More recently, it has 
been reported that tributyl phosphine (TBP) may be more effective with samples that are dif- 
ficult to solubilize (Herbert et al. 1998). However, the instability and insolubility of this 
reagent in aqueous solution render it relatively ineffective at maintaining proteins in their 
reduced state during first-dimension IEF. It is therefore advisable to utilize a sulfhydryl 
reductant as well during the first-dimension separation. 


Ampholytes or buffers 


Carrier ampholytes and/or buffers are often used in sample preparation. They act by main- 
taining the pH at an optimum for protein solubility and by minimizing protein aggregation 
due to charge-charge interactions. Carrier ampholytes can be used up to 2% (v/v) and buffers
can be used at concentrations up to 40 mM during sample preparation. Buffers should be 
used with care, however, as they can interfere with IEF if present at too high a concentration 
during the first-dimension separation. Cationic buffers (e.g., Tris) will be drawn toward and 
concentrate at the cathode, resulting in less-effective separation at the basic end of the pH 
range. Amphoteric buffers (e.g., HEPES) can actually focus within the pH gradient, resulting 
in a zone of high conductivity where proteins do not focus. If buffers are used during sample 
preparation, their final concentration in the first-dimension solution should be less than 5
mM. Samples are often prepared at high pH, since most tissue proteases are inactive above pH
9. Spermine base (used at 20 mM) is reported to be particularly effective for high-pH
extractions (Rabilloud et al. 1994a). Different proteins are optimally soluble at different pH values,
so the choice of buffering conditions during sample preparation can strongly influence the
profile of proteins appearing in the final 2D result.
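
The limit of less than 5 mM buffer in the first-dimension solution implies a minimum dilution of any buffered sample. A small Python sketch of that calculation follows; the sample volume and the 200-µl final volume are hypothetical, and the 40 mM figure is the upper limit for sample preparation quoted above.

```python
def final_buffer_mm(buffer_mm, sample_ul, total_ul):
    """Buffer concentration (mM) after the sample is made up to `total_ul`."""
    return buffer_mm * sample_ul / total_ul


def volume_at_limit_ul(buffer_mm, sample_ul, limit_mm=5.0):
    """Total volume at which the buffer just equals the limit; the final volume must exceed it."""
    return buffer_mm * sample_ul / limit_mm


sample_ul = 20.0      # hypothetical sample volume
buffer_mm = 40.0      # upper limit for buffers during sample preparation
total_ul = 200.0      # hypothetical first-dimension volume

print(f"Volume at the 5 mM limit: {volume_at_limit_ul(buffer_mm, sample_ul)} µl")
print(f"Final buffer at {total_ul} µl: {final_buffer_mm(buffer_mm, sample_ul, total_ul)} mM")
```

For a sample prepared at the 40 mM maximum, this works out to a dilution of more than eightfold before IEF.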


Interfering Substances 


Nonprotein impurities may be present in samples destined for separation by 2D elec- 
trophoresis. These impurities can interfere with separation of the proteins or their subse- 
quent visualization, so steps are often taken to rid the sample of these substances. Commonly 
encountered interfering substances, methods for their removal, and possible consequences if 
they are not removed are considered below. 


Small ionic molecules 


Small ionic molecules may be present in a 2D sample either because they are endogenous to 
the material under investigation (e.g., metabolites in tissue and salts in serum) or because 
they have been used as additives during sample preparation (e.g., residual buffers and salts). 
These substances can interfere with first-dimension separations by (1) increasing the con- 
ductivity in the first-dimension separation, thereby reducing the overall effectiveness of IEF, 
and (2) accumulating at the anode or cathode or within the strip, resulting in regions of high 
conductivity where proteins do not focus. 

Such contamination can be removed by precipitating the proteins in the sample and 
resuspending them in salt-free sample solution. There are many ways of doing this, but the 
most commonly used precipitants are TCA (Guy et al. 1994) or organic solvents such as
acetone (Flengsrud and Kobro 1989), ethanol, or a mixture of chloroform and methanol (Taylor
et al. 2000). The combination of TCA (10%) and acetone is particularly effective (Damerval
et al. 1986). When using TCA, it is important that this reagent be completely removed prior
to resuspension. This is accomplished by extensive washing of the precipitate with acetone or
diethyl ether.

Dialysis is also an effective means of desalting a sample, which generally results in
minimal loss of protein. Desalting by gel-filtration chromatography may be employed, but it can
result in protein losses.

Often, the problem of a highly conductive sample can be solved by modifying upstream 
sample preparation steps to limit contamination with salts. This is generally preferable to try- 
ing to remove these contaminants at a later stage. If high-ionic-strength solutions are used 
during sample preparation, carryover of salts into subsequent steps can be significant enough 
to be problematic. Whenever possible, a solution without added salts should be substituted. 
For example, it is common practice to remove external proteins from cultured cells by
washing with phosphate-buffered saline prior to cell lysis. Carryover of salt into the cell lysate can
then result in disturbances in the first-dimension IEF separation. This problem can be pre- 
vented by substituting a low-ionic-strength washing solution, such as Tris-buffered sorbitol. 


Nucleic acids 


The presence of DNA and RNA (especially high concentrations of nucleic acid) in a sample
can result in poor focusing in the acidic region of the IEF gel. Nucleic acids also generate a
significant background if the 2D gel is silver-stained. In addition, the presence of DNA may make the
sample too viscous to work with effectively. Large nucleic acids can be removed by
ultracentrifugation, but this may also remove larger proteins and proteins bound to nucleic acids.
Treating the sample with nucleases often mitigates the negative effects of nucleic acids. This
can be accomplished by adding a one-tenth volume of a solution containing 1 mg/ml DNase
I, 0.25 mg/ml RNase A, and 50 mM MgCl2 to the protein sample and incubating on ice
(Blomberg et al. 1995). High concentrations of urea render these enzymes ineffective, so this
treatment must be applied prior to the addition of urea to the sample. Note that the
nucleases may appear in the final 2D result as extra spots. These two limitations can be
circumvented through the use of Benzonase, a bacterial endonuclease with specificity for both DNA
and RNA. Benzonase retains enough activity in the presence of 8 M urea to remain effective,
and it can be used at a low enough concentration to not appear on a 2D gel.
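
Adding one-tenth volume of the nuclease solution dilutes each component elevenfold in the final mixture. The Python sketch below works out the resulting concentrations from the stock values quoted above; the dictionary keys are labels chosen for this example.

```python
NUCLEASE_STOCK = {
    "DNase I (mg/ml)": 1.0,
    "RNase A (mg/ml)": 0.25,
    "MgCl2 (mM)": 50.0,
}


def final_after_addition(stock, added_fraction=0.1):
    """Concentrations after adding `added_fraction` volumes of stock to 1 volume of sample."""
    dilution = added_fraction / (1.0 + added_fraction)
    return {name: conc * dilution for name, conc in stock.items()}


for name, conc in final_after_addition(NUCLEASE_STOCK).items():
    print(f"{name}: {conc:.4f}")
```

The divisor is 1.1 rather than 1.0 because the added solution contributes to the final volume.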


Lipids 

Lipids can form complexes with proteins, reducing the protein's solubility. They can also
form complexes with detergents, which reduces the detergent's effectiveness. Organic solvent
precipitation can remove much of the lipid from a sample. Using excess detergent can
minimize the effects of lipids.


Phenolic compounds 


Phenolic compounds are present in many plant tissues and can modify proteins through an
enzyme-catalyzed oxidative reaction. There are a number of ways this can be prevented.

* Employing reductants during sample extraction prevents the oxidative reaction. In
addition to the DTT or DTE conventionally used during sample preparation, sulfite or
ascorbate can also be used.

* The proteins can be rapidly separated from the phenolic compounds by using precipitation
techniques such as those used to remove small ionic molecules.

* The enzyme polyphenol oxidase can be inhibited directly with thiourea.

* Polyphenolic compounds can be sequestered by adsorption to polyvinylpyrrolidone or
polyvinylpolypyrrolidone.


Protection against Proteolysis 


Cell lysis liberates proteases, which can degrade proteins and complicate 2D analysis. Most
proteases are effectively inhibited by the denaturants employed during 2D sample
preparation, but some retain activity even in the presence of 8 M urea. In these cases, protease
inhibitors may be used. Individual protease inhibitors are active against specific proteases, so
it is advisable to use protease inhibitors in combination. Protease inhibitors used during
sample preparation for 2D electrophoresis are listed below.


PMSF and AEBSF 


PMSF (phenylmethylsulfonyl fluoride) and AEBSF (aminoethylbenzenesulfonyl fluoride)
irreversibly inhibit serine and cysteine proteases. They rapidly become inactive in aqueous
solutions and must be prepared directly prior to use. Sulfhydryl reagents also inactivate them,
so they are best added to an extract to inactivate proteases prior to addition of DTT or DTE.
PMSF is used at 1 mM, whereas AEBSF is more soluble and can be used at concentrations of
up to 10 mM.


EDTA

EDTA (ethylenediaminetetraacetic acid) inhibits metalloproteases by chelating metal ions
required for activity. It should not be present at more than 1 mM during first-dimension IEF.


Peptide protease inhibitors


Reversible protease inhibitors (e.g., leupeptin, pepstatin, aprotinin, and bestatin) are active
against various classes of proteases under a variety of conditions. They are relatively expensive,
but are cost-effective because they are seldom used at concentrations above 1 µM.


THE FIRST DIMENSION: ISOELECTRIC FOCUSING WITH IMMOBILIZED pH GRADIENTS 


Proteins are amphoteric molecules that carry a positive, negative, or zero net charge, depending
on the pH of their surroundings. The isoelectric point (pI) of a protein is the pH value of
the protein's surroundings at which the protein has a zero net charge. At pH values above its
pI, a protein carries a net negative charge, and at pH values below the pI, the protein carries a
net positive charge. IEF takes advantage of this phenomenon. By placing proteins in a pH gradient
within an electric field, the proteins will migrate to the pH where they have no net
charge, their pI. A protein with a positive net charge will migrate toward the cathode through
the pH gradient, becoming progressively less positively charged, until it reaches its pI. A negatively
charged protein will move toward the anode, progressively becoming less negatively
charged, until it reaches its pI. Once there, if a protein drifts from its pI, it gains a net charge
and will then migrate back to its pI. As the slope of the pH gradient and the strength of the
electric field determine the degree of resolution, IEF is carried out under very high
voltage (typically in excess of 1000 volts). Additionally, IEF is usually performed under denaturing
conditions to provide the highest resolution and the clearest results.
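
The relationship between pH, net charge, and migration direction described above can be illustrated with a minimal sketch. It is not part of any protocol in this chapter. It assumes a simple Henderson-Hasselbalch model, and the pKa values, the function names, and the example sequence are all hypothetical choices made for illustration. Real proteins deviate from such estimates.

```python
# Approximate pKa values for ionizable groups (assumed, for illustration only).
PKA_POSITIVE = {"K": 10.8, "R": 12.5, "H": 6.5, "N_TERM": 8.6}
PKA_NEGATIVE = {"D": 3.9, "E": 4.1, "C": 8.5, "Y": 10.1, "C_TERM": 3.6}


def net_charge(sequence, ph):
    """Estimate the net charge of a polypeptide at a given pH."""
    charge = 0.0
    for group, pka in PKA_POSITIVE.items():
        count = 1 if group == "N_TERM" else sequence.count(group)
        # Fraction of the group that is protonated (carries +1).
        charge += count / (1 + 10 ** (ph - pka))
    for group, pka in PKA_NEGATIVE.items():
        count = 1 if group == "C_TERM" else sequence.count(group)
        # Fraction of the group that is deprotonated (carries -1).
        charge -= count / (1 + 10 ** (pka - ph))
    return charge


def isoelectric_point(sequence, low=0.0, high=14.0, tolerance=1e-4):
    """Find the pH of zero net charge by bisection.

    Net charge falls steadily as pH rises, so bisection converges.
    """
    while high - low > tolerance:
        middle = (low + high) / 2
        if net_charge(sequence, middle) > 0:
            low = middle
        else:
            high = middle
    return (low + high) / 2


def migration_direction(sequence, ph):
    """Return the electrode toward which the protein moves at this pH."""
    charge = net_charge(sequence, ph)
    if charge > 0:
        return "cathode"
    if charge < 0:
        return "anode"
    return "none"


if __name__ == "__main__":
    example = "ACDEFGHIKLMNPQRSTVWY"  # hypothetical sequence
    pi = isoelectric_point(example)
    print(f"estimated pI: {pi:.2f}")
    print("below pI:", migration_direction(example, pi - 1))
    print("above pI:", migration_direction(example, pi + 1))
```

In this model a protein placed below its estimated pI moves toward the cathode and one placed above it moves toward the anode, which is the focusing behavior the text describes.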

The original IEF method depended on carrier ampholyte-generated pH gradients in
polyacrylamide tube gels (Klose 1975; O'Farrell 1975). Carrier ampholytes are small amphoteric
molecules, which have high buffering capacities near their pIs. When a voltage is applied
across a mixture of these molecules, the carrier ampholytes align themselves according to
their pIs. The strong buffering capacity of each carrier ampholyte at its pI results in a continuous
pH gradient across the gel. However, the use of carrier ampholytes in 2D electrophoretic
studies has several limitations:


* Carrier ampholytes are not a well-defined mixture of molecules and suffer from batch-to-batch
variations in manufacturing, creating difficulties with reproducibility in first-dimension
separations.


* The pH gradients generated by carrier ampholytes are unstable and have a tendency to drift
toward the cathode over time. This drift reduces the reproducibility of the first dimension
as it is time-dependent. Additionally, the drift results in a flattening of the pH gradient at
each end, especially in the basic region. Thus, it is not possible to resolve proteins at
extreme pH values (i.e., above pH 9).


* It is very difficult to work with soft acrylamide tube gels. The gels can stretch or break,
resulting in variations between separations. Therefore, the quality of the results of tube-gel-based
IEF separations is often dependent on the experience of the operator.


In 1982, Bjellqvist et al. introduced an alternative method for generating immobilized pH
gradients for IEF by incorporating acrylamido buffers into a polyacrylamide gel at the time
of its casting. During polymerization, the acrylamide portion of the buffers copolymerizes
with the acrylamide and bisacrylamide monomers to form a polyacrylamide gel. The pH gradient
is formed in a fashion similar to pouring an acrylamide gradient gel, except that rather
than using two solutions of different %T, the two solutions contain a relatively acidic mixture
and a relatively basic mixture of acrylamido buffers. For improved handling, these gradients
are cast onto a plastic backing. After polymerization is complete, the gel is washed to remove
catalysts and unpolymerized acrylamide monomers, which can interfere with the separation
of proteins and reduce the sample loading capacity. Gianazza (1999) has recently reviewed
methods for casting IPG (immobilized pH gradient) gels. Although acrylamido buffers (and
carrier ampholytes) are available from several manufacturers, commercial preparations of
precast gel strips incorporating immobilized pH gradients are a simpler (but more expensive)
alternative. The gel strips can be rehydrated in any desired solution suitable for the first-dimension
IEF separation.

IPGs for first-dimension separations provide a number of advantages over carrier
ampholyte-based separations.


* The problem of cathodic drift is eliminated, because the pH gradient is covalently fixed
within the gel. Thus, broader pH range separations can be performed, allowing the separation
of more highly acidic and highly basic proteins.


* IPG gels have a higher load capacity for proteins than do gels cast with carrier ampholytes
(Bjellqvist et al. 1993).


* Acrylamido buffers are a small set of well-characterized molecules, allowing for very reproducible
manufacturing, thus eliminating batch-to-batch variation.


Precast IPG gel strips also offer some further advantages.


* Precast IPG gels reduce preparation time and effort and greatly improve the reproducibility
of the pH gradient.


* Gel strips are cast onto a plastic backing, preventing stretching or breaking, and generally
easing the handling of gels.


* Samples can be applied to the gel strips during gel rehydration (Rabilloud et al. 1994a;
Sanchez et al. 1997).


Protocol 5 presents two methods for carrying out the first-dimension IEF step of 2D gel 
electrophoresis. The first is a flat-bed IEF procedure. The second method uses a high-voltage 
self-contained IEF system, which requires fewer manipulations of the gel and is capable of 
faster separations. 


THE SECOND DIMENSION: SEPARATION OF PROTEINS BY MOLECULAR WEIGHT 


Introduction to SDS-PAGE 


SDS-PAGE is a method of separating proteins on the basis of their molecular weight. The
technique is performed in a polyacrylamide gel containing the detergent SDS. SDS is an
anionic surfactant with a 12-carbon alkyl tail and an ionic head group. It denatures proteins
by wrapping around the polypeptide backbone in a ratio of ~1.4 g of SDS per gram of
protein. The bound SDS gives the proteins a net negative charge per unit mass, overwhelming
the intrinsic electrical charge of the sample protein. In addition, SDS disrupts hydrogen
bonds, blocks hydrophobic interactions, and partially unfolds the protein, thus eliminating
secondary and tertiary structures. Proteins can be completely unfolded by reacting them with
reducing agents, such as DTT or 2-mercaptoethanol, in the presence of SDS. These agents
cleave the disulfide bonds within cystine residues. Under these conditions, the unfolded proteins
form ellipsoid shapes surrounded by SDS. During electrophoresis in a sieving gel, there
is a linear relationship between the logarithm of the molecular mass of the protein and the
relative migration distance of the SDS-polypeptide micelle (Weber and Osborne 1969;
Laemmli 1970). However, this relationship does not hold for all protein types. For example,
glycoproteins with a high carbohydrate content migrate at a slower rate than expected
based on their molecular mass, because they bind smaller amounts of SDS relative to other
proteins of the same molecular weight (Dunbar 1987).
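
The log-linear relationship described above is what allows the mass of an unknown spot to be estimated from marker proteins run on the same gel. The following is a minimal sketch of that calculation, assuming a straight-line least-squares fit of log10(mass) against relative migration distance. The function names and the marker values are hypothetical and are not taken from this chapter.

```python
import math


def fit_line(xs, ys):
    """Ordinary least-squares fit; returns (slope, intercept)."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def estimate_mass_kd(relative_migration, standards):
    """Estimate molecular mass (kD) from relative migration distance.

    standards: list of (relative_migration, mass_kd) pairs for marker
    proteins run on the same gel.
    """
    xs = [rf for rf, _ in standards]
    ys = [math.log10(mass) for _, mass in standards]
    slope, intercept = fit_line(xs, ys)
    return 10 ** (slope * relative_migration + intercept)


if __name__ == "__main__":
    # Hypothetical marker data: (relative migration, mass in kD).
    markers = [(0.15, 97.0), (0.30, 66.0), (0.50, 45.0),
               (0.70, 30.0), (0.85, 20.0)]
    print(f"{estimate_mass_kd(0.60, markers):.1f} kD")
```

As the text notes, such an estimate is unreliable for proteins that bind SDS atypically, such as heavily glycosylated proteins.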

SDS-PAGE is normally performed with a discontinuous buffer system. In this system, the
proteins migrate in the order of their mobility in tight zones, called the stack, between the
leading and trailing ions of the discontinuous buffer. The most common buffer system uses
glycine as the trailing ion as described by Laemmli (1970). Other buffer systems can be used,
particularly the Tris-tricine system of Schägger and von Jagow (1987), for resolution of
polypeptides below 10 kD in size (for an extensive discussion of SDS-PAGE and alternative
buffer systems, see Chapter 2).

Altering the amounts of acrylamide and cross-linker controls the pore size, and thus the
sieving properties, of the polyacrylamide gel. Acrylamide and cross-linker concentrations are
described in %T and %C.


* %T is the total weight of acrylamide and cross-linker expressed as a percentage of the total
volume.


* %C is the weight of cross-linker expressed as the percentage of total weight of both the
acrylamide and cross-linker.


For example, a 1-liter stock solution of acrylamide at 30%T and 2.5%C would contain
292.5 g of acrylamide and 7.5 g of cross-linker. When %C remains constant and %T is increased,
the pore size of the gel decreases. When %T remains constant and %C is increased,
the pore size of the gel follows a parabolic function, where at high and low values of C, the
pores are large. The minimum pore size is at C = 4%. Table 4.1 provides recommended acrylamide
concentrations for a variety of protein molecular-weight ranges.
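
The %T and %C definitions translate directly into the weights needed for a stock solution. The short sketch below reproduces the 30%T, 2.5%C example from the text. The function name and argument names are illustrative assumptions.

```python
def acrylamide_recipe(percent_t, percent_c, volume_ml):
    """Return (acrylamide_g, crosslinker_g) for a stock solution.

    percent_t: total monomer (acrylamide + cross-linker), g per 100 ml.
    percent_c: cross-linker as a percentage of total monomer weight.
    """
    total_g = percent_t * volume_ml / 100
    crosslinker_g = total_g * percent_c / 100
    acrylamide_g = total_g - crosslinker_g
    return acrylamide_g, crosslinker_g


if __name__ == "__main__":
    acrylamide, crosslinker = acrylamide_recipe(30, 2.5, 1000)
    print(f"acrylamide: {acrylamide} g, cross-linker: {crosslinker} g")
```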

Second-dimension gels are made as either homogeneous gels with the same %T and %C 
throughout or as gradient gels with a varied %T but a constant %C. Homogeneous gels offer 
the best resolution for a particular molecular-weight window and are the easiest to pour repro- 
ducibly. However, a gradient gel is best when proteins over a wide range of molecular weights 
are to be analyzed. Gradient gels provide a wider linear separation interval, and the resulting 
protein spots are sharper, because the decreasing pore size functions to minimize diffusion. 


TABLE 4.1. Recommended acrylamide concentrations for protein separation


                     % Acrylamide in resolving gel    Separation size range (m.w. x 10^-3)
Single percentage    5                                36-200
                     7.5                              24-200
                     10                               14-200
                     12.5                             14-100(a)
                     15                               14-60(a)
Gradient             5-15                             14-200
                     5-12                             10-200
                     10-20                            10-150


(a) The larger proteins fail to migrate significantly into the gel at this percentage of acrylamide.


An Overview of Second-dimension SDS-PAGE


Following the separation of proteins by IEF, the second-dimension separation is carried out 
by SDS-PAGE. This can be performed on a variety of flatbed or vertical gel electrophoresis 
systems depending on the throughput and resolution desired. 


* Flatbed systems provide excellent resolution and relatively rapid separation and can be
used with commercially available precast gels. However, such systems have limited capacity
and are only recommended for micro-preparative separations. These systems will not be
discussed further.


* Vertical systems have become favored for their relative ease of use and ability to perform
multiple separations simultaneously. Vertical 2D is generally run on 1.0- or 1.5-mm thick
gels. For rapid results using 7-cm IPG for the first dimension, minigel units are used. Such
units are available from a variety of companies including Amersham Biosciences, Bio-Rad,
and Invitrogen. In addition, precast vertical SDS-PAGE gels are also available from Bio-Rad,
FMC, and Invitrogen. For increased resolution, 11-, 13-, and 17-cm IPG gels can be run on
standard size 16 x 16-cm or 20 x 20-cm gels. The multiple gel casters available with these
systems improve reproducibility of the second-dimension gels. Precast gels for the Bio-Rad
20 x 20-cm gel system are available.


* For maximal resolution, reproducibility, and capacity, use large-format gels (e.g., the
ETTAN DALT II 26 x 20-cm gel from Amersham Biosciences or the 22 x 26-cm Investigator
gels from Genomic Solutions). These systems are used with 18- and 24-cm IPG gels, and
they can run up to 20 second-dimension SDS-PAGE gels simultaneously. They also provide
multiple gel-casting systems to help improve the reproducibility of gels. Precast gels for the
Investigator and the ETTAN DALT II systems are available.


The second dimension consists of four steps: 

1. Preparation of the SDS-PAGE gel (Protocols 6 and 7). 

2. Equilibration of the IPG strips in SDS buffer (Protocol 8). 

3. Placement of the equilibrated IPG gels on the SDS-PAGE gel (Protocol 9). 
4. Electrophoresis of the second-dimension gel (Protocol 9). 


STAINING 2D GELS 


A large number of methods have been developed to detect proteins separated by elec- 
trophoresis on 2D gels, with the most common ones involving the binding of a dye or silver 
ions to the proteins. The choice of method is greatly dependent on the experimental ques- 
tions being asked. The complexity of the steps, the sensitivity of detection, compatibility with 
mass spectrometry, cost, and equipment required vary among the staining methods. Five dif- 
ferent procedures are presented in Protocols 10 through 14. 


* The colloidal Coomassie method is commonly used with preparative gels, has a detection
range of 8-50 ng of protein within a spot, and takes an average of 1-2 days to complete.
Although the least sensitive of the methods presented here, it is broadly used because it is
simple and compatible with mass spectrometry (see Protocol 10; for conventional
Coomassie Blue R250 staining, see Chapter 2).


* Classical silver-staining methods are commonly used with analytical gels because of their
highly sensitive range of detection, typically 2-4 ng of protein per spot. However, classical
silver-staining methods are complex and are incompatible with mass spectrometry, because
the glutaraldehyde used in the sensitization steps can cross-link proteins, and the silver ions
can interfere with mass spectrometric analysis. Protocol 11 provides a very sensitive ammoniacal
silver stain based on the method of Merril and Harrington (1989).


* Mass spectrometry-compatible silver staining is an important variation on conventional
silver staining, due to the rising importance of mass spectrometry in proteomics. Protocol
12 provides a silver staining method that is compatible with mass spectrometry, based on
the method of Yan et al. (2000b).


* Fluorescent stains for 2D gel electrophoresis have advanced greatly with the recent development
of the ruthenium-based fluorescent dye SYPRO Ruby (Molecular Probes) (see
Table 2.1, Chapter 2). This dye provides sensitivity similar to that of ammoniacal silver but
without the complex methodology, limitation on linear dynamic range, or the problems
with mass spectrometry compatibility. However, these dyes are very expensive and require
fluorescent scanners for analysis (see Protocol 13).


* Phosphoprotein stains are becoming increasingly important as it is clear that the phosphorylation
state of proteins is a key regulator of their function (see Chapter 9). Protocol
14 uses the GelCode Phosphoprotein staining kit produced by Pierce.


Although autoradiography has also been used to visualize 2D gels, it is beyond the scope
of this chapter. For recent reviews, see Link (1999) and Lee and Harrington (1999).


Colloidal Coomassie Staining 


Protocol 10 is a modification of the method of Neuhoff et al. (1988), which involves the binding
of Coomassie Blue G250 to proteins. This dye complexes with basic amino acids (arginine,
lysine, and histidine) and with tyrosine, and produces a very low background due to the colloidal
properties of the stain. This property prevents the Coomassie dye from penetrating the
gel, eliminating the requirement for the long destaining periods characteristic of Coomassie
R250 staining.


Ammoniacal Silver Staining


Silver staining of polyacrylamide gels (Protocol 11) was first introduced by Switzer et al.
(1979) and has rapidly become the most popular high-sensitivity staining method for proteins.
Problems with high backgrounds, reproducibility, and silver mirrors resulted in many
improvements to this initial protocol over the years. In a recent review, Rabilloud et al.
(1994b) found more than 100 different protocols in the literature. However, all protocols
contain at least the following five basic steps: fixation, sensitization, silver impregnation,
image development, and cessation of development and image stabilization.

We have found that the ammoniacal silver staining process presented here provides high
sensitivity, in the range of 2-4 ng per protein spot, and works well with gels down to a thickness
of 1 mm. Due to its high sensitivity, silver staining is very susceptible to interference
from a variety of sources. Exceptional cleanliness is necessary, and reagent and water quality
is critical. Deionized water with a resistivity of more than 15 MΩ·cm is necessary for all solutions
and washes. Distilled water should not be used, for it tends to give erratic results. All
chemicals should be of high grade.


Mass Spectrometry-compatible Silver Staining


Ammoniacal silver staining is one of the most sensitive methods for detecting proteins on an
SDS-PAGE gel. However, it and other standard silver-staining methods are incompatible with
mass spectrometry, which is fast becoming the best way to identify proteins isolated on 2D
gels. This is because the proteins in gels to be analyzed by mass spectrometry must remain
unmodified, so many of the common sensitizing agents used in silver staining procedures
(e.g., glutaraldehyde and strong oxidizing agents) cannot be employed. Additionally, many
labs find it advantageous to purchase dedicated kits, where reagent quality is assured, to carry
out silver staining. Kits for MS-compatible silver staining are available from a variety of
sources, including Amersham Biosciences, Bio-Rad, and Pierce. The method provided in
Protocol 12 is based on the Amersham Biosciences PlusOne silver stain kit as described by
Yan et al. (2000b). These authors found that this method is compatible with MALDI and ESI-MS
and works with semipreparative protein loads without resulting in negative staining.
However, this process is less sensitive than standard silver-staining methods. Other mass
spectrometry-compatible silver-staining methods are described by Shevchenko et al. (1996),
Scheler et al. (1998), Gharahdaghi et al. (1999), and Sinha et al. (2001).


Fluorescent Staining of Proteins with SYPRO Ruby 


The development of new mass spectrometry instrumentation and its application to the field
of proteomics have been a great help in the identification of proteins isolated by 2D electrophoresis
(see Chapter 8). Identification of proteins is no longer limited to those that are
highly abundant, with protein identification at the femtomole level now being feasible
(Jensen et al. 1997; Gatlin et al. 1998). However, to make this sensitivity applicable to 2D gel
electrophoresis, sensitive and compatible staining methods are necessary. Silver staining provides
very sensitive detection of proteins; however, it is complicated and, without modifications
that reduce its sensitivity, it is incompatible with mass spectrometry. A sensitive ruthenium-based
fluorescent dye (SYPRO Ruby) has recently been developed by Molecular
Probes. Although the structure of this dye is not in the public domain, it has been reported
that SYPRO Ruby is a transition metal organic complex that binds directly by electrostatic
mechanisms (Patton 2000). Similar to silver stains, SYPRO Ruby interacts with basic amino
acids, including lysine, arginine, and histidine (Lopez et al. 2000). Recent studies by Yan et al.
(2000a) and Lopez et al. (2000) have shown that this dye is as sensitive as silver staining and
completely compatible with MALDI mass spectrometry. The staining method is also simple,
having only a fix, staining, and single wash step. However, SYPRO Ruby is very expensive.
Thus, this dye should be limited to micropreparative gels as the cost to run the large number
of gels needed for quantitative studies may be prohibitive.


Phosphoprotein Staining with the GelCode Phosphoprotein Staining Kit 


The phosphorylation state of a protein has an important role in the regulation of a wide variety
of cellular processes. As a result, there has been a great deal of interest in detecting the
phosphorylation status of proteins. The method presented in Protocol 14 uses the GelCode
phosphoprotein staining kit (Pierce Chemical Company). This method depends on the
hydrolysis of the phosphoprotein phosphoester linkage using 0.5 N sodium hydroxide in the
presence of calcium ions. The gel containing the newly formed insoluble calcium phosphate
is then treated with ammonium molybdate in dilute nitric acid. The resultant insoluble
nitrophospho-molybdate complex is finally stained with Methyl Green. After destaining,
phosphoproteins are stained green. The detection limits depend on the degree of phosphorylation
of the protein.


TRANSFER OF PROTEINS FROM 2D GELS 


Proteins can be transferred from 2D gels in the same manner as those transferred from one- 
dimensional SDS gels (see Chapter 6, Protocol 2). Following the second-dimension run, the gel 
is placed adjacent to a membrane, and the pair is sandwiched between blotting paper. An elec- 
trical field is applied perpendicular to the slab gel, driving the proteins to migrate from the gel 
onto the membrane. The process is commonly known as western blotting and was first performed
by Towbin et al. (1979). Once bound to the membrane, proteins can be stained to
examine the total protein pattern (see Protocols 17-20) or specific proteins can be characterized
by reactions with antibodies (immunoblotting) or other detection reagents. Processing
the membrane rather than the gel with a protein detection reagent provides several advantages. 


* The proteins are bound to the membrane's surface, allowing very small volumes of reagents 
to be used. 


* The protein separation pattern on the membrane does not deteriorate due to diffusion. 


* Membranes are much more robust than gels, enabling them to survive the rigors of protein 
detection. 


* Membranes can become part of the experimental record. 


Membranes containing bound proteins can be used analytically as an end in themselves 
or they can be a step preceding another analytical method. In the latter case, all of the pro- 
teins on the membrane are stained, and selected spots are excised for further analysis by 
either mass spectrometry or protein sequencing (Matsudaira 1987) (see Chapter 6). 


Transfer Equipment 


The equipment for transfer of proteins from gels to membranes falls into two categories: tank 
and semidry. Within the transfer tank, a cassette holds the sandwich of gel, membrane, and 
blotting paper suspended vertically in buffer. Electrode panels are placed parallel to the cas- 
sette. Typically, liters of buffer are required in the tank to maintain the current and pH 
throughout the transfer. Since these units require high field strength to drive the proteins 
from the gel onto the membrane and may require extended times to complete efficient trans- 
fers, cooling of the transfer unit is typically required to avoid overheating the apparatus. 
(Overheating will result in poor transfer of proteins to the membrane and, at worst, can cause 
severe damage to the transfer equipment.) The advantages of tank systems are their efficien- 
cy of protein transfer: Transfers can be run for extended periods of time (allowing more com- 
plete transfer) and proteins from several gels can be transferred simultaneously. 

Semidry units transfer proteins from the gel to the membrane when they are sandwiched
between buffer-saturated stacks of blotting paper. The electrical field is applied parallel to the
stack. Compared with tank transfer units, semidry transfer units use much less buffer, and
because the electrode panels are closer together, less voltage is required for transfer. Typically,
the transfer current is set according to the surface area of the gel exposed to the electrode
panels. A recommended current for transferring proteins in a semidry apparatus is 0.8 mA/cm²
of gel. At this current, heating is minimized and the buffering is adequate to transfer gels for
up to 2 hours. The disadvantage is that no more than two large gels can be efficiently transferred
at one time and that, due to buffer depletion, the units cannot run longer than ~2 hours.
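
Because the semidry current is set from the gel surface area, the power supply setting is a one-line calculation. The sketch below assumes that the recommended 0.8 mA figure applies per square centimeter of gel, and the function name and gel dimensions are illustrative only.

```python
RECOMMENDED_MA_PER_CM2 = 0.8  # assumed to be a current density per cm^2 of gel


def semidry_current_ma(gel_width_cm, gel_height_cm,
                       density_ma_per_cm2=RECOMMENDED_MA_PER_CM2):
    """Return the constant-current setting (mA) for one gel footprint."""
    return gel_width_cm * gel_height_cm * density_ma_per_cm2


if __name__ == "__main__":
    # Example footprints: a 20 x 20-cm slab gel and an 8 x 7-cm minigel.
    print(f"20 x 20 cm: {semidry_current_ma(20, 20):.0f} mA")
    print(f"8 x 7 cm:   {semidry_current_ma(8, 7):.0f} mA")
```

The setting depends on the footprint of the gel on the electrode, so it should be recalculated whenever a different gel format is transferred.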


Transfer Membranes


Many types of membranes are available. Nitrocellulose is frequently used for total protein
staining methods and immunoblotting, whereas polyvinylidene difluoride (PVDF) is required
for protein sequencing. Some nitrocellulose membranes are supplied with material incorporated
on or into the membrane to add support. These supported nitrocellulose membranes are
preferred as they have more strength than frangible unsupported nitrocellulose. Nylon membranes
are also available; however, these are seldom used with proteins due to difficulties with
staining. Nylon membranes are more commonly used with nucleic acid transfers.


IMPORTANT TIPS WHEN TRANSFERRING PROTEINS


* Use gloves at all times when handling gels and membranes. Fingerprints can cause staining
artifacts and contaminate the membranes. Handle membranes from the edges only.


* To prevent diffusion, transfer gels immediately after the SDS electrophoresis separation
is completed.


* Do not equilibrate second-dimension gels when using Tris/glycine/methanol transfer
buffer. Although equilibration is sometimes recommended, because the gel may shrink
in the methanol buffer, our lab has not found this to be a problem. On the other hand,
equilibration may decrease the transfer of high-molecular-mass proteins (>100 kD) and
contribute to the loss of lower-molecular-mass proteins (<20 kD). Such losses are much
more prevalent in thinner (e.g., 1 mm) gels and lower-percentage gels.


* Equilibrate gels to be transferred in the presence of CAPS buffer with CAPS transfer
buffer for 5 minutes before transfer to reduce the amount of Tris and glycine present
within the gel. CAPS is the buffer preferred for transfers that will be used for amino acid
sequencing, as high concentrations of glycine will interfere with sequencing.


* Take care to remove all air bubbles from the transfer sandwich. Trapped air will block the
transfer of proteins in the region of the air bubbles.


* Make sure to wet the nitrocellulose membranes in water and PVDF membranes in
methanol prior to equilibrating them in transfer buffer. If a membrane dries out at any
time during the process, it must again be wetted in water (nitrocellulose) or methanol
(PVDF).


* Wet the membranes by slowly lowering them onto the surface of the liquid. After a
moment, gently submerge the membrane. This ensures even wetting without trapping
air.


* Nitrocellulose membranes age and will eventually no longer wet properly. An old membrane
may show yellowing around the edges. A useable membrane should rapidly absorb
the liquid and change color from pure white to a light gray. If the membrane does not
wet properly, use a fresh membrane. Fresh membranes always provide the best results.


* Up to 0.1% SDS can be added to the transfer buffer to improve the transfer of proteins
out of gels, especially when transferring large proteins. However, the presence of SDS
may decrease the efficiency of protein binding to the membrane, resulting in losses during
staining. Thus, SDS should be present during protein transfers only when high-molecular-weight
proteins are of interest.


IMAGE ACQUISITION AND ANALYSIS OF 2D GELS


Image Acquisition 


The varied methods of visualizing proteins on 2D gels require image acquisition techniques
that can detect diverse chromogenic, radioactive, and fluorescent signals. A comprehensive
review of all of the imaging methods is beyond the scope of this chapter but is available in
reviews by Patton et al. (1999), Patton (1995), and Sutherland (1993). Below are described
three main types of devices presently used to acquire images of 2D gels.


Document scanner 


The least expensive of these devices is a document scanner. For densitometry analysis of gels,
a document scanner must have the ability to acquire images in a transmission mode.
Additionally, a scanner with high resolution (2400 x 1200 dpi), 14-bit pixel depth, and linearity
over a wide optical density range (i.e., above 3.4 OD units) is desirable. The
ImageScanner (Amersham Biosciences) and the Model GS-710 Imaging Densitometer (Bio-Rad)
meet these requirements.


Charge-coupled device camera


The charge-coupled device (CCD) camera is probably one of the most versatile acquisition
devices for electrophoretic applications. CCD cameras have been used to acquire images from
a variety of stained gels (including silver, Coomassie, and fluorescent stains), autoradiographs,
and stained membranes. CCD-based systems are available from a variety of companies,
including Bio-Rad, Amersham Biosciences, BioImage Products, and PerkinElmer
Wallac. In the past, these systems have had limited sensitivity and resolution, especially
when dealing with large gels. With the advent of cooled CCD cameras, the problem of sensitivity
has been greatly reduced. Recent advances in the ARTHUR system (PerkinElmer
Wallac) have also decreased the resolution problem with large gels. This system scans the gel
using a cooled CCD camera at a fixed focal length, acquiring multiple adjacent overlapping
frames that are then stitched together to produce a single image. Because of this scanning
mechanism, the resolution is independent of the size of the gel. The addition of transilluminators
or top illumination of a specific wavelength has allowed CCD camera-based systems
to also be used with fluorescent stains. Increasing the signal-to-noise ratio can increase the
sensitivity of CCD cameras for fluorescent dyes. One way to achieve this is by using camera
binning, which is available on some instruments. Binning involves summing data from image
pixels in 2 x 2, 3 x 3, or 4 x 4 blocks, which has the effect of increasing the signal-to-noise ratio
by factors of 4, 9, and 16, respectively. Binning, however, decreases the resolution of the
image, and thus may only be useful when the detection of very low-abundance proteins is
required. A variety of methods of acquiring 2D gel images using CCD camera systems are
available in the recent review by Patton et al. (1999).
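
The trade-off between signal and resolution in binning is easy to see in a small sketch. The code below sums square blocks of pixels in software, using a hypothetical 4 x 4 image. It only illustrates the arithmetic and is not a model of how any particular camera performs binning on the chip.

```python
def bin_image(pixels, factor):
    """Sum factor x factor blocks of a 2D pixel array (list of rows).

    Rows or columns that do not fill a complete block are discarded.
    """
    rows = len(pixels) // factor
    cols = len(pixels[0]) // factor
    binned = []
    for r in range(rows):
        binned_row = []
        for c in range(cols):
            block_sum = sum(
                pixels[r * factor + dr][c * factor + dc]
                for dr in range(factor)
                for dc in range(factor)
            )
            binned_row.append(block_sum)
        binned.append(binned_row)
    return binned


if __name__ == "__main__":
    # Hypothetical 4 x 4 image with pixel values 1 through 16.
    image = [[1, 2, 3, 4],
             [5, 6, 7, 8],
             [9, 10, 11, 12],
             [13, 14, 15, 16]]
    for row in bin_image(image, 2):
        print(row)
```

With 2 x 2 binning the 4 x 4 image becomes a 2 x 2 image: each output pixel collects the signal of four input pixels, and the image has half the resolution in each direction.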


Laserbased densitometers 


A large number of laser-based densitometers are available for 2D analysis. These systems tend 
to have a broader linear OD range than scanners and are less susceptible to saturation effects 
than other detectors. Laser-based systems offer high resolution (50 µm) and sensitivity
(many providing >4000 levels of gray scale). Many of these systems have been designed with
lasers of multiple wavelengths in order to work with a variety of fluorescent dyes. 
Additionally, a number of these instruments such as the PhosphorImager from Molecular
Dynamics will also read phosphor storage imaging plates, which can be used with radiolabeled
gels to detect [32P]- and [35S]methionine-labeled proteins. The advantage of phosphor
imaging plates is that they provide increased sensitivity, good spatial resolution of signal, and 
a broad linear dynamic range of response as compared to standard X-ray film. Systems with 
these characteristics are available from Molecular Dynamics, Bio-Rad, and Fuji Medical
Systems, as well as other vendors. 


Analysis of 2D Gels 


Given the inherent complexity of 2D gels, which typically display thousands of proteins on a 
single gel, they are best analyzed using a computer. Using such systems, the investigator can 
detect and quantify even faint spots, quantitatively compare 2D images with each other, and
identify protein expression changes between gels representing varied conditions. Currently, 
there are several major commercially available 2D analysis software systems: Z3 (Compugen),
Melanie III (Genebio and Bio-Rad), PDQuest (Bio-Rad), and the various versions of Phoretix
2D software marketed as Imagemaster 2D (Amersham Biosciences), HT Analyzer (Genomic
Solutions), and Phoretix 2D Advanced (PerkinElmer Wallac) (Raman et al. 2002). Each of these
software systems differs slightly in various aspects, including the computer platforms on which
they run, the graphical user interface, the amount of gel manipulation allowed, and file formats.
However, they all follow the same general steps for analysis of the gels. It is beyond the
scope of this chapter to contrast these software systems or to give specific instructions of their 
operations. Detailed information on each software package is available at their respective 
home pages on the World Wide Web (see Two-dimensional Analysis Software Sites at the end 
of this chapter). Here, we review the general steps taken in the analysis of 2D gels. 


Image visualization and manipulation 


First, the image of the gel must be displayed in the program. All three of the programs accept 
TIFF files, so that is the recommended format for saving the gel images. Once the file is open, 
the image may need to be edited so that it is in the correct orientation and extraneous areas 
are removed from the image. This can be achieved using the rotate and crop functions in 
these programs. In addition, the display of the image may need to be optimized. Modern 
image acquisition systems are able to scan 2D gels with 12-bit or 16-bit/pixel, with 4,096 to 
65,536 gray levels. Common computer screens are only able to display 256 color or gray lev- 
els; thus, the programs must map intensity values between the scanned image gray levels and 
the screen gray levels. To visualize faint spots, increase the contrast of the image by using the 
transform or adjust contrast feature in these programs. Note that changing the contrast only 
affects how the image is displayed on the screen and does not alter the underlying data used 
for spot detection and quantification. Upon completion of the first modification to the gel, 
the program allows the new image to be saved in a format specific for that program. All three 
programs also allow manipulation of multiple gel images simultaneously. 
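
The mapping from scanned gray levels to screen gray levels can be sketched as a linear window. This is a minimal illustration, not the algorithm of any of the programs named above; the function name to_display_levels and the window limits low and high are assumptions made for the example.

```python
def to_display_levels(pixels, low, high, levels=256):
    """Map raw intensities in [low, high] linearly onto 0..levels-1.

    Values outside the window are clipped. The raw data are not modified;
    a new display image is returned.
    """
    if high <= low:
        raise ValueError("high must be greater than low")
    scale = (levels - 1) / (high - low)
    display = []
    for row in pixels:
        display.append([min(levels - 1, max(0, round((v - low) * scale)))
                        for v in row])
    return display


# A 16-bit scan: mapping the full range leaves faint spots near black,
# whereas a narrow window (0-510) spreads them over the 256 screen levels.
raw = [[0, 100, 510, 600, 65535]]
full_range = to_display_levels(raw, 0, 65535)
narrow_window = to_display_levels(raw, 0, 510)
```

Narrowing the window is what a contrast adjustment does on screen; the list raw is left unchanged, which mirrors the point that spot quantification still uses the underlying data.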


Spot detection and quantification 


Once the gel images of interest are open and in the desired format, the next step is spot detection.
This is the most critical phase of the analysis: All other results are worthless if the spot
detection is incorrect. All three programs provide an automated detection system based on 
unique algorithms. Each program also allows alteration of the system's parameters, such as 
sensitivity, spot size, background subtraction, and number of smooths, although all of these 
parameters may not be available with all programs. Once an initial spot detection is complete, 
all of the programs allow the parameters to be altered again until an optimal detection is
achieved. Once the optimal automated detection parameters are established, they can be
applied to all of the other gels in the experiment or saved for later application. It is important 
that the spot detection parameters are consistent across all of the gels analyzed within an 
experiment. Spot detection may need to be manually edited by adding spots that were missed,
or by deleting or editing spots that were detected incorrectly, by the automated spot detection
algorithm. The spots may also be fitted to a Gaussian model, which is done automatically in
PDQuest and is an option in Melanie III. Gaussian modeling can help resolve areas of overlapping
density and streaking and improve the accuracy of the density measurement of spots. During the
spot detection step, all three programs automatically determine a number of parameters for the
spot (i.e., spot volume, area, optical density, and coordinates).
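
The spot parameters listed above can be illustrated with a minimal sketch that assumes the spot has already been detected and is supplied as a list of pixel coordinates. The function name quantify_spot, the flat background value, and the choice of an OD-weighted centroid for the coordinates are assumptions of this example, not the method used by any particular program.

```python
def quantify_spot(od_image, spot_pixels, background=0.0):
    """Return area, volume, peak OD, and centroid for one detected spot.

    od_image    -- 2D list of optical density values
    spot_pixels -- list of (row, col) tuples belonging to the spot
    background  -- flat background OD subtracted from every pixel (assumed)
    """
    values = [od_image[r][c] - background for r, c in spot_pixels]
    area = len(spot_pixels)       # number of pixels in the spot
    volume = sum(values)          # integrated OD, i.e. mean OD x area
    peak = max(values)
    # OD-weighted centroid gives the spot coordinates.
    centroid_row = sum(r * v for (r, _), v in zip(spot_pixels, values)) / volume
    centroid_col = sum(c * v for (_, c), v in zip(spot_pixels, values)) / volume
    return {"area": area, "volume": volume, "peak_od": peak,
            "centroid": (centroid_row, centroid_col)}


image = [[0.0, 0.0, 0.0],
         [0.0, 2.0, 1.0],
         [0.0, 1.0, 0.0]]
spot = quantify_spot(image, [(1, 1), (1, 2), (2, 1)])
```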


Spot matching 


Once the spots of interest have been identified on all of the gels in the data set, the next step 
is to match the spots between gels. The first step is to choose a sample that will serve as the 
reference gel for the match. The reference gel should have a large number of well-resolved, 
good-quality spots, be representative of the experiment as a whole, and possibly be a control 
gel. The purpose of the reference gel is to provide a few starting pairs or landmarks to use for 
comparison with the other gels. Although these reference points are not required with either 
the Melanie III or the Phoretix software, using them greatly increases the rate and accuracy 
of matching. Pairs matching is done in the overlay or stacked mode in the Phoretix and 
Melanie III software, respectively, and in the match tool mode in PDQuest. It is important,
although not always possible, to provide landmarks in all of the regions of the reference gel, 
especially at the edges where the greatest possible distortion of the gels may have occurred. 
Once a few starting pairs have been identified in the gel, it can be advantageous to align or 
warp the gels to the pairs, thus allowing better placement of additional pairs. Once a number 
of matches have been placed, the automated match mode can then be chosen. This occurs 
automatically as matches are added in PDQuest. 


Editing matches 


All three software packages allow visualization of matches using vectors that link the location 
of the paired spots. The lines in a given region should be reasonably parallel and of the same 
length. Vector pair lines that cross each other indicate a major problem with matching. These 
matches can be corrected by using the match editing tools of these programs. It is important 
to ensure that the spots are matched correctly for accuracy in comparisons between gels. 
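
The visual check described above (match vectors in a region should be roughly parallel and of similar length) can be approximated numerically. The sketch below is a hypothetical helper, not a feature of the software packages: it assumes all pairs come from one region of the gel and flags any match whose vector differs from the median vector by more than a user-chosen tolerance.

```python
import math
from statistics import median


def flag_suspect_matches(pairs, tolerance):
    """Return indices of matches whose vector deviates from the median vector.

    pairs     -- list of ((x_ref, y_ref), (x_gel, y_gel)) matched spot positions
    tolerance -- maximum allowed deviation, in the same units as the coordinates
    """
    vectors = [(bx - ax, by - ay) for (ax, ay), (bx, by) in pairs]
    median_dx = median(dx for dx, _ in vectors)
    median_dy = median(dy for _, dy in vectors)
    suspects = []
    for index, (dx, dy) in enumerate(vectors):
        if math.hypot(dx - median_dx, dy - median_dy) > tolerance:
            suspects.append(index)
    return suspects


matches = [((10, 10), (12, 11)),
           ((20, 10), (22, 11)),
           ((10, 20), (12, 21)),
           ((20, 20), (15, 30))]   # this vector points in a different direction
suspect_indices = flag_suspect_matches(matches, tolerance=3.0)
```

Flagged matches would then be inspected and corrected by hand with the match editing tools.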


Molecular weight and pI calibration 


In both PDQuest and the Phoretix software, the pI and molecular-weight values of every spot 
on the gel can be estimated by assigning known values to a few spots. In Phoretix 2D
Advanced software, this is done by adding values to a protein list of any gel in the matched
set. Once these values are added, they are assigned to all other matched spots in the set, and
the molecular-weight and pI values are calibrated for all other spots that fall within calculable
ranges. This is done in a similar fashion in PDQuest; however, the values are entered using
the MrpI data button.
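
How values might be estimated from a few calibrated spots can be sketched as interpolation between landmarks. The text does not state which model the programs use; this example assumes that pI varies linearly with horizontal position and that log10 of the molecular weight varies linearly with vertical position, and it returns no value outside the range spanned by the landmarks. All function names are hypothetical.

```python
import math


def interpolate(position, landmarks):
    """Piecewise-linear interpolation; None outside the calibrated range.

    landmarks -- list of (position, value) pairs with distinct positions
    """
    points = sorted(landmarks)
    if not points[0][0] <= position <= points[-1][0]:
        return None
    for (x0, y0), (x1, y1) in zip(points, points[1:]):
        if x0 <= position <= x1:
            return y0 + (y1 - y0) * (position - x0) / (x1 - x0)


def estimate_pi(x, pi_landmarks):
    """Estimate pI from horizontal position (assumed linear pH gradient)."""
    return interpolate(x, pi_landmarks)


def estimate_mw(y, mw_landmarks):
    """Estimate molecular weight from vertical position (assumed log-linear)."""
    log_mw = interpolate(y, [(pos, math.log10(mw)) for pos, mw in mw_landmarks])
    return None if log_mw is None else 10 ** log_mw


pi_marks = [(0, 4.0), (100, 7.0)]           # (x position, known pI)
mw_marks = [(0, 100000.0), (100, 10000.0)]  # (y position, known molecular weight)
pi_mid = estimate_pi(50, pi_marks)
mw_mid = estimate_mw(50, mw_marks)
```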


Synthetic and average gels 


All three software programs allow the user to create synthetic or average gels. These gels are 
important when studying variations in protein expression evident among a series of gels. In 
Melanie III, the synthetic gel is created by selecting set characteristics of all matched gels (i.e.,
all paired or all nonpaired spots) and creating a gel containing those spots. The spot taken for
the group is that which is closest to the average quantification value of the group. PDQuest
and Phoretix 2D Advanced offer the same ability; however, they create a true average spot for 
the average gel. In Melanie III, synthetic gels are commonly used as the reference gel for 
matching a set of gels, because they contain the most important spots from all gels in the sets. 
Once a set of gels is matched to the synthetic reference gel, a unique numbering
scheme is produced for the spots contained in all of the gels considered.


Normalization 


When comparing 2D gel images, there is often some variation in spot intensity between gels
that is not due to differential protein expression. This variation can be caused by differences 
in sample preparation, loading, staining, and imaging between gels. The process of compen- 
sating for this background variation is called normalization. Both PDQuest and Phoretix 2D 
Advanced allow gels to be normalized to the spot intensity of proteins known not to change 
in the experimental treatment. In addition, all three of the analysis systems provide data on
relative spot quantification, for example: 


* % OD = the optical density (OD) of a spot divided by the total OD of the entire image.


* % Volume = the volume of a spot divided by the total volume of the entire image. Volume
is defined as (the OD of a spot) x (the area occupied by a spot).


Both of these methods allow for normalization when no stable proteins are known. 
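
The % Volume definition can be written out as a short sketch. The function name percent_volume and the example gels are hypothetical; the point is that two gels loaded with different amounts of the same sample give the same relative values.

```python
def percent_volume(spot_volumes):
    """Express each spot volume as a percentage of the total volume on the gel.

    spot_volumes -- dict mapping spot ID to volume (OD x area)
    """
    total = sum(spot_volumes.values())
    return {spot: 100.0 * volume / total for spot, volume in spot_volumes.items()}


gel_a = {"spot1": 50.0, "spot2": 150.0}
gel_b = {"spot1": 100.0, "spot2": 300.0}   # same sample, twice the load
normalized_a = percent_volume(gel_a)
normalized_b = percent_volume(gel_b)
```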


Analyzing the data 


Once spots have been picked and matched, and the data values have been normalized, all 
three programs offer a variety of ways of analyzing the data. First, spots may be displayed that
have only certain characteristics, e.g., matched spots that show more than a twofold change
between gels, or all of the spots found in at least 90% of the gels of a specific
group (e.g., treatment group). Once specific groups of spots have been selected for analysis,
the changes in spot characteristics (e.g., optical density or spot volume) between individual
gels can be visually displayed in histograms. All of the software programs generate reports of
statistics (e.g., means, standard deviation, and variation). A variety of simple statistics such
as the Student's t-test can also be carried out on the data within these programs. Finally, all 
programs will allow export of the data to a spreadsheet for formatting, printing, and further 
analysis. 
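
The two selection criteria mentioned above can also be applied to exported data. The sketch below assumes normalized spot volumes exported as dictionaries keyed by spot ID; the function names and thresholds are illustrative and are not part of any of the programs.

```python
def changed_spots(volumes_a, volumes_b, fold=2.0):
    """Spots matched in both gels whose volume changes more than `fold` either way."""
    selected = []
    for spot in sorted(set(volumes_a) & set(volumes_b)):
        if volumes_a[spot] <= 0:
            continue  # a ratio cannot be formed for an absent or zero-volume spot
        ratio = volumes_b[spot] / volumes_a[spot]
        if ratio > fold or ratio < 1.0 / fold:
            selected.append(spot)
    return selected


def spots_in_most_gels(gels, min_percent=90):
    """Spots present in at least min_percent of the gels in a group.

    gels -- list of sets, each holding the spot IDs detected on one gel
    """
    all_spots = set().union(*gels)
    return sorted(spot for spot in all_spots
                  if sum(spot in gel for gel in gels) * 100 >= min_percent * len(gels))


control = {"s1": 10.0, "s2": 10.0, "s3": 10.0}
treated = {"s1": 25.0, "s2": 12.0, "s3": 4.0}
changed = changed_spots(control, treated)

group = [{"s1", "s2"}] * 9 + [{"s1"}]   # s2 is missing from one gel of ten
common = spots_in_most_gels(group)
```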


Annotating the gels 


Once information about the protein spots on the gel has been generated, it can be added to
the gel image or database using the annotation feature available in each of these programs. 
As the protein database grows in size and complexity, annotations will be essential for keep- 
ing track of the experimental data. Annotations are organized in categories such as accession 
number, landmark, pI, and molecular weight. In addition, some of the programs allow annotations
to be linked to specific data files, 2D databases (including on-line databases), or any
other type of file. This is very valuable, because the information from on-line databases, such 
as SWISS-2DPAGE, can be added to the annotations easily. A fully annotated gel is known as 
a Master Gel. Master gels can be used to copy data into other gels to identify protein spots by 
gel comparison. The Phoretix 2D Advanced software also provides a Web page building component
that allows production of on-line databases based on the investigator's gels. This can
also be achieved with PDQuest data using Bio-Rad's PDQWeb software.


PROTOCOL 1 


Preparation of Rat Liver Protein Extract for 
2D Gel Electrophoresis 


Liver, LIKE MANY ANIMAL TISSUES, CAN BE PREPARED for 2D electrophoresis simply by homog- 
enization in a suitable solubilization solution followed by centrifugation. Additional steps to 
concentrate protein or reduce the level of interfering substances are not necessary. The 
extraction solution used here contains urea, thiourea, and the amidosulfobetaine detergent 
ASB-14, which in combination, maximize the number of proteins solubilized. 


MATERIALS 
CAUTION: See Appendix 3 for appropriate handling of materials marked with <!>. 
» Reagents
Dithiothreitol (DTT) (1.2 M) <!>
Extraction solution
  7 M urea <!>                                4.2 g
  2 M thiourea <!>                            1.52 g
  2% (w/v) ASB-14 (Calbiochem)                0.2 g
  0.5% (v/v) Pharmalyte 3-10 (AP Biotech)     50 µl
  H2O                                         to 10 ml
Prepare the solution fresh or store at -80°C for up to 6 months. Cool on ice before use.
Phenylmethylsulfonyl fluoride (PMSF) <!> (100 mM) in isopropanol <!>
Store at 4°C or lower for up to 1 month.
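
The amounts in the extraction solution recipe follow from the stated concentrations and the 10-ml final volume. The short sketch below reproduces the arithmetic; the molar masses (urea 60.06 g/mol, thiourea 76.12 g/mol) are standard values that are not given in the text, and the function names are hypothetical.

```python
MOLAR_MASS_G_PER_MOL = {"urea": 60.06, "thiourea": 76.12}


def grams_for_molarity(molarity, volume_ml, molar_mass):
    """Mass of solute (g) needed for the given molarity in volume_ml of solution."""
    return molarity * (volume_ml / 1000.0) * molar_mass


def grams_for_percent_wv(percent, volume_ml):
    """Mass of solute (g) for a % (w/v) solution: grams per 100 ml."""
    return percent / 100.0 * volume_ml


urea_g = grams_for_molarity(7, 10, MOLAR_MASS_G_PER_MOL["urea"])
thiourea_g = grams_for_molarity(2, 10, MOLAR_MASS_G_PER_MOL["thiourea"])
asb14_g = grams_for_percent_wv(2, 10)
```

The results round to the 4.2 g, 1.52 g, and 0.2 g listed in the recipe, and the same functions can be used to scale the recipe to other volumes.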
» Equipment


Liquid nitrogen <!> 

Microfuge (refrigerated) 

Mortar and pestle 

Pestle (plastic or metal) to fit 1.5-ml microfuge tube (e.g., Labscientific) 


» Cells and Tissues 


Rat liver 
If the rat is exsanguinated before harvesting the liver, there will be less contamination with serum 
proteins. Livers should be quick-frozen with liquid nitrogen as soon as possible following harvest, 
and, if not used immediately, they should be stored at -80°C until needed.


METHOD


1. If not already frozen, place some of the liver in a mortar and fill the mortar with liquid
nitrogen to keep the liver frozen. Use a pestle to break the liver into small pieces.

2. Weigh small pieces of frozen liver, and transfer 10-40 mg into a microfuge tube in ice.
Work rapidly to minimize thawing of the liver sample. Store the remainder of the liver at
-80°C.

3. Pipette 1 ml of cold extraction solution per 40 mg of liver into the tube with the liver
sample. Add 10 µl of 100 mM PMSF per 1 ml of extraction solution.

4. Thoroughly grind the liver sample using a small pestle designed to fit in a microfuge tube.

5. Add 50 µl of 1.2 M DTT per 1 ml of extract and vortex.

6. Centrifuge the sample in a microfuge at maximum speed for 10 minutes at 4°C.

7. Measure the protein concentration of the supernatant (see Appendix 2).

8. Divide the supernatant into 100-µl aliquots, and if not used immediately for IEF, store the
protein at -80°C.

Thaw frozen aliquots only once to avoid the adverse effects of multiple freeze-thaw cycles.

9. Proceed to Protocol 5 to perform first-dimension IEF.


In the case of rat liver proteins, for first-dimension IEF, the protein concentration should be
5-10 mg/ml. The sample may be diluted to the appropriate concentration into rehydration
solution containing 7 M urea, 2 M thiourea, and 2% (w/v) ASB-14 for first-dimension IEF. It
is, however, possible to load fairly large volumes onto the IEF strips using rehydration loading
(see Protocol 5). In addition, the optimal rehydration solution will vary depending on
the source of protein and should be determined empirically. 
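
Diluting an extract into the recommended concentration range is a simple proportion (C1 x V1 = C2 x V2). The sketch below is an illustration with hypothetical names and example numbers; it is not a prescribed part of the protocol.

```python
def dilution_volumes(stock_mg_per_ml, target_mg_per_ml, final_volume_ul):
    """Return (sample volume, diluent volume) in µl for the requested dilution."""
    if target_mg_per_ml > stock_mg_per_ml:
        raise ValueError("the sample is already more dilute than the target")
    sample_ul = final_volume_ul * target_mg_per_ml / stock_mg_per_ml
    return sample_ul, final_volume_ul - sample_ul


# An extract measured at 20 mg/ml, diluted to 10 mg/ml in a final volume of 350 µl.
sample_ul, rehydration_solution_ul = dilution_volumes(20.0, 10.0, 350.0)
```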


PROTOCOL 2 


Preparation of Eukaryotic Lysates for 2D Gel Electrophoresis 


CULTURED MAMMALIAN CELL LINES ARE WIDELY USED as study models in biomedical research.
The protocol described here has been used successfully for many human colon carcinoma cell
lines and fibroblast cell lines (Ji et al. 1994, 1997). Although most cellular proteins can be
extracted by this protocol, some DNA-binding proteins may not be recovered. If DNA-binding
proteins are to be studied, a nuclease (e.g., Benzonase) should be included, as described
in Protocol 3. Expression proteomic studies frequently require the detection of low-abundance
proteins. An extremely sensitive and tried-and-true method for detecting these rare
proteins is the metabolic labeling of cellular proteins with radioisotopes, which is presented
in ADDITIONAL PROTOCOL: RADIOISOTOPIC LABELING OF EUKARYOTIC CELL PROTEINS at the end of
this protocol.

This protocol was provided by Hong Ji (Joint ProteomicS Laboratory, Ludwig Institute
of Cancer Research/Walter and Eliza Hall Institute for Medical Research, Parkville, Victoria,
Australia).


MATERIALS

CAUTION: See Appendix 3 for appropriate handling of materials marked with <!>.

» Reagents
Extraction buffer
  9 M urea <!>                                5.4 g
  4% CHAPS <!>                                0.4 g
  0.5% IPG buffer (pH 3-10) (AP Biotech)      50.0 µl
  50 mM DTT <!>                               0.077 g
  H2O                                         to 10 ml
IPG buffer (pH 3-10) (AP Biotech)
Phosphate-buffered saline (PBS), ice-cold

» Equipment
Cell scraper
Centrifuge (refrigerated, low-speed)
Filter paper
Ice bath
Ultracentrifuge (refrigerated)
Vortex

» Cells

Cultured cells, either adherent or suspension


METHOD


1. Remove the cells from the culture dish.

If working with adherent cells: Scrape the cells from the culture dish with a cell scraper.
Transfer the cells and media into a 15-ml centrifuge tube using a 5-ml pipette.

If working with suspension cells: Transfer the cells and media into a centrifuge tube.

2. Pellet the cells by centrifugation at 480g for 5 minutes at 4°C.

3. Discard the supernatant without disturbing the pellet.

IMPORTANT: For the remainder of the protocol, keep all cells cold; keep cells on ice when
not centrifuging or vortexing them.

4. Wash the cells by adding 10 ml of ice-cold PBS to the centrifuge tube. Resuspend the cell
pellet by pipetting the mixture up and down.

5. Centrifuge the cells again at 480g for 5 minutes at 4°C.

6. Discard the supernatant without disturbing the pellet.

7. Repeat Steps 4-6 two more times.

8. After the last wash, allow the tube to drain thoroughly. Use a piece of filter paper to draw
the remaining PBS away from the pellet.

9. Use a pipette to transfer the extraction buffer into the tube.

The volume of extraction buffer used is dependent on the cell line being studied. The protein
concentration in the lysate should be 10 mg/ml and should not exceed 20 mg/ml.

10. Vortex the tube briefly (10-30 seconds) to resuspend the pellet.

11. Add 50 µl of IPG buffer/10 ml of extraction buffer (to a final IPG buffer concentration of
0.5%).

12. Transfer the lysate to an appropriate ultracentrifuge tube.

13. Centrifuge the lysate at 100,000g for 20 minutes.

14. Collect the supernatant, being careful to avoid the clear, viscous blob at the bottom of the
tube (which is DNA and RNA).

15. Measure the protein concentration using the BCA assay (see Appendix 2).

IMPORTANT: If using the BCA assay, omit DTT from the extraction buffer because DTT
interferes with the assay. After samples are assayed, add DTT to the sample to a final
concentration of 50 mM.

16. Divide the supernatant into 100-µl or 200-µl aliquots.

The size of the aliquot will depend on the amount of sample to be loaded on the IEF. Store
the aliquots at -80°C if they are not used immediately for IEF.

17. Proceed to Protocol 5 to perform first-dimension IEF.


ADDITIONAL PROTOCOL: RADIOISOTOPIC LABELING OF EUKARYOTIC CELL PROTEINS


Radioisotope labeling of proteins is an important technique in expression proteomics studies for 
detecting low-abundance proteins. 


Materials 
CAUTION: See Appendix 3 for appropriate handling of materials marked with <!>.
Culture medium for 35S labeling <!>
  methionine-free RPMI 1640 medium (ICN Biomedicals)      90 ml
  dialyzed fetal bovine serum (FBS) (JRH Bioscience)      10 ml
  35S-labeled methionine and cystine (ICN Biomedicals) <!>
Culture medium for 32P labeling <!>
  phosphate-free medium (ICN Biomedicals)                 90 ml
  dialyzed fetal bovine serum (FBS)                       10 ml
  32P-labeled orthophosphate (Perkin Elmer Life Science) <!>


Methods 
Method A: 35S Labeling of Cells
1. Passage the cells into a 35-mm diameter cell culture dish containing RPMI medium comple- 
mented with 10% FBS. 
2. Incubate the cell culture for 16-24 hours at 37°C. 


3. Discard the RPMI-based medium, and replace with 3 ml of methionine-free medium. Rinse the 
cells with this medium for a few seconds and discard. Repeat the wash with 3 ml of methion- 
ine-free medium. 


4. Add 3 ml of 35S-labeling medium to the culture dish and incubate the cells for 30 minutes at 37°C.
5. Add 250-500 µCi/ml of 35S-labeled methionine and cystine.
6. Incubate the cells for 4-16 hours at 37°C.

The labeling time depends on the metabolic turnover rate of the proteins. 


7. Proceed with the preparation of cell lysates as described in the main protocol. 


Method B: 32P Labeling of Cells


1. Passage the cells into a 35-mm diameter cell culture dish containing RPMI medium comple- 
mented with 10% FBS. 


2. Incubate the cell culture for 16-24 hours at 37°C. 


3. Discard the RPMI-based medium, and replace with 3 ml of phosphate-free medium. Rinse the 
cells with this medium for a few seconds and discard. Repeat the wash with 3 ml of phosphate- 
free medium. 


4. Add 3 ml of 32P-labeling medium to the culture dish and incubate for 30 minutes at 37°C.
5. Add 250-500 µCi/ml of 32P-labeled orthophosphate.


6. Incubate the cells for 2-4 hours at 37°C. 
The labeling time depends on the phosphorylation rate of the proteins. 


7. Proceed with the preparation of cell lysates as described in the main protocol. 


PROTOCOL 3 


Preparation of E. coli Lysates for 2D Gel Electrophoresis 


BACTERIAL LYSATES CONTAIN HIGH QUANTITIES OF NUCLEIC ACIDS and benefit from nuclease
treatment prior to analysis by 2D electrophoresis. The lysate is prepared by sonication in an
extraction solution containing urea and 3-[(3-cholamidopropyl)dimethylammonio]-1-propanesulfonate
(CHAPS). The nuclease used (Benzonase) is active in the presence of urea,
but it requires Mg2+ for activity. EDTA is added following nuclease treatment to inhibit
metalloproteases.


MATERIALS 
CAUTION: See Appendix 3 for appropriate handling of materials marked with <!>. 
» Reagents
Benzonase (Serratia marcescens recombinant endonuclease, 250 units/µl; Sigma)
Store at -20°C for up to 1 year.
Dithiothreitol (DTT) (1.2 M) <!>
Extraction solution
  8 M urea <!>            4.8 g
  4% (w/v) CHAPS <!>      400 mg
  20 mM Tris-base         100 µl of 2 M Tris
  1 mM MgCl2              100 µl of 100 mM MgCl2·6H2O <!>
  H2O                     to 10 ml
Prepare the solution fresh or store at -80°C. Cool on ice before use.
Na2EDTA (200 mM)
Phenylmethylsulfonyl fluoride (PMSF) (100 mM) <!> in 10 ml of isopropanol <!>
Store at 4°C or lower for up to 1 month.
» Equipment
Centrifuge (refrigerated)
Conical centrifuge tube (15- or 50-ml plastic tube)
Sonicator with microtip (e.g., Branson Sonifier with 3.2-mm [1/8 in] microtip, or equivalent)
» Cells


E. coli cells in the growth medium and growth phase of choice 


METHOD

1. Pellet E. coli by centrifugation of the culture solution in a tared bottle or tube at 12,000g
for 5 minutes at 4°C.

2. Discard the culture supernatant and allow the bottle or tube to drain thoroughly.

3. Determine the weight of the E. coli pellet by weighing the bottle or tube and subtracting
the tare weight.

4. Suspend the pellet in cold extraction solution. Use 10 ml of extraction solution per 1 g of
cells. Transfer the cell suspension to a plastic conical centrifuge tube. Add 2 µl (500 units)
of Benzonase and 100 µl of 100 mM PMSF per 10 ml of extraction solution. Mix by inversion
following each addition. Place the tube in a beaker of ice.

5. Sonicate the cells by immersing the microtip in the cell suspension. Set the sonicator to
the maximum power output possible for the tip. Sonicate in 15-second bursts but allow
the suspension to cool down in the beaker of ice between each burst. Repeat until the cell
suspension clarifies.

6. Add 0.5 ml of 1.2 M DTT per 10 ml of extraction solution and mix by inversion. Place the
tube on ice for 15 minutes.

7. Add 100 µl of 200 mM Na2EDTA per 10 ml of extraction solution and mix by inversion.

8. Transfer the lysate to a centrifuge tube and centrifuge at 20,000g for 15 minutes at 4°C.

9. Measure the protein concentration of the supernatant (see Appendix 2).

10. Prepare 100-µl aliquots of the supernatant, and if not used immediately for IEF, store the
protein at -80°C.

Aliquots should be used only once to avoid the adverse effects of multiple freeze-thaw cycles.

11. Proceed to Protocol 5 to perform first-dimension IEF.

For first-dimension IEF, the protein concentration should be 5-10 mg/ml. The sample may
be diluted to the appropriate concentration into rehydration solution.
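
Because every addition in this method is specified per 10 ml of extraction solution, and the extraction solution per gram of cells, the volumes for a given pellet can be scaled from its weight. The sketch below simply applies the ratios stated in the method; the function name and dictionary keys are hypothetical.

```python
def lysis_volumes(pellet_weight_g):
    """Scale the reagent volumes stated in the method to the pellet weight."""
    extraction_ml = 10.0 * pellet_weight_g       # 10 ml of extraction solution per 1 g of cells
    per_10_ml = extraction_ml / 10.0
    return {
        "extraction_solution_ml": extraction_ml,
        "benzonase_ul": 2.0 * per_10_ml,         # 2 µl (500 units) per 10 ml
        "pmsf_100mM_ul": 100.0 * per_10_ml,      # 100 µl per 10 ml
        "dtt_1.2M_ul": 500.0 * per_10_ml,        # 0.5 ml per 10 ml
        "na2edta_200mM_ul": 100.0 * per_10_ml,   # 100 µl per 10 ml
    }


volumes = lysis_volumes(0.5)   # a 0.5-g pellet
```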


PROTOCOL 4 


Preparation of Cerebrospinal Fluid Proteins for 2D Gel 
Electrophoresis 


HUMAN CEREBROSPINAL FLUID IS PRECIPITATED with ethanol to both concentrate the protein
and remove salts that would otherwise interfere with first-dimension IEF.


MATERIALS 
CAUTION: See Appendix 3 for appropriate handling of materials marked with <!>. 
» Reagents
Ethanol (absolute, undenatured) <!>
2-Mercaptoethanol <!>
NaOH (1 N) <!>
Solubilization solution
  9 M urea <!>                              13.5 g
  4% (w/v) CHAPS <!>                        1 g
  0.8% (v/v) Biolyte 3-10 (Bio-Rad)         0.2 ml
  0.002% bromophenol blue <!>               ~0.5 mg
  H2O                                       to 25 ml
Prepare the solution fresh or store at -80°C.
» Equipment


Centrifuge tubes (15-ml) 
Centrifuge (refrigerated) 
Sonicator with cup horn (e.g., Branson Sonifier or equivalent) 


» Cells and Tissues


Human cerebrospinal fluid <!>
Human cerebrospinal fluid is obtained by lumbar puncture, centrifuged to remove cells, and frozen
at -80°C in 1-ml aliquots.


METHOD

1. Thaw cerebrospinal fluid samples at room temperature. When the samples have thawed,
tighten the tube caps and vortex the tubes.

2. While the samples are thawing, label 15-ml centrifuge tubes, both tube and cap, with the
name of the sample to be processed.

3. Combine 1 ml of cerebrospinal fluid with 9 ml of ethanol in each 15-ml centrifuge tube.

CAUTION: Cerebrospinal fluid is potentially biohazardous. Deposit used pipette tips, tubes,
and gloves in an appropriate biohazard receptacle. Carefully clean contaminated surfaces
with 1 N NaOH.

4. Place the caps on the centrifuge tubes tightly and vortex. Store the tubes overnight at
-20°C.

5. Centrifuge the tubes at 5000g for 5 minutes at 4°C.

6. Pour the supernatant into a designated waste bottle. Use a 200-µl pipette to remove as
much residual ethanol as possible without disturbing the pellet. Angle the pipette tip
away from the pellet.

7. Allow the pellet to air dry in a chemical fume hood for 10-20 minutes. Do not allow the
pellet to dry completely.

8. Pipette 100 µl of solubilization solution into each tube. Add 2 µl of 2-mercaptoethanol.
Place the caps on the tubes tightly and vortex. Let the tubes stand for 30 minutes at room
temperature.

9. Sonicate the tubes for 30 seconds in 5-second bursts using a cooled cup horn sonicator at
the maximum setting possible with the cup horn.

10. Centrifuge the tubes at 5000g for 5 minutes at 4°C.

11. Transfer each supernatant to a fresh tube, and measure the protein concentration of each
supernatant (see Appendix 2).

The protein concentration will vary widely depending on the cerebrospinal fluid sample.
However, 25 µl of this solution is sufficient to detect the proteins on an ammoniacal silver-
stained 2D gel (see Protocol 11).

12. If the samples are not used immediately for IEF, store them in 100-µl aliquots at -80°C.

Thaw frozen samples only once to avoid the adverse effects of multiple freeze-thaw cycles.

13. Proceed to Protocol 5 to perform first-dimension IEF.


PROTOCOL 5 


The First Dimension: Isoelectric Focusing of Proteins 


Two METHODS ARE DESCRIBED HERE FOR SEPARATING PROTEINS based on their net charge using 
the technique of IEF on immobilized pH gradient (IPG) gels. These methods serve as the first 
dimension of the 2D separation. In addition to the traditional IPG IEF on a flatbed unit 
(Method A in this protocol), IPG gels can be focused on self-contained instruments for IEF
(Method B in this protocol). These high-voltage systems allow fewer manipulations of the 
IPG gels, resulting in less error, strip mix-up, contamination, air contact, or urea crystalliza- 
tion. Because rehydration and IEF can be performed consecutively within a single unit, these 
two steps can be performed unattended overnight. Finally, faster separations and sharper 
focusing are possible due to the higher voltage available in these instruments. IPG gels can be 
cast in the laboratory (for a recent review, see Gianazza 1999), but it is time-consuming and 
fraught with problems. Therefore, these protocols assume that the investigator is using dehydrated,
precast IPG gel strips available from a variety of suppliers (e.g., Amersham
Biosciences, Bio-Rad, and Genomic Solutions). 


MATERIALS 


CAUTION: See Appendix 3 for appropriate handling of materials marked with <!>.
EXPERIMENTAL TIP: The quality of the reagents used in IEF is critical. Always use electrophoresis grade 
or better. 


» Reagents
Carrier ampholytes or IPG buffers 
Dithiothreitol (DTT) (1 M) <!> 


IPG gel strips 
Casting IPG gels in the laboratory is possible, but it is time-consuming and fraught with problems. 


The more reliable (and more expensive) option is to purchase precast, dehydrated IPG strips (e.g., 
Bio-Rad or Amersham Biosciences). Rehydrated strips are 3-mm wide and ~0.5 mm thick. 


Mineral oil (low viscosity) or paraffin oil
Alternatively, a commercial product such as Immobiline DryStrip Cover Fluid can be used.


Rehydration solution
  8 M urea <!>              48 g
  1.5% (w/v) CHAPS <!>      1.5 g
  H2O                       to 100 ml
Divide the rehydration solution into 2.5-5-ml aliquots, and freeze them at -20°C.


» Equipment
Forceps 
Pipette (1 ml)
Sample applicators (optional)
Use these applicators if protein samples are loaded onto the IPG strips after the strips have been
rehydrated (see Section I, Method A, Step 4). The applicators should be purchased from the
manufacturer of the IPG strips.


Screw-cap tubes 
The tubes should be of sufficient length to hold the IPG gel strips. 


METHOD A: IEF OF PROTEINS IN A MULTIPURPOSE FLATBED ELECTROPHORESIS UNIT


Circulating water bath 
Electrophoresis power supply 


Choose a power supply capable of running at very low currents (e.g., <0.1 mA or preferably one in 
which the low current check can be circumvented). 


IEF electrode strips 
IEF electrophoresis unit with temperature control 
IPG strip rehydration tray 


METHOD B: IEF OF PROTEINS IN AN IEF-DEDICATED ELECTROPHORESIS UNIT


Dedicated IEF system (Bio-Rad, Genomic Solutions, Amersham Biosciences) 
IPG strip holder of appropriate length with cover 


Typical lengths are 7, 11, 13, 18, or 24 cm. Wash the strip holders with a suitable cleaner, rinse them
thoroughly with H2O, and either dry the holders with a cotton swab or lint-free tissue or allow them
to dry in the air. 


» Biological Sample
Protein sample, prepared in Protocol 1, 2, 3, or 4 


METHODS 


SECTION I. REHYDRATION OF IPG STRIPS


1. Add DTT to a final concentration of 0.2% (w/v) to freshly prepared or thawed rehydration
solution. Add carrier ampholytes or IPG buffers to a final concentration of
0.5-2.0%. Select the appropriate pH range to match the pH range of the IPG strips to be
used in the analysis.


2. If protein samples are to be loaded into gels during rehydration, dilute (or dissolve in) up 
to 1 mg of sample per strip with the rehydration solution. 


The amount of protein to load will depend on the goals of the experiment. Load smaller 
amounts of sample when examining abundant proteins; load more sample to detect scarce 
proteins.


3. Use an indelible marker to label the back (the nongel side) of the IPG gel strip with the
name of the protein sample to be loaded. 


4. Pipette the appropriate amount of rehydration solution (see Table 4.2) into the slot of the
reswelling tray. Deliver the solution slowly to the center of the slot, without producing
bubbles. Remove any large bubbles that form. Make sure to record which sample is
applied to which IPG gel. Numbers in the rehydration tray can aid in strip identification.


TABLE 4.2. Recommended rehydration solution volumes

IPG strip length (cm)    Volume per strip(a) (µl)
 7                       125
11                       200
13                       250
18                       350
24                       480

(a) Includes the volume of the protein sample in cases where the sample is loaded during the
IPG gel strip rehydration step.


5. Remove the protective covering from each of the IPG gels and place the strip, gel side 
down, into the rehydration tray. Make sure that the solution is distributed across the 
entire gel surface. If necessary, lift the basic end of the strip with a pair of forceps and slide 
the strip back and forth to ensure complete and even wetting. Make sure that no bubbles 
are trapped under the IPG gel. If using IEF Method B, make sure that the gel is in contact 
with the electrodes embedded in the bottom of the strip holder. 


6. Pipette ~1.5 ml of mineral oil over each IPG strip to prevent precipitation of the urea.
Apply the mineral oil dropwise into one end of the strip holder until it is approximately 
half full, and then finish filling the strip holder from the other end. Do not overfill the 
strip holder with oil. 


7. Allow the strips to rehydrate for a minimum of 6 hours at room temperature (overnight 
is preferable). If using IEF Method B, place the plastic cover on top of the strip holder. 


It is also possible with the IEF-dedicated electrophoresis unit to rehydrate the gel strips on
the unit platform and to program the rehydration as the first step of the electrophoresis pro- 
tocol. This permits the focusing steps to automatically follow the rehydration step, without 
additional need for human intervention. 


8. Proceed with either Method A or Method B for isoelectric focusing of the proteins. 
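The arithmetic behind Steps 1-4 can be sketched in a few lines of Python. This is a minimal illustration rather than part of the protocol: the function name `rehydration_mix` and its parameters are hypothetical, the per-strip volumes are taken from Table 4.2, the carrier ampholyte percentage is assumed to be volume per volume, and no pipetting overage is included.

```python
# Rehydration volume per strip from Table 4.2 (strip length in cm -> µl).
REHYDRATION_VOLUME_UL = {7: 125, 11: 200, 13: 250, 18: 350, 24: 480}


def rehydration_mix(strip_length_cm, n_strips, ampholyte_percent=0.5,
                    dtt_percent_wv=0.2):
    """Return total volume, DTT mass, and ampholyte volume for n strips.

    Hypothetical helper: ampholytes are treated as % (v/v) of the final
    mix (an assumption), and no extra volume is added for pipetting loss.
    """
    if strip_length_cm not in REHYDRATION_VOLUME_UL:
        raise ValueError(f"no Table 4.2 entry for {strip_length_cm}-cm strips")
    if not 0.5 <= ampholyte_percent <= 2.0:
        raise ValueError("Step 1 gives a range of 0.5-2.0% for ampholytes")
    total_ul = REHYDRATION_VOLUME_UL[strip_length_cm] * n_strips
    # 1% (w/v) is 10 mg/ml, i.e., 0.01 mg/µl.
    dtt_mg = dtt_percent_wv * 0.01 * total_ul
    ampholyte_ul = ampholyte_percent / 100 * total_ul
    return {"total_ul": total_ul, "dtt_mg": dtt_mg, "ampholyte_ul": ampholyte_ul}


if __name__ == "__main__":
    # Four 18-cm strips at the default 0.2% DTT and 0.5% ampholytes.
    print(rehydration_mix(18, 4))
```

Because the volumes in Table 4.2 include the protein sample when it is loaded during rehydration, the sample volume should be counted as part of the total rather than added on top of it.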


SECTION II. ISOELECTRIC FOCUSING OF PROTEINS 


Method A: IEF of Proteins in a Multipurpose Flatbed Electrophoresis Unit 


1. Set up the 2D electrophoresis unit and temperature controller according to the
manufacturer's instructions.


2. Cut two electrode wicks to the appropriate lengths. Soak each strip with 1 ml of H2O. Blot
each strip with filter paper so that they are damp, but not wet.


Excess water in the electrode strips may cause proteins to streak on the gel strips. 


3. Place the IPG and electrode strips into the electrophoresis unit according to the manu- 
facturer's instructions. If necessary, position the two electrodes over the electrode 
strips. 


4. (Optional) Apply the protein sample to the gel strips if protein was not loaded into the
IPG gel strips during rehydration (Section I, Step 2). Follow the manufacturer's
instructions for use of the protein sample applicator.


TABLE 4.3. Recommended electrophoresis conditions for IPG gels on flatbed IEF systems

Step  IPG length (cm)     Voltage (V)  Current (mA)  Power (W)  Mode      Duration (hr:min)  Volt-hours
1     7, 11, 13, 18, 24   200          2             5          step      0:01               1
2     7, 11, 13, 18, 24   3500         2             5          gradient  1:30               2,800
3     7                   3500         2             5          step      0:55-1:30          3,200-5,200
3     11                  3500         2             5          step      2:20-3:30          8,100-12,100
3     13                  3500         2             5          step      3:45-4:20          13,100-18,100
3     18                  3500         2             5          step      3:40-7:40          23,000-30,000
3     24                  3500         2             5          step      14:30-25:45        50,000-90,000

During Step 2, the voltage should rise from 200 V to 3500 V over the 1:30 time period.


The optimal application point must be empirically determined for each sample type. Sample
application at the cathode works well with proteins of acidic pI values or when SDS is used in
the solubilization of the sample. Application at the anode is required for very basic IPG gels.


Protein samples applied via sample applicators, rather than during rehydration, can precip- 
itate at the application point, reducing the amount of sample loaded onto the gel. To reduce 
this problem, load dilute protein solutions, which also contain urea, nonionic detergents, 
and carrier ampholytes. In addition, by applying low current to the strips at the beginning 
of the electrophoresis run, the sample can enter the strip slowly, which helps to minimize 
accumulation of sample at the application point. 


5. Attach the electrodes to the power supply and program the power supply with the desired 
run parameters. 


A typical IEF protocol proceeds through a series of voltages that gradually increase to a
desired focusing voltage that is held for several hours. The length of time at the focusing
voltage is affected by many factors, including IPG gel length, pH gradient, sample composition,
sample load, and rehydration solution composition, and must be empirically determined
for each sample. Table 4.3 provides recommended run conditions for analytical protein
loads.


Overfocusing is seldom a problem below 100,000 total Volt-hours, but on very long runs, it 
can contribute to horizontal streaking observed in 2D results, especially in the basic end of 
the gel. 


6. Turn on the power supply to begin the electrophoresis. Make sure that the low current 
check is circumvented if possible. 


7. At the completion of the run, turn off the power supply and remove the lid of the
electrophoresis unit. Follow the manufacturer's instructions for properly accessing the gel strips.


8. Use forceps to pick up an IPG gel strip. Allow the oil to drain from the strip and place
it in a test tube. Repeat with each gel strip. 


9. Proceed to the second-dimension electrophoresis protocols (Protocols 6-9) or store the
gel strips in their test tubes at -40°C to -80°C. Gels can be stored up to 1 month.
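The Volt-hour figures in Table 4.3 follow from simple arithmetic, sketched here in Python. The function names are hypothetical, and the sketch assumes that the unit actually runs at the programmed voltage (i.e., it is not held back by the current or power limits). Under that assumption, a linear voltage gradient contributes its average voltage multiplied by the step duration.

```python
def volt_hours(start_v, end_v, hours):
    """Volt-hours for one step; a linear ramp contributes its average voltage.

    For a constant-voltage (step) phase, pass the same value twice.
    """
    return (start_v + end_v) / 2 * hours


def hold_time_hours(target_vh, voltage):
    """Hours at a constant voltage needed to accumulate target_vh."""
    if voltage <= 0:
        raise ValueError("voltage must be positive")
    return target_vh / voltage


def as_hr_min(hours):
    """Format a duration in hours as hr:min, rounded to the nearest minute."""
    total_min = round(hours * 60)
    return f"{total_min // 60}:{total_min % 60:02d}"


if __name__ == "__main__":
    # Step 2 of Table 4.3: ramp from 200 V to 3500 V over 1.5 hours.
    print(volt_hours(200, 3500, 1.5))
    # Time needed at 3500 V for the lower Volt-hour target of a 7-cm strip.
    print(as_hr_min(hold_time_hours(3200, 3500)))
```

For Step 2, the ramp works out to 2775 Volt-hours, which the table rounds to 2,800. Where the durations computed this way differ from the tabulated ones, the table should take precedence.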


Method B: IEF of Proteins in an IEF-dedicated Electrophoresis Unit 


1. Program the electrophoresis unit for IEF. 


A typical IEF protocol proceeds through a series of voltages that gradually increase to a
desired focusing voltage that is held for several hours. The length of time at focusing voltage
is affected by many factors, including IPG gel length, pH gradient, sample composition,
sample load, and rehydration solution composition, and must be empirically determined for
each sample. Table 4.4 provides recommended run conditions for analytical protein loads.


TABLE 4.4. Recommended electrophoresis conditions for IPG gels on dedicated IEF systems

Step  IPG length (cm)     Voltage (V)      Duration (hr:min)  Volt-hours     Gradient type
1     7, 11, 13, 18, 24   rehydration(a)   12:00              -              -
2     7                   500              0:30               250            step and hold
2     11, 13, 18, 24      500              1:00               500            step and hold
3     7                   8000(b)          1:00               8,000          step and hold
3     11, 13              8000(b)          2:00               16,000         step and hold
3     18                  8000(b)          4:00               32,000         step and hold
3     24                  8000(b)          6:25-11:25         50,000-90,000  step and hold

(a) The total rehydration time can be varied for convenience, but it must be more than 6 hours.
(b) Depending on the sample, this voltage may not be reached within the suggested step duration.
For this reason, it is best to program the steps based on Volt-hours rather than time.


Overfocusing is seldom a problem below 100,000 total Volt-hours, but on very long runs, it 
can contribute to horizontal streaking observed in 2D results, especially in the basic end of 
the gel. Due to variations in sample types, some samples may not reach maximum voltage 
during a run. Therefore, it is best to program the run conditions on the basis of total num- 
ber of Volt-hours, rather than time to ensure that proper focusing occurs. 


2. Close the lid to the electrophoresis unit and begin the IEF protocol. 


3. At the completion of the run, turn off the power supply and open the lid of the elec- 
trophoresis unit. 


4. Use forceps to pick up an IPG gel strip. Allow the oil to drain from the strip and place it 
in a test tube. Repeat with each gel strip. 


5. Proceed to the second-dimension electrophoresis protocols (Protocols 6, 7, and 9) or
store the gel strips in their test tubes at -40°C to -80°C. Gels can be stored up to 1 month.
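The advice to program runs by Volt-hours rather than time can be illustrated with a short Python sketch. It is an illustration only: the function name `hours_to_reach`, the idea of a voltage log sampled at fixed intervals, and the readings used in the example are all hypothetical, and each reading is assumed to hold for its whole interval.

```python
def hours_to_reach(target_vh, readings, interval_hours):
    """Return the elapsed hours at which accumulated Volt-hours reach target_vh.

    readings: voltages (V) logged at fixed intervals; each reading is assumed
    to hold for the whole interval (simple rectangle-rule integration).
    Returns None if the log ends before the target is reached.
    """
    accumulated = 0.0
    for i, volts in enumerate(readings, start=1):
        accumulated += volts * interval_hours
        if accumulated >= target_vh:
            return i * interval_hours
    return None


if __name__ == "__main__":
    # Target for a 7-cm strip in Step 3 of Table 4.4: 8,000 Volt-hours.
    ideal = [8000] * 8      # unit reaches 8000 V immediately
    limited = [4000] * 8    # sample holds the voltage at 4000 V
    print(hours_to_reach(8000, ideal, 0.25))
    print(hours_to_reach(8000, limited, 0.25))
```

In this example, a sample that limits the voltage to half the programmed value needs twice as long to accumulate the same Volt-hours, so a step programmed for a fixed time would stop well short of the focusing target.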


PROTOCOL 6 


Preparation of Vertical SDS Slab Gels: Casting a Single 
Homogeneous Gel 


FOLLOWING THE SEPARATION OF PROTEINS BY IEF, the second dimension is carried out by SDS-PAGE.
This protocol details the method for casting single homogeneous SDS-PAGE gels.
Homogeneous gels (with the same %T and %C throughout) offer the best resolution for a
particular molecular-weight range and are commonly used because they are the easiest to
pour reproducibly.

The second-dimension gels can be conveniently prepared in three different formats (i.e.,
sizes): minigels, for use with 7-cm IEF first-dimension gels; standard gels, for use with 11-,
13-, and 18-cm IEF first-dimension gels; and large-format gels, for use with 18- and 24-cm
IEF first-dimension gels. All of the gels use a common set of reagents, listed below, but differ
slightly in the equipment required.

If the gel is subsequently stained using the ammoniacal silver staining method (Protocol
11), consider including thiosulfate in the gel to reduce background levels of stain
(Hochstrasser and Merril 1988; see the note to Table 4.5).


MATERIALS


Reagents


CAUTION: See Appendix 3 for appropriate handling of materials marked with <!>. 

EXPERIMENTAL TIP: The quality of the reagents is critical to good separations. Always use electrophore- 
sis grade or better. In addition, always wear gloves when preparing gels, because acrylamide is a neurotox- 
in, and keratin and other skin and hair proteins can contaminate the gel. 


Ammonium persulfate (10%) <!>
Fresh ammonium persulfate "crackles" when H2O is added. If the sample does not crackle, replace
it with fresh stock. Prepare 1 ml of ammonium persulfate just prior to use.

n-butanol (H2O-saturated) <!>
Mix H2O and n-butanol in a 1000-ml flask. Once mixed, allow the n-butanol and H2O to separate into
two phases. This butanol will be used as a gel overlay (see Step 11).

Ethanol <!>

4x Gel buffer (1.5 M Tris-Cl, pH 8.8)
Dissolve 181.5 g of Tris-base in 750 ml of H2O. Adjust the pH to 8.8 with HCl <!>, and bring the
final volume to 1000 ml with H2O. Filter the solution through a 0.45-µm filter and store it at room
temperature.

Gel storage solution
0.375 M Tris-Cl (pH 8.8)
0.1% SDS <!>
Combine 125 ml of 4x gel buffer, 5 ml of 10% SDS, and 370 ml of H2O. Store the solution at 4°C.

Monomer stock solution (30.8%T, 2.6%C)

acrylamide <!>                     150.0 g (30% final concentration)
N,N'-methylenebisacrylamide <!>    4.0 g (0.8% final concentration)
H2O                                to 500 ml

Filter the monomer solution through a 0.45-µm filter. Store it at 4°C away from light.

SDS (10%)

5x SDS electrophoresis buffer

125 mM Tris-base       15.1 g
960 mM glycine <!>     72.1 g
0.5% SDS               5.0 g
H2O                    to 1000 ml

Store the 5x buffer at room temperature. Dilute to a 1x solution just before use.

TEMED <!>


TABLE 4.5. Recipes for single-percentage gels

                            Final gel concentration
                            5%        7.5%      10%       12.5%     15%
Monomer stock solution      16.7 ml   25 ml     33.3 ml   41.7 ml   50 ml
4x Resolving gel buffer     25 ml     25 ml     25 ml     25 ml     25 ml
10% SDS                     1 ml      1 ml      1 ml      1 ml      1 ml
H2O                         56.8 ml   48.5 ml   40.2 ml   31.8 ml   23.5 ml
10% Ammonium persulfate     500 µl    500 µl    500 µl    500 µl    500 µl
TEMED                       33 µl     33 µl     33 µl     33 µl     33 µl
Total volume                100 ml    100 ml    100 ml    100 ml    100 ml

If the gel will be subsequently stained using the ammoniacal silver staining method (Protocol 11),
include sodium thiosulfate in the gel to reduce background levels of stain (Hochstrasser and
Merril 1988). Use 180 µl of 10% ammonium persulfate solution, 90 µl of 5% sodium thiosulfate,
and 180 µl of TEMED per 100 ml of gel solution. Sodium thiosulfate slightly inhibits
polymerization, thus the requirement for higher amounts of TEMED. Ammonium persulfate must be
added to the gel solution first, followed by sodium thiosulfate, and TEMED last.


Equipment


Gel-casting apparatus (or clamps and sealing tape)


Glass plates 
To clean the glass plates, soak them in a warm solution of laboratory detergent. Scrub the plates
with a soft scouring pad, and rinse them in H2O. Alternatively, plates can be cleaned using a
laboratory dishwasher. The plates should be dried and stored in a dust-free environment. Always wear
gloves when handling plates to reduce contamination from keratin and other proteins.


Magnetic stir bar 

Magnetic stir plate 

Pipettes (25 ml, disposable) 

Spacers (1.0 or 1.5 mm) 

Vacuum flask 

Water aspirator or other vacuum system


METHOD


1. Just prior to assembling the glass plates for casting a gel, wipe each plate with an
ethanol-soaked lint-free tissue. Make sure that no dust particles are on the surface of the plate.


EXPERIMENTAL TIP: Wear gloves at all times when handling the plates to reduce the
chance of contamination with keratin and other proteins.


2. Align the two glass plates, separating them with a 1.0- or 1.5-mm spacer placed on each
side.


3. Clamp the plates together at the spacers using just enough pressure to hold the spacers
and plates in place. Stand the entire assembly on its bottom edge on a flat surface, and
ensure that the bottom edges of the two glass plates and the spacers are flush with one
another. Tighten the clamps securely.


It is critical that the bottom edge of the plates and spacers rest flush on a flat surface. This
minimizes the likelihood that the apparatus will leak when casting the gel.


4. Place the clamp assembly into the casting cradle and turn the cams so that the gel form
(i.e., the glass plates and spacers that are clamped together) is sealed against the gasket of
the casting stand.


5. Use a typewriter or a pencil to prepare a gel label on a small strip of filter paper. Place the
filter paper in the bottom left-hand corner of the gel form. Do not use a felt tip pen, as it
will bleed into the gel.


6. Mark the outer glass plate ~0.5 cm from its upper edge using an indelible laboratory
marker.


7. Fill the gel form with H2O to determine the volume of gel solution needed before casting
the gel. Be sure to remove all H2O from the gel form before casting the gel, or the
acrylamide concentration in the gel will be affected. Prepare the appropriate volume of gel
solution (calculate from Table 4.5) in a vacuum flask, omitting the TEMED and
ammonium persulfate.


8. Add a small magnetic stir bar to the solution. Use a water aspirator to degas the solution
for several minutes while stirring the solution on a magnetic stir plate.


Although not all investigators degas their acrylamide, it is recommended for more
reproducible polymerization.


9. Add the TEMED and ammonium persulfate and gently swirl the flask to mix. Do not
generate bubbles.


10. Use a pipette to slowly deliver the solution into one corner of the gel form, filling the form
to 0.5 cm from its upper edge.


11. Immediately overlay each gel with a thin layer (100-500 µl) of H2O-saturated n-butanol
to minimize exposure of the gel to oxygen and to create a flat gel surface.


EXPERIMENTAL TIP: It is critical that the surface of the second-dimension gel be flat to
allow good contact with the first-dimension gel during transfer. Many investigators overlay
with double-distilled H2O; however, this runs the risk of diluting the concentration of the
acrylamide on the top of the gel.


12. Allow the gel to polymerize for a minimum of 1 hour.


13. After polymerization, inspect each gel for complete and even polymerization.
Additionally check that the top surface of the gel is straight and flat. Pour off the
n-butanol and wash the surface of the gel with distilled or deionized H2O.


14. The gel can now be used for running the second dimension (Protocol 9) or stored for 
future use for up to 2 weeks at 4°C. 


Gels that are to be stored should be overlaid with gel storage solution and wrapped tightly 
in plastic wrap. 
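Step 7 asks for the gel volume to be calculated from Table 4.5. The scaling can be sketched in Python as follows. This is a minimal illustration under stated assumptions, not the authors' own procedure: the function name `single_percentage_recipe` is hypothetical, the ratios are read off Table 4.5 (which is consistent with a stock containing 30% acrylamide and with the TEMED volume being neglected when the H2O volume is calculated), and the thiosulfate variant described in the table note is not covered.

```python
ACRYLAMIDE_STOCK_PERCENT = 30.0  # acrylamide content of the monomer stock


def single_percentage_recipe(gel_percent, total_ml):
    """Volumes for a homogeneous gel, following the ratios in Table 4.5."""
    monomer = total_ml * gel_percent / ACRYLAMIDE_STOCK_PERCENT
    buffer_4x = total_ml / 4      # 4x gel buffer diluted to 1x
    sds_10 = total_ml * 0.01      # 1 ml of 10% SDS per 100 ml
    aps_10 = total_ml * 0.005     # 500 µl of 10% ammonium persulfate per 100 ml
    temed = total_ml * 0.00033    # 33 µl of TEMED per 100 ml
    # The TEMED volume is treated as negligible when computing H2O,
    # which reproduces the H2O row of Table 4.5.
    water = total_ml - monomer - buffer_4x - sds_10 - aps_10
    if water < 0:
        raise ValueError("gel percentage is too high for this monomer stock")
    return {
        "monomer_ml": monomer,
        "buffer_4x_ml": buffer_4x,
        "sds_10_ml": sds_10,
        "water_ml": water,
        "aps_10_ul": aps_10 * 1000,
        "temed_ul": temed * 1000,
    }


if __name__ == "__main__":
    # A 12.5% gel for a gel form that holds 50 ml.
    for name, value in single_percentage_recipe(12.5, 50).items():
        print(f"{name}: {value:.1f}")
```

As in Step 7, the TEMED and ammonium persulfate would still be held back until the solution has been degassed.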


PROTOCOL 7 


Preparation of Vertical SDS Slab Gels: Simultaneous 
Casting of Multiple Gradient Gels 


GRADIENT SDS-PAGE GELS PROVIDE THE BEST RESOLUTION over a wide range of molecular
weights, resulting in sharper protein spots, because diffusion is minimized by the decreasing
pore size in the gel. However, gradient gels are more difficult to produce reproducibly; thus,
they are commonly cast with multiple gel casters, which allows for an identical set of gels to
be produced for an experiment. Presented here is a method for casting gradient gels using a
multiple gel casting system.

If the gel is subsequently stained using the ammoniacal silver staining method (Protocol
11), consider including thiosulfate in the gel to reduce background levels of stain
(Hochstrasser and Merril 1988; see the note to Table 4.6).


TABLE 4.6. Recipes for gradient gels

                            Final gel concentration
Light solution              5%        7.5%      10%       12.5%     15%
Monomer stock solution      8.4 ml    12.5 ml   16.7 ml   21.0 ml   25 ml
4x Resolving gel buffer     12.5 ml   12.5 ml   12.5 ml   12.5 ml   12.5 ml
10% SDS                     0.5 ml    0.5 ml    0.5 ml    0.5 ml    0.5 ml
H2O                         28.5 ml   24.4 ml   20.2 ml   16.0 ml   12.0 ml
10% Ammonium persulfate     165 µl    165 µl    165 µl    165 µl    165 µl
TEMED                       16.5 µl   16.5 µl   16.5 µl   16.5 µl   16.5 µl
Total volume                50 ml     50 ml     50 ml     50 ml     50 ml

                            Final gel concentration
Heavy solution              10%       12.5%     15%       17.5%     20%
Monomer stock solution      16.7 ml   21.0 ml   25.0 ml   29.2 ml   33.3 ml
4x Resolving gel buffer     12.5 ml   12.5 ml   12.5 ml   12.5 ml   12.5 ml
Sucrose                     7.5 g     7.5 g     7.5 g     7.5 g     7.5 g
10% SDS                     0.5 ml    0.5 ml    0.5 ml    0.5 ml    0.5 ml
H2O                         16.2 ml   11.7 ml   7.7 ml    3.5 ml    0 ml
10% Ammonium persulfate     165 µl    165 µl    165 µl    165 µl    165 µl
TEMED                       16.5 µl   16.5 µl   16.5 µl   16.5 µl   16.5 µl
Total volume                50 ml     50 ml     50 ml     50 ml     50 ml


If the gel will be subsequently stained using the ammoniacal silver staining method (Protocol 11),
include sodium thiosulfate in the gel to reduce background levels of stain (Hochstrasser and
Merril 1988). Use 90 µl of 10% ammonium persulfate solution, 45 µl of 5% sodium thiosulfate,
and 90 µl of TEMED per 50 ml of gel solution. Sodium thiosulfate slightly inhibits
polymerization, thus the requirement for higher amounts of TEMED. Ammonium persulfate must be
added to the gel solution first, followed by sodium thiosulfate, and TEMED last.


MATERIALS


CAUTION: See Appendix 3 for appropriate handling of materials marked with <!>.


EXPERIMENTAL TIP: The quality of the reagents is critical to good separations. Always use
electrophoresis grade or better. In addition, always wear gloves when preparing gels, because
acrylamide is a neurotoxin, and keratin and other skin and hair proteins can contaminate the gel.


Reagents


Ammonium persulfate (10%) <!>
Fresh ammonium persulfate "crackles" when H2O is added. If the sample does not crackle, replace
it with fresh stock. Prepare 1 ml of ammonium persulfate just prior to use.

n-butanol (H2O-saturated) <!>
Mix H2O and n-butanol in a 1000-ml flask. Once mixed, allow the n-butanol and H2O to separate
into two phases. This butanol will be used as a gel overlay (see Step 13).

Ethanol <!>

4x Gel buffer (1.5 M Tris-Cl, pH 8.8)
Dissolve 181.5 g of Tris-base in 750 ml of H2O. Adjust the pH to 8.8 with HCl <!>, and bring the
final volume to 1000 ml with H2O. Filter the solution through a 0.45-µm filter and store it at room
temperature.

Gel storage solution
0.375 M Tris-Cl (pH 8.8)
0.1% SDS <!>
Combine 125 ml of 4x gel buffer, 5 ml of 10% SDS, and 370 ml of H2O. Store the solution at 4°C.

Monomer stock solution (30.8%T, 2.6%C)

acrylamide <!>                     150.0 g (30% final concentration)
N,N'-methylenebisacrylamide <!>    4.0 g (0.8% final concentration)
H2O                                to 500 ml

Filter the monomer solution through a 0.45-µm filter. Store it at 4°C away from light.

SDS (10%)

5x SDS electrophoresis buffer

125 mM Tris-base       15.1 g
960 mM glycine <!>     72.1 g
0.5% SDS               5.0 g
H2O                    to 1000 ml

Store the 5x buffer at room temperature. Dilute to a 1x solution just before use.

Sucrose
Sucrose is added to the higher %T solution to aid in the formation of the gradient. This solution is
denser and is thus frequently called the heavy solution, whereas the lower %T acrylamide solution
is referred to as the light solution.

TEMED <!>


Equipment


Glass plates
To clean the glass plates, soak them in a solution of hot laboratory detergent. Scrub the plates with
a soft scouring pad, and rinse them in H2O. Alternatively, plates can be cleaned using a laboratory
dishwasher. The plates should be dried and stored in a dust-free environment. Always wear gloves
when handling plates to reduce contamination from keratin and other proteins.

Gradient maker with a volume of at least 600 ml

Magnetic stir bars

Magnetic stir plate

Multiple-gel casting chamber

Peristaltic pump capable of 60-75 ml/minute

Pipettes (25-ml, disposable)

Pump tubing and clamp

Spacers (1.0 or 1.5 mm)

Vacuum flask

Water aspirator or other vacuum system


METHOD


1. Just prior to assembling the glass plates for casting a gel, wipe each plate with an ethanol- 
soaked lint-free tissue. Make sure that no dust particles are on the surface of the plate. 


EXPERIMENTAL TIP: Wear gloves at all times when handling the plates to reduce the 
chance of contamination with keratin and other proteins. 


2. Align the two glass plates, separating them with a 1.0- or 1.5-mm spacer placed on each 
side. 


3. Mark one of the glass plates ~0.5 cm from its upper edge using an indelible laboratory 
marker. 


In some systems, such as the Bio-Rad Protean II, the glass plates are not of equal length.
Make sure that this mark is placed 0.5 cm from the upper edge of the shorter plate. 


4. Use a typewriter or a pencil to prepare a gel label on a small strip of filter paper. Place the
filter paper in the bottom left-hand corner of the gel form. Do not use a felt tip pen, as it
will bleed into the gel.


5. Place the complete gel cassette into the multiple-gel casting chamber. Repeat the assem- 
bly of the remaining cassettes, and place each one in the casting chamber separated by a 
thin plastic sheet. These sheets are usually supplied with the multiple-gel casting cham- 
ber, but they can also be cut from a sheet of Mylar. Fill any excess space with acrylic spac- 
ers provided with the casting chamber. Clamp the front plate onto the casting chamber 
to ensure that a good seal is made with the gasket. 


6. Fill the casting chamber with H2O to determine the volume of reagents needed to fill the
chamber before casting the gels. Half of the measured volume will be prepared at the low 
%T (light solution) and the other half will be prepared at the higher %T (heavy solution). 
Use Table 4.6 to calculate the volumes of the gel components required for casting any 
number of gels. 


7. Prepare the appropriate volume of light and heavy acrylamide solutions, omitting the 
ammonium persulfate and TEMED in the vacuum flasks, and add a small magnetic stir 
bar to each flask. 


8. Use a water aspirator to degas the solutions (for several minutes) while stirring them on 
a magnetic stir plate. 


Although not all investigators degas their acrylamide, it is recommended for more
reproducible polymerization.


9. While the solutions are degassing, connect a piece of tubing to the peristaltic pump and,
using H2O, ensure that the flow rate of the pump is set to the manufacturer's
recommendations. Once the flow rate is set, connect one end of the tubing to the outlet port on the
gradient maker and the other end to the inlet port of the multiple-gel caster. Ensure that
there is a clamp in place near the inlet port to the multiple-gel caster and that it is open.
Make sure that the gradient maker valves are in the closed position.


10. Place a small magnetic stir bar in the mixing chamber and place the gradient maker on a
magnetic stir plate.


11. When the gel solutions have been completely degassed, add the ammonium persulfate
and TEMED solutions to each solution with a gentle swirl. Pour the light solution into
the mixing chamber (the front chamber near the outlet port) and the heavy solution into
the reservoir chamber, and turn on the magnetic stirrer.


The stir bar should form a vortex in the mixing chamber, but it should not spin at such high
speeds that bubbles are introduced into the acrylamide solution.


12. Open the outlet valve from the gradient maker and start the peristaltic pump. When the
light solution has reached the bottom of the gel, open the valve between the reservoir and
the mixing chamber. Check that the stir bar is mixing the solutions adequately. Do not
allow air bubbles to enter the tubing.


13. Allow the gels to fill the chambers until there is a gap of ~0.5 cm (the mark previously
made on the glass plates) from the top of the gel. This should require nearly all of the
solution in the gradient maker. When the casting chamber is full, switch off the pump and
close the clamp at the inlet port to the casting chamber. Overlay each gel with an equal
volume of H2O-saturated n-butanol.


14. Detach the tubing from the gradient maker and the peristaltic pump. Drain excess
acrylamide from the tubing and rinse the gradient maker immediately.


15. Allow the gradient gels to polymerize for a minimum of 2 hours.


16. After polymerization, disassemble the casting chamber. Use a strong plastic spatula to
separate the gel cassettes from each other at the plastic sheets. Rinse each cassette in
distilled or deionized H2O to remove any acrylamide adhering to the outer surface. Quickly
rinse the top surface of each gel to remove the n-butanol.


17. Carefully inspect the gels for air spaces, uneven polymerization, uneven top surfaces, or
other defects. Discard any unsatisfactory gels.


18. The gels can now be used for running the second dimension (Protocol 9) or stored for
future use for up to 2 weeks at 4°C.


Gels that are to be stored should be overlaid with gel storage solution and wrapped tightly
in plastic wrap.
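The volume split in Step 6 and the gradient that results from Steps 11-13 can be sketched in Python. This is an idealized illustration, not a description of any particular gradient maker: the function names are hypothetical, the two chambers are assumed to hold equal volumes and to mix perfectly so that the gradient is linear, and the light solution, which enters the caster first, is assumed to end up at the top of each gel.

```python
TABLE_4_6_VOLUME_ML = 50.0  # each recipe in Table 4.6 makes 50 ml


def gradient_plan(chamber_ml, light_percent, heavy_percent):
    """Split the measured chamber volume into equal light and heavy halves."""
    if light_percent >= heavy_percent:
        raise ValueError("the light solution must have the lower %T")
    half = chamber_ml / 2
    return {
        "light_ml": half,
        "heavy_ml": half,
        # Factor by which each Table 4.6 recipe must be multiplied.
        "scale_vs_table": half / TABLE_4_6_VOLUME_ML,
    }


def percent_t_at_depth(fraction_from_top, light_percent, heavy_percent):
    """Idealized gel concentration at a fractional depth (0 = top, 1 = bottom)."""
    if not 0.0 <= fraction_from_top <= 1.0:
        raise ValueError("fraction_from_top must be between 0 and 1")
    return light_percent + (heavy_percent - light_percent) * fraction_from_top


if __name__ == "__main__":
    # A caster that holds 600 ml, cast as a 10-15% gradient.
    print(gradient_plan(600, 10, 15))
    print(percent_t_at_depth(0.5, 10, 15))
```

Real gradients deviate from this straight line, which is one reason the protocol recommends casting an identical set of gels together for each experiment.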


PROTOCOL 8 


IPG Strip Equilibration 


THE EQUILIBRATION STEP SERVES TO SATURATE THE IPG STRIP with the SDS buffer system
required for the second-dimension separation. The equilibration solution consists of buffer,
urea, glycerol, reductant, SDS, and dye. The buffer (50 mM Tris-HCl, pH 8.8) maintains the
appropriate pH range for electrophoresis. Urea and glycerol are added to reduce the effects of
electroendosmosis (Görg et al. 1985), thus helping improve protein transfer from the IPG
strip to the second dimension. The reductant (dithiothreitol) ensures that disulfide bridges
are broken. SDS ensures that the proteins are denatured and also provides a net negative
charge to all proteins. Iodoacetamide, introduced during a second equilibration step,
alkylates thiol groups on the proteins, preventing their reoxidation during electrophoresis, and
thus reducing streaking and other artifacts in the second-dimension separation.
Iodoacetamide also alkylates residual dithiothreitol, preventing point streaking and other silver
staining artifacts (Görg et al. 1988). Finally, a tracing dye (bromophenol blue) is added to
allow the electrophoresis to be monitored during the run.


MATERIALS


CAUTION: See Appendix 3 for appropriate handling of materials marked with <!>.


EXPERIMENTAL TIP: The quality of the reagents is critical to good separations. Always use
electrophoresis grade or better.


Reagents


SDS equilibration buffer

50 mM Tris-Cl (pH 8.8)     6.7 ml of 1.5 M Tris-Cl
6 M urea <!>               72.07 g
30% (v/v) glycerol         69 ml of 87% (v/v)
2% SDS <!>                 4.0 g
bromophenol blue <!>       a few grains

Adjust the buffer to a final volume of 200 ml with H2O. Store in 40-ml aliquots at -20°C.
Immediately prior to use (see Step 1), add dithiothreitol (DTT) <!> to a final concentration of 1%
(w/v). If performing the second equilibration (see Step 4), add iodoacetamide <!> to a fresh aliquot
of SDS equilibration buffer, to a final concentration of 2.5% (w/v).


Equipment


Blotting paper (3MM)
Culture tubes (screw-cap) of sufficient length to hold the IPG gel strips
Shaker table or rocker


Biological Sample


Focused IPG strips (from Protocol 5)


METHOD


1. Add 100 mg of DTT per 10 ml of SDS equilibration buffer (final DTT concentration is
1% w/v).


2. Place each IPG strip in a separate screw-cap culture tube with the support film against the
wall of the tube. Add equilibration buffer (from Step 1) to each tube; 10 ml is
recommended for 18- and 24-cm IPG gels, 5 ml for 11- and 13-cm gels, and 2.5-5 ml for 7-cm
IPG gels. Place the caps on the tubes tightly, and lay them down on a shaker or rocker.


3. Gently shake or rock the IPG strips for at least 10 minutes. The time to equilibration 
varies depending on the amount of protein loaded on the strip. Long equilibration times 
(in excess of 20 minutes) may result in a significant loss of sample due to diffusion from
the gel. At the completion of the incubation, proceed to either Step 4, Step 5, or directly 
to Protocol 9. 


Unfortunately, the only way to know whether the strips are equilibrated is to run the second 
dimension. Incomplete equilibration will result in vertical streaks in the SDS-PAGE gel. 


4. (Optional) Perform a second equilibration step, this time with iodoacetamide, to reduce 
point streaking and other artifacts in the second-dimension separation. This equilibra- 
tion step is optional for vertical second-dimension separations (but is mandatory for 
horizontally run SDS gels). 


a. Decant the first equilibration solution into a waste container.
b. Add ~2 ml of H2O to each culture tube and quickly rinse the gel.


c. Discard the H2O, and add the appropriate volume (see Step 2) of SDS equilibration
solution containing 2.5% (w/v) iodoacetamide (e.g., 250 mg per 10 ml of equilibration
solution) to each culture tube.


d. Place the caps on the tubes tightly, and lay them down on a shaker or rocker. Gently
shake or rock the IPG strips for 5 minutes. 


e. Proceed to either Step 5 or directly to Protocol 9.


5. (Optional) Place the IPG strips on edge on blotting paper moistened with H2O and allow
excess equilibration solution to drain from the surface of the IPG strip for 30-60 seconds. 


Draining excess equilibration solution helps reduce horizontal streaking of highly abundant 
proteins in the sample. If gels are drained, this should be done immediately before placing 
them on top of the second-dimension gel. Proceed to Protocol 9.
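The reagent quantities for Steps 1-4 can be worked out with a short Python sketch. It is a minimal illustration: the function name `equilibration_plan` is hypothetical, the per-strip buffer volumes are those recommended in Step 2 (with 5 ml assumed for 7-cm strips, the upper end of the 2.5-5 ml range), and no overage is included.

```python
# Equilibration buffer per strip from Step 2 (strip length in cm -> ml).
# 7-cm strips may use 2.5-5 ml; 5 ml is assumed here.
BUFFER_ML_PER_STRIP = {7: 5.0, 11: 5.0, 13: 5.0, 18: 10.0, 24: 10.0}

DTT_MG_PER_ML = 10.0            # 1% (w/v) = 100 mg per 10 ml
IODOACETAMIDE_MG_PER_ML = 25.0  # 2.5% (w/v) = 250 mg per 10 ml


def equilibration_plan(strip_length_cm, n_strips, second_step=True):
    """Return buffer volumes and reagent masses for equilibrating n strips."""
    if strip_length_cm not in BUFFER_ML_PER_STRIP:
        raise ValueError(f"no recommendation for {strip_length_cm}-cm strips")
    volume_ml = BUFFER_ML_PER_STRIP[strip_length_cm] * n_strips
    plan = {"dtt_buffer_ml": volume_ml, "dtt_mg": DTT_MG_PER_ML * volume_ml}
    if second_step:
        # The optional second equilibration uses a fresh aliquot of buffer.
        plan["iodoacetamide_buffer_ml"] = volume_ml
        plan["iodoacetamide_mg"] = IODOACETAMIDE_MG_PER_ML * volume_ml
    return plan


if __name__ == "__main__":
    # Four 18-cm strips, including the optional iodoacetamide step.
    print(equilibration_plan(18, 4))
```

Since the buffer is stored in 40-ml aliquots, four 18-cm strips in this example would use one aliquot for the DTT step and a second one for the iodoacetamide step.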


PROTOCOL 9 


The Second Dimension: SDS-PAGE of Proteins 


FOLLOWING FIRST-DIMENSION IEF AND EQUILIBRATION OF THE IPG gel strips, the proteins are
separated on the basis of their molecular weight in the second dimension on an SDS-PAGE gel.
Systems for this separation are available from a variety of suppliers and are commonly found 
in many protein chemistry laboratories. This protocol describes a method for placement of 
the IPG strip and gives some recommended electrophoresis conditions for these second- 
dimension gels. 


MATERIALS 
CAUTION: See Appendix 3 for appropriate handling of materials marked with <!>. 
▶ Reagents 
Agarose sealing solution 
  5x SDS electrophoresis buffer        20 ml 
  agarose (see note to this recipe)    0.5 g 
  bromophenol blue <!>                 a few grains 
  H2O                                  80 ml 
Use a low electroendosmosis form of agarose. A low-melting-temperature form may also be 
advisable. Add all ingredients to a 500-ml Erlenmeyer flask and swirl to suspend. Heat in a microwave 
oven on low setting until the agarose is completely dissolved. It is best to do this in 20-second 
intervals to ensure that the solution does not boil over. Once the solution is warm, keep a close watch, 
for it will easily boil over. Dispense in 1.5-ml aliquots in screw-cap tubes and store at room 
temperature or 4°C. 
Protein molecular-weight markers 
Optional, see Step 3. 
5x SDS electrophoresis buffer 
  125 mM Tris-base      15.1 g 
  960 mM glycine <!>    72.1 g 
  0.5% SDS <!>          5.0 g 
  H2O                   to 1000 ml 
Store the 5x buffer at room temperature. Dilute to a 1x solution just before use. 
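
The masses in the 5x buffer recipe can be checked against the stated concentrations, and the 1x dilution worked out, with a few lines of arithmetic. This is a sketch only: the molecular weights are standard values that are not given in the protocol, and the function names are hypothetical.

```python
# Molecular weights in g/mol (standard values; an assumption, not from the protocol)
MW = {"Tris base": 121.14, "glycine": 75.07}


def molarity_mM(grams_per_liter, mol_weight):
    """Convert a mass concentration (g/liter) to millimolar."""
    return grams_per_liter / mol_weight * 1000.0


def dilute_to_1x(final_volume_ml, stock_fold=5):
    """Return (ml of stock, ml of H2O) needed to make `final_volume_ml` of 1x buffer."""
    stock_ml = final_volume_ml / stock_fold
    return stock_ml, final_volume_ml - stock_ml


print(round(molarity_mM(15.1, MW["Tris base"]), 1))  # close to the stated 125 mM
print(round(molarity_mM(72.1, MW["glycine"]), 1))    # close to the stated 960 mM
print(dilute_to_1x(1000))                            # stock and water for 1 liter of 1x
```

The same `dilute_to_1x` helper covers the agarose sealing solution, which uses 20 ml of 5x buffer in a 100-ml total volume.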
▶ Equipment 


Filter paper 
Forceps and flexible ruler 


The forceps and the ruler must be thin enough to allow the IPG strip to be maneuvered between 
the glass plates of the slab gel (see Step 2). 
Heat blocks preset to 45°C and 100°C 
Power supply 

SDS-PAGE gel (from Protocol 6 or 7) 
Vertical electrophoresis unit 


▶ Biological Sample 
Focused IPG strips (from Protocol 8) 


METHOD 


1. Melt agarose sealing solution in a 100°C heat block. Each gel will require ~1 ml of solu- 
tion. It will take ~10 minutes to completely melt the agarose, so this is best done shortly 
before equilibration of the IPG strips (Protocol 8). Once melted, allow the agarose to cool 
to ~40-50°C before using. 


2. Use forceps to position the equilibrated IPG strip between the plates on the surface of the 
second-dimension gel. Ensure that the plastic backing is against one of the glass plates. 
Use a thin, flexible plastic ruler to slide the IPG strip between the plates until the entire 
bottom edge of the IPG gel makes a seal with the surface of the slab gel. Work from one 
end of the IPG strip to the other, progressively sliding the gel down the glass plate. To 
reduce the chance of tearing the IPG gel, put pressure on the plastic backing of the gel to 
slide the IPG strip into place. 


Make sure that no air bubbles are trapped between the slab gel and the bottom of the IPG gel, 
or between the plastic backing and the glass plate. If air bubbles are present, place light 
pressure on the plastic backing of the IPG strip to force the air bubble out from under the gel. Do 
not place excessive pressure on the surface of the IPG strip once it is on top of the 
second-dimension gel; it can potentially collapse the pores on the top surface of the SDS-PAGE gel. 


3. If molecular-weight markers are desired, mix an appropriate amount (200-1000 ng of 
each marker for Coomassie staining and 10-50 ng of each marker for silver staining) with 
melted agarose sealing solution. Pipette 15-20 µl of this mixture onto a small piece of 
filter paper. 


4. Once the agarose has set, insert the filter paper between the glass plates, so that it contacts 
the top of the slab gel at one end of the IPG gel. 


Biotinylated standards can also be used if the gels are to be blotted and proteins detected 
with an avidin-horseradish peroxidase or an avidin-alkaline phosphatase system. See the 
manufacturer's instructions for suggested amounts of biotinylated standards to use. 


5. Slowly pipette the melted agarose sealing solution over the IPG gel strip. Avoid 
introducing bubbles. Allow the agarose to cool and solidify for at least 1 minute before continuing. 


Embedding the strip in agarose serves to hold the gel strip in place and ensures good 
contact between the IPG strip and the slab gel. 


6. Finish assembling the electrophoresis apparatus as directed by the manufacturer. Connect 
the cooling apparatus (if available) and set it to maintain a gel temperature of 10-15°C. 


Although cooling is not mandatory, it greatly diminishes gel-to-gel variation and allows the 
gels to be run at higher currents, thus reducing run times. 
TABLE 4.7. Recommended electrophoresis conditions for second-dimension vertical gels 


                            Step   Current (mA/gel)   Duration (hr:min) 
Mini vertical system 
  1.5-mm-thick gel           1          15             0:15 
                             2          30             1:30* 
  1.0-mm-thick gel           1          10             0:15 
                             2          20             1:30 
Standard vertical system 
  1.5-mm-thick gel           1          15             0:15 
                             2          45             5:00-6:00* 
  1.0-mm-thick gel           1          15             0:15 
                             2          45             5:00-6:00* 


*Time shown here is only an estimate. Electrophoresis should be stopped when the dye is 0.5-1.0 cm from the bottom of the gel. 


Cooling is recommended for both standard and large-format gels. Cooling greatly improves 
gel-to-gel reproducibility and helps reduce artifacts, such as smiling in the gels. Cooling is 
very important if there are large temperature fluctuations in the laboratory. 


7. Use Table 4.7 or 4.8 to set the recommended electrophoresis protocol. Turn on the power 
supply to begin the electrophoresis. 


Electrophoresis is carried out in two steps. The first step is of a short duration and uses a low 
current or power setting. During this time, the initial migration and stacking occur. After the 
initial 5-15 minutes of electrophoresis (see Tables 4.7 and 4.8), check that the bromophenol 
blue dye front has moved into the gel. The current or power is then increased during the sec- 
ond step to allow for faster separation. 


8. Turn off the power supply when the bromophenol blue dye front has reached 0.5-1 cm 
from the bottom of the gel. Remove the gel assembly from the electrophoresis chamber. 


9. Place the assembly on a flat surface and remove the spacers, being careful not to disturb 
the gel. Use a plastic spatula or spacer to carefully separate the two glass plates of each gel 
cassette. Do not use a metal spatula, as it will crack or chip the glass plates. The gel should 
adhere to one of the plates. 


10. Proceed immediately with one of the gel-staining procedures (Protocols 10-14) and/or a 
protein transfer procedure (Protocols 15 and 16). 


EXPERIMENTAL TIP: Do not allow the gel to dry out. Leave in gel cassettes, if necessary, 
while completing the preparations for the next protocol. 


TABLE 4.8. Recommended electrophoresis conditions for large-format second-dimension vertical gels 


Temperature   Step   Power (W/gel)   Duration (hr:min) 
25°C           1        5             0:15 
               2        15            6:00 
20°C           1        5             0:15 
               2        10            8: 
15°C           1        5             0:45 
               2        6.6           10:00-12:00* 


*Time shown here is only an estimate. Electrophoresis should be stopped when the dye is 0.5-1.0 cm from the bottom of the 
gel. Values given here are based on the limit of the cooling system's capacity at the given temperature and are for 12 gels. Power 
levels can be increased if running fewer than 12 gels, which will result in reduced run times. These values are optimized for the 
Ettan DALT I system (Amersham Biosciences). 
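

The two-step programs of Table 4.7 can be kept as data and turned into power supply settings. The sketch below makes two assumptions that are not stated in the protocol: several identical gels are run in parallel from one supply in constant-current mode (so the set current is the per-gel current times the number of gels), and the lower bound of the 5:00-6:00 estimate is used for the standard system. The names are hypothetical.

```python
# (current in mA per gel, duration in minutes) for Step 1 and Step 2, from Table 4.7
TABLE_4_7 = {
    ("mini", 1.5): [(15, 15), (30, 90)],
    ("mini", 1.0): [(10, 15), (20, 90)],
    ("standard", 1.5): [(15, 15), (45, 300)],  # 300 min = lower bound of 5:00-6:00
    ("standard", 1.0): [(15, 15), (45, 300)],
}


def run_summary(system, thickness_mm, n_gels=1):
    """Return (estimated total minutes, [(supply current in mA, minutes), ...])."""
    steps = TABLE_4_7[(system, thickness_mm)]
    total_minutes = sum(minutes for _, minutes in steps)
    supply_program = [(milliamps * n_gels, minutes) for milliamps, minutes in steps]
    return total_minutes, supply_program


# Two 1.0-mm mini gels run side by side
print(run_summary("mini", 1.0, n_gels=2))
```

The durations remain estimates; the run is stopped by the position of the dye front, as the table footnote states.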


PROTOCOL 10 


Colloidal Coomassie Staining 


THIS PROTOCOL IS A MODIFICATION OF THE METHOD of Neuhoff et al. (1988), which involves 
the binding of Coomassie Blue G250 to proteins. This dye complexes with basic amino acids, 
such as arginine, tyrosine, lysine, and histidine and produces a very low background due to 
the colloidal properties of the stain. This property prevents the Coomassie dye from pene- 
trating the gel, eliminating the requirement for the long destaining periods characteristic of 
Coomassie R250 staining. Colloidal Coomassie is commonly used with preparative gels and 
has a detection range of 8-50 ng. 


MATERIALS 


CAUTION: See Appendix 3 for appropriate handling of materials marked with <!>. 


▶ Reagents 
Coomassie Blue G250 <!> (10 ml, 5% w/v) in H2O 
Stir solution for a few minutes to disperse the dye. The dye will not dissolve completely. 
Dye stock solution 
  ammonium sulfate <!>               50 g 
  85% phosphoric acid <!>            6 ml 
  5% Coomassie Blue G250 solution    10 ml 
  H2O                                to 500 ml 
Fixative solution 
10% (v/v) acetic acid <!> 
40% (v/v) methanol <!> 
Prepare 1 liter. 
Glycerol (5%) 
Methanol 


▶ Equipment 
Pyrex or plastic dishes 
These dishes should be larger than the size of the gel being stained. 
Reciprocal or orbital shaking platform 


▶ Protein Sample 
Gel containing protein samples (from Protocol 9) 


METHOD 
1. Fix the gel in fixative solution with gentle agitation for at least 1 hour in a Pyrex or plastic dish. 


The gel can be fixed overnight if necessary. If the gels contain labels, multiple gels can be 
fixed in the same dish; however, the volume of fixative should be sufficient to allow all gels 
to move freely. 


2. Remove the fixative, submerge the gel in H2O, and wash the gel on an orbital shaker for 
10 minutes. 


3. Discard the H2O. Repeat the wash step two more times. 


4. During the third wash step, prepare the colloidal Coomassie stain by mixing four parts of 
Coomassie dye stock with one part methanol. 


5. Place the gel in 100-300 ml of colloidal Coomassie stain (depending on the size of the gel, 
ensuring sufficient volume to just cover the gel). Stain the gel overnight with gentle 
agitation on an orbital shaker. 


The gel can be left in the stain for a much longer period of time, during which the color of 
the gel will continue to deepen for up to 7 days. 


6. To remove residual stain, transfer the gel to a clean dish of 45-55°C distilled or deionized 
H2O. As the H2O colors, discard it, and wash the gel with gentle agitation again with fresh 
45-55°C H2O. Repeat until the protein bands are at the desired contrast against the 
background of the gel. Proteins will appear blue on a clear background. 


7. The gel can now be scanned for analysis. It is better to scan wet gels, because gels may crack 
during drying, leading to a loss of data. 


8. Prior to drying, soak the gel in 5% glycerol for 30 minutes. Gels can also be stored 
indefinitely at 4°C wrapped in plastic and sealed in a ZipLock bag. 
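

The four-parts-to-one mixing rule for the working stain can be sketched as a small calculation. The function name is hypothetical; the dye concentration is derived from the dye stock recipe above (10 ml of 5% dye solution made up to 500 ml).

```python
def colloidal_stain_volumes(total_ml):
    """Return (ml of Coomassie dye stock, ml of methanol) for a 4:1 mixture."""
    dye_stock_ml = total_ml * 4 / 5
    methanol_ml = total_ml / 5
    return dye_stock_ml, methanol_ml


# Dye concentration (% w/v) in the dye stock and in the working stain
stock_dye_percent = 5.0 * 10 / 500             # 10 ml of 5% solution in 500 ml
working_dye_percent = stock_dye_percent * 4 / 5

print(colloidal_stain_volumes(250))  # volumes for 250 ml of working stain
print(round(working_dye_percent, 3))
```

The working stain therefore contains 20% (v/v) methanol; choose `total_ml` within the 100-300 ml range so that the gel is just covered.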


PROTOCOL 11 


Ammoniacal Silver Staining 


SILVER STAINING OF POLYACRYLAMIDE GELS was first introduced by Switzer et al. (1979) and has 
rapidly become the most popular high-sensitivity staining method for proteins. Problems with 
high backgrounds, reproducibility, and silver mirrors resulted in many improvements to this 
initial protocol over the years. Recently, Rabilloud et al. (1994b) found more than 100 different 
protocols in the literature. However, all protocols contain at least the following five basic steps: 


e Fixation 

* Sensitization 

+ Silver impregnation 

» Image development 

+ Stopping development and image stabilization 


We have found that the ammoniacal silver staining process presented here provides high 
sensitivity, in the range of 2-4 ng, and it works well with gels down to a thickness of 1 mm. 
Due to its high sensitivity, silver staining is very susceptible to interference from a variety of 
sources. Exceptional cleanliness is necessary, and reagent and water quality are critical. 
Deionized H2O with a resistivity >15 MΩ/cm is necessary for all solutions and washes. 
Distilled H2O must not be used, for it tends to give erratic results. All chemicals should be of 
the highest grade possible. The background levels obtained with this silver staining method 
can be reduced greatly by incorporating thiosulfate into the gel during polymerization 
(Hochstrasser and Merril 1988). This can be achieved by altering the polymerization solution 
suggested in Tables 4.5 and 4.6. It may be worth comparing this procedure with the silver 
staining method provided in Protocol 12. Both methods, when properly performed, provide 
excellent results with low background. The choice of which method to use routinely lies 
with the investigator. 


MATERIALS 
CAUTION: See Appendix 3 for appropriate handling of materials marked with <!>. 
IMPORTANT: Prepare all solutions in clean glassware with deionized H2O. Approximately 250 ml of each 
solution is required per standard size (160 x 200 x 1.5 mm) gel. 

▶ Reagents 


Developing solution 


citric acid monohydrate <!>             0.1 g 
formaldehyde (stock is 36.5-38%) <!>    1 ml 
H2O                                     to 1000 ml 


Mix in a Pyrex bottle, place the cap on the bottle, and shake manually. This solution should be made 
fresh every time it is needed. 
Glutaraldehyde sensitization solution (20% glutaraldehyde solution) <!> 
Combine 200 ml of glutaraldehyde (e.g., ACROS Organics; stock is 50% glutaraldehyde in H2O) 
with 800 ml of H2O. Mix in a Pyrex bottle. A separate waste bottle will be required for 
glutaraldehyde waste. 


Primary fixation solution 
10% (v/v) acetic acid <!> 
40% (v/v) ethanol <!> 
Prepare 1 liter in a graduated cylinder, and mix on a magnetic stirrer. 
Secondary fixation solution 
5% (v/v) acetic acid 
5% (v/v) ethanol 
Prepare 1 liter in a graduated cylinder, and mix on a magnetic stirrer. 
Silver staining solution 
Solution A: In a 250-ml graduated cylinder add: 
200 ml of H,O 
800 µl of NaOH <!> 
13.3 ml of ammonium hydroxide <!> 
Mix by stirring on a magnetic stirrer. 


Solution B: In a small beaker (100-ml) add: 
8.0 g of silver nitrate <!> 
~50 ml of H2O 

Mix by stirring on a magnetic stirrer. 


Transfer Solution A to a 1-liter graduated cylinder and let the solution stir in a chemical fume hood. 
When the contents of Solution B have dissolved, slowly add Solution B to Solution A using a 10-ml 
pipette. Add Solution B dropwise such that the dark precipitate clears before adding additional 
drops. If a precipitate forms and does not clear, discard the solution and start over. 


When all of Solution B has gone into solution with A, adjust the volume to 1000 ml with H,O while 
stirring in the chemical fume hood. 


EXPERIMENTAL TIP: This solution stains very easily, and thus must be handled cautiously. 
Change gloves if they come into contact with silver-staining solution. Wipe area clean after use. This 
solution must be made fresh each time it is used. 
Silver nitrate waste must be stored in a brown bottle used only for silver nitrate. 

Stop solution (10% acetic acid) 
In a Pyrex bottle, mix 100 ml of glacial acetic acid <!> and 900 ml of H2O. 


▶ Equipment 
Plastic wrap or thin polycarbonate plastic sheets 


Pyrex or plastic dishes 
These dishes should be larger than the size of the gel being stained. 


Reciprocal or orbital shaking platform 


▶ Protein Sample 
Gel containing protein samples (from Protocol 9) 
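
The dilutions in the reagent list can be cross-checked with simple C1V1 = C2V2 arithmetic. The sketch below assumes that the "20%" in the glutaraldehyde recipe refers to the volume fraction of the 50% stock, which is how the stated volumes work out; the function name is hypothetical.

```python
def final_percent(stock_percent, stock_ml, final_ml):
    """Concentration after diluting `stock_ml` of stock to a total of `final_ml`."""
    return stock_percent * stock_ml / final_ml


# 200 ml of 50% glutaraldehyde stock made up to 1000 ml
glutaraldehyde_percent = final_percent(50.0, 200.0, 1000.0)

# 8.0 g of silver nitrate in a final volume of 1000 ml, as % (w/v)
silver_nitrate_percent = 8.0 * 100 / 1000

# Number of standard gels stained per liter at ~250 ml per gel
gels_per_liter = 1000 // 250

print(glutaraldehyde_percent, silver_nitrate_percent, gels_per_liter)
```

On these figures the sensitization solution contains 10% glutaraldehyde (20% by volume of the 50% stock), and 1 liter of silver staining solution serves four standard gels.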
METHOD 


Silver staining is very susceptible to interference from a variety of sources. To minimize 
problems: 


Wear powder-free gloves at all times when preparing solutions or handling gels. 


Do not place pressure on the gels during solution transfers, as this can cause staining 
artifacts on the gels. 


Do not touch the gel with gloved hands throughout this protocol. Carry out all manip- 
ulations of the gel with a piece of plastic wrap or polycarbonate sheet placed between 
gloved hands and the gel. 


1. Place the gel (from Protocol 9) in a dish of H2O. Agitate the gel on a reciprocal or orbital 
shaker for 5 minutes. While the gel is on the shaker, prepare fresh primary fixation solution. 


Multiple gels should each be fixed and stained in separate dishes. 


2. Drain the H2O from the dish, holding the gel in the dish with a square of plastic wrap or 
polycarbonate sheet. Do not allow gloved hands to come in direct contact with the gel. 
Pour enough primary fixation solution into the dish to adequately cover the gel. Agitate 
the gel on a shaker for 30-60 minutes. 


3. Prepare fresh secondary fixation solution. Drain the primary fixation solution from the 
dish, holding the gel in the dish with a square of plastic wrap or polycarbonate sheet. Pour 
enough secondary fixation solution into each dish to cover the gel adequately. Agitate the 
gel on a shaker for 2 hours if silver staining will be finished on the same day. If not, then 
cover the dish with a lid or plastic wrap and shake it overnight on an orbital shaker. 


If the gel is shaken overnight, be sure that there is enough solution in each dish to keep the 
gel from drying out. 


4. Drain the secondary fixation solution from the dish, holding the gel in the dish with a 
square of plastic wrap or polycarbonate sheet. Add sufficient H2O to cover the gel. Agitate 
the gel on a shaker for 5 minutes. 


5. Prepare ~250 ml of glutaraldehyde sensitization solution per standard size gel (160 x 200 
x 1.5 mm). Prepare the solution fresh each time it is used. Keep volumes to the minimum 
necessary. Proceed with either Step a or b. 


a. Make sure that the gel is face up in the dish. Drain the H2O from the dish. Carefully 
pour ~250 ml of glutaraldehyde solution into the dish containing the gel. Agitate the 
gel on an orbital shaker for 30 minutes at a speed sufficient to move the gel in the 
dish without sliding it out of the solution. 


Knowing the orientation of the gel during the staining process will assist in “reading the 
gel.” Traditionally, the gel is oriented with the acidic end on the left and the basic end on 
the right. Although this will have no effect on the staining of the gel, it helps to give an 
idea of where to expect the proteins to appear on the developed gel. 


b. Add ~200 ml of glutaraldehyde solution to a clean dry dish. Carefully transfer the gel 
from the H2O to the glutaraldehyde solution. Use caution when lifting the gel; it is 
fragile. Lift the gel only from its corners to prevent pressure artifacts in critical areas 
of the gel. Agitate the gel on an orbital shaker for 30 minutes at a speed sufficient 
to move the gel in the dish without sliding it out of the solution. 
6. Drain the glutaraldehyde solution into a proper waste receptacle. Rinse the gel with H2O 
and drain this H2O into the glutaraldehyde waste receptacle. 


7. Generously cover the gel with H2O and shake it on an orbital shaker for 5 minutes. 


8. Drain the H2O. Repeat the H2O wash two more times. 


If desired, the number of washes can be increased beyond three, but not decreased. 


9. Repeat Steps 7 and 8, but increase each wash time to 30 minutes. 


As with the 5-minute washes, if desired, the number of washes can be increased beyond 
three, but not decreased. 


10. Prepare the silver staining solution 30-60 minutes before it is needed in Step 11. 


One liter of the silver staining solution is sufficient to stain four standard-size gels. 


11. Drain the H2O from the dish, holding the gel in the dish with a square of plastic wrap or 
polycarbonate sheet. Add ~250 ml of silver staining solution to each dish. Agitate the gel 
on an orbital shaker for 20 minutes. The agitation time can be shortened to 15 minutes 
if necessary, but it should be no less than 15 minutes and no more than 20 minutes. 


12. Drain the silver staining solution into an appropriate waste bottle, making sure not to 
touch the gel with gloved hands. 


13. Quickly rinse the gel with just enough H2O to cover the gel. Discard this H2O in the 
silver nitrate waste bottle. 


14. Add enough H2O to generously cover the gel. Agitate the gel on an orbital shaker for 5 
minutes. 


15. Drain the H2O. Repeat the H2O wash two more times. 


16. While the gel is washing, prepare the developing and stop solutions. Pour the stop 
solution into a clean dry dish (or dishes, if multiple gels are being stained). Place the dishes 
in an easily accessible location to avoid any delay between developing and stopping. 


17. Drain the H2O from the dish, holding the gel in the dish with a square of plastic wrap or 
polycarbonate sheet. Add ~250 ml of developing solution to the dish. Agitate the gel on 
an orbital shaker that has been lined with white bench paper. 


White bench paper enhances the contrast between stained proteins and the background and 
will improve the ability to determine the stop point for developing. 


EXPERIMENTAL TIP: Do not develop more than three gels at one time. Keep additional 
gels in H2O until ready to develop them. 

The time required for proper development varies with each gel. Watch the gels individually 
and determine the proper time to stop development, which is, typically, when protein spots 
appear dark and small protein spots become visible. It may take up to 1 minute before the 
first protein spots appear. Be careful not to overdevelop. Approximate development time is 
usually 5-10 minutes. 


18. When the gel appears to be sufficiently developed, carefully lift the gel by its bottom 
corners, and quickly transfer it to a dish containing stop solution. Incubate the gel with 
gentle agitation in the stop solution for ~10 minutes. 


19. Discard the stop solution, and cover the gel with H2O. Place the gel on an orbital shaker 
until it can be scanned. 
Gels should be scanned as soon as possible. Residual acid in the gel, from the stop solution, 
will cause the stained proteins to fade over time. For accurate densitometry, scan the gels 
within 30 minutes of transfer to H2O. 
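

Because the procedure chains many timed incubations, it can help to total them before starting. The following is a rough sketch that sums only the timed steps of this protocol for the same-day option (2-hour secondary fixation); solution preparation, quick rinses, and handling time are not included, and the step labels and function names are hypothetical.

```python
# (label, minimum minutes, maximum minutes), transcribed from the timed steps above
TIMED_STEPS = [
    ("H2O rinse", 5, 5),
    ("primary fixation", 30, 60),
    ("secondary fixation (same-day option)", 120, 120),
    ("H2O wash", 5, 5),
    ("glutaraldehyde sensitization", 30, 30),
    ("three 5-minute H2O washes", 15, 15),
    ("three 30-minute H2O washes", 90, 90),
    ("silver staining", 15, 20),
    ("three 5-minute H2O washes after staining", 15, 15),
    ("development (approximate)", 5, 10),
    ("stop solution", 10, 10),
]


def total_minutes(steps):
    """Return (shortest, longest) total incubation time in minutes."""
    return sum(s[1] for s in steps), sum(s[2] for s in steps)


def as_hours(minutes):
    """Format minutes as h:mm."""
    return f"{minutes // 60}:{minutes % 60:02d}"


shortest, longest = total_minutes(TIMED_STEPS)
print(as_hours(shortest), "to", as_hours(longest))  # 5:40 to 6:20
```

On this tally the timed incubations alone take roughly 5 hours 40 minutes to 6 hours 20 minutes, which is why the overnight option for secondary fixation is offered.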
PROTOCOL 12 


Mass Spectrometry-compatible Silver Staining 


THE AMMONIACAL SILVER STAINING METHOD presented in Protocol 11 is one of the most 
sensitive methods used to detect proteins on an SDS-PAGE gel. However, this and other standard 
silver staining methods are not compatible with mass spectrometry (MS), which is fast 
becoming the best way to identify proteins isolated on 2D gels. Because the proteins in gels 
to be analyzed by mass spectroscopy cannot be modified, many of the common sensitizing 
agents (e.g., glutaraldehyde and strong oxidizing agents) cannot be used. In addition, many 
investigators find it advantageous to purchase dedicated kits, where the reagent quality is 
assured, to carry out the silver staining. Such kits are available from a variety of sources, 
including Amersham Biosciences, Bio-Rad, Pierce, and other chemical supply companies. 
The method provided here is based on the Amersham Biosciences PlusOne silver stain kit as 
described by Yan et al. (2000b). These authors found that this method is compatible with 
MALDI and ESI-MS, and it shows an increased ability to deal with semipreparative protein 
loads without negative staining as compared to other silver staining methods (including the 
method in Protocol 11). However, this process is less sensitive than standard silver staining 
methods. For further MS-compatible silver staining methods, see Shevchenko et al. (1996), 
Scheler et al. (1998), Gharahdaghi et al. (1999), and Sinha et al. (2001). 


MATERIALS 
CAUTION: See Appendix 3 for appropriate handling of materials marked with <!>. 
IMPORTANT: Prepare all solutions in clean glassware using deionized H2O with a resistivity >15 MΩ/cm. 
▶ Reagents 
Developing solution 
  sodium carbonate <!>      6.25 g 
  formaldehyde (37%) <!>    100 µl 
  H2O                       250 ml 


Fixation solution 
10% (v/v) acetic acid <!> 
40% (v/v) methanol <!> 
Prepare 500 ml of this solution. 
Sensitization solution 


  methanol                 75 ml 
  5% sodium thiosulfate    10 ml 
  sodium acetate <!>       17 g 
  H2O                      165 ml 
Silver staining solution (0.25% w/v silver nitrate) <!> 
Just prior to use, dilute 25 ml of stock silver staining solution (stock concentration is 2.5%) tenfold 
with 225 ml of H2O. Alternatively, prepare a fresh 0.25% (w/v) silver nitrate solution from powder 
just prior to use (see Step 6). 
Stop solution (1.46% EDTA) 
Dissolve 3.65 g of EDTA in 250 ml of H2O. 


▶ Equipment 
Plastic wrap or thin polycarbonate plastic sheets 
Pyrex or plastic dishes 


These dishes should be larger than the size of the gel being stained. 


Reciprocal or orbital shaking platform 


▶ Protein Sample 
Gel containing protein samples (from Protocol 9) 
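
The concentrations implied by the 250-ml recipes above can be verified with weight/volume arithmetic. This is an illustrative sketch with hypothetical function names; it treats the volumes of the sensitization solution as additive (75 + 10 + 165 = 250 ml), which is an approximation.

```python
def percent_w_v(grams, volume_ml):
    """Grams per 100 ml."""
    return grams * 100.0 / volume_ml


def diluted_percent(stock_percent, stock_ml, diluent_ml):
    """Concentration after mixing stock with diluent (volumes assumed additive)."""
    return stock_percent * stock_ml / (stock_ml + diluent_ml)


stop_edta = percent_w_v(3.65, 250)              # stop solution
silver = diluted_percent(2.5, 25, 225)          # tenfold dilution of 2.5% stock
thiosulfate = diluted_percent(5.0, 10, 240)     # 10 ml of 5% stock in 250 ml total
methanol_v_v = 75 * 100.0 / (75 + 10 + 165)     # % (v/v) methanol in sensitizer

print(round(stop_edta, 2), silver, thiosulfate, methanol_v_v)
```

The results agree with the labels on the recipes: 1.46% EDTA and 0.25% silver nitrate, with the sensitization solution working out to about 0.2% sodium thiosulfate in 30% methanol.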


METHOD 


Silver staining is very susceptible to interference from a variety of sources. 


Make sure that all glassware is exceptionally clean. 
Wear powder-free gloves at all times when preparing solutions or handling gels. 


Do not place pressure on the gels during solution transfers, as this can cause artifacts on 
the gels. 


Prepare all solutions fresh immediately prior to staining. Approximately 250 ml of each 
solution is required per standard size (160 x 200 x 1.5 mm) gel. 


1. Place the gel (from Protocol 9) in a dish containing 250 ml of fixation solution. Agitate 
on a reciprocal or orbital shaker for 5 minutes. 


Multiple gels should each be fixed and stained in separate dishes. 


2. Drain the fixation solution from the dish, holding the gel in the dish with a square of 
plastic wrap or polycarbonate sheet. Do not allow gloved hands to come in direct contact 
with the gel. Add another 250 ml of fixation solution and agitate on an orbital shaker for 
an additional 15 minutes. 


3. Discard the fixation solution, and add 250 ml of sensitization solution to the dish. Agitate 
the gel on an orbital shaker for 30 minutes. 


4. Drain the sensitization solution from the dish, holding the gel in the dish with a square of 
plastic wrap or polycarbonate sheet. Add sufficient deionized H2O to generously cover the 
gel. Agitate the gel on a shaker for 5 minutes. 


5. Drain the H2O. Repeat the H2O wash two more times. 


6. Prepare the silver staining solution 30-60 minutes before it is needed in Step 7. 


One liter of the silver staining solution is sufficient to stain four standard-size gels. 
7. Drain the H2O from the dish, holding the gel in the dish with a square of plastic wrap or 
polycarbonate sheet. Add ~250 ml of silver staining solution to each dish. Agitate the gel 
on an orbital shaker for 20 minutes. The agitation time can be shortened to 15 minutes 
if necessary, but it should be no less than 15 minutes and no more than 20 minutes. 


8. Drain the silver staining solution into an appropriate waste bottle, making sure not to 
touch the gel with gloved hands. 


9. Add enough H2O to generously cover the gel. Agitate the gel on an orbital shaker for 5 
minutes. 


10. Drain the H2O, discarding the first wash in the silver nitrate waste bottle. Repeat the H2O 
wash two more times. 


11. While the gel is washing, prepare the developing and stop solutions. Place the stop 
solution in an easily accessible location to avoid any delay between developing and stopping. 


12. Drain the H2O from the dish, holding the gel in the dish with a square of plastic wrap or 
polycarbonate sheet. Add ~250 ml of developing solution to the dish. Agitate the gel on 
an orbital shaker that has been lined with white bench paper. 


White bench paper enhances the contrast between stained proteins and the background and 
will improve the ability to determine the stop point for developing. 


EXPERIMENTAL TIP: Do not develop more than three gels at one time. Keep additional 
gels in H2O until ready to develop them. 

The time required for proper development varies with each gel. Watch the gels individually 
and determine the proper time to stop development, which is, typically, when protein spots 
appear dark and small protein spots become visible. It may take up to 1 minute before the 
first protein spots appear. Be careful not to overdevelop. Approximate development time is 
usually 5-10 minutes. 


13. When the gel appears to be developed enough, discard the developer into an appropriate 
waste container, holding the gel in the dish with a square of plastic wrap or polycarbonate 
sheet. Quickly pour ~250 ml of stop solution into the dish. Agitate the gel on an 
orbital shaker for 10 minutes. 


14. Discard the stop solution, and generously cover the gel with H2O. Agitate the gel on an 
orbital shaker for 5 minutes. 
Discard the H2O, and repeat the washes two more times. The gel is now ready for scanning 
for analysis or spot removal for mass spectrometry. 

Gels should be scanned as soon as possible. 


EXPERIMENTAL TIP: Mass spectrometry results can be improved by incorporating a 
destaining step to remove the silver prior to in-gel digestions, as described by Gharahdaghi 
et al. (1999). 


PROTOCOL 13 


Fluorescent Staining of Proteins with SYPRO Ruby 


THE DEVELOPMENT OF NEW MASS SPECTROMETRY INSTRUMENTATION and its application to the 
field of proteomics have greatly aided in the identification of proteins isolated by 2D gel 
electrophoresis. Identification of proteins is no longer limited to those that are highly abundant, 
with protein identification at the femtomole level now being feasible (Jensen et al. 1997; 
Gatlin et al. 1998). However, to make this level of sensitivity applicable to proteins isolated 
from 2D gels, sensitive and compatible gel-staining methods are necessary. Silver staining 
provides very sensitive detection of proteins; however, it is complicated, and without 
modifications that reduce its sensitivity (see Protocol 12), it is incompatible with mass 
spectroscopy. A sensitive ruthenium-based fluorescent dye (SYPRO Ruby) has recently been 
developed by Molecular Probes. Although the structure of this dye is not in the public 
domain, it has been reported that SYPRO Ruby is a transition metal organic complex that 
binds directly by electrostatic mechanisms (Patton 2000). Similar to silver stains, SYPRO 
Ruby interacts with basic amino acids, including lysine, arginine, and histidine (Lopez et al. 
2000). Recent studies by Yan et al. (2000a) and Lopez et al. (2000) have shown that this dye 
is as sensitive as silver staining and completely compatible with MALDI mass spectroscopy. 
The staining method is also simple, having only three steps: fixation, staining, and 
destaining. However, SYPRO Ruby is very expensive. Thus, for many laboratories, this dye might be 
limited to micropreparative gels, as the cost to run the large number of gels done in 
quantitative studies is prohibitive. 


MATERIALS 


CAUTION: See Appendix 3 for appropriate handling of materials marked with <!>. 


▶ Reagents 
Fixative solution 
7% (v/v) acetic acid <!> 
10% (v/v) methanol <!> 
Prepare 1000 ml of this solution. 
SYPRO Ruby dye (Molecular Probes) <!> 


▶ Equipment 
Polypropylene tray 
Do not use a glass tray; it will bind the dye. 


Reciprocal or orbital shaking platform 
UV or blue light transilluminator (e.g., the Dark Reader, Clare Chemical Research, Denver, 
Colorado) or laser scanner <!> 
▶ Protein Sample 
Gel containing protein samples (from Protocol 9) 


METHOD 


1. Place the gel (from Protocol 9) in a polypropylene dish containing sufficient fix solution 
to allow the gel to float freely in the dish. Agitate on a reciprocal or orbital shaker for 30 
minutes. 


Multiple gels should each be fixed and stained in separate dishes. 


2. Drain the fixative solution from the dish, holding the gel in the dish with a square of plas- 
tic wrap or polycarbonate sheet. Do not allow gloved hands to come in direct contact 
with the gel. Incubate the gel in the dark on a reciprocal or orbital shaker in undiluted 
SYPRO Ruby stain for 90 minutes to overnight. 


The minimum stain volumes for standard gel sizes are: 


50 ml for 8 cm x 10 cm x 0.75 mm gel 

330 ml for 16 cm x 20 cm x 1 mm gel 

500 ml for 20 cm x 20 cm x 1 mm gel 

or ~10 times the gel volume for larger gels 
Using too little stain will lower the sensitivity. Sensitivity increases with longer staining time, 
with a minimum of 3 hours for maximum sensitivity. Reusing the stain will greatly reduce 
sensitivity. 


3. (Optional) Discard the SYPRO Ruby stain. To remove residual stain that can cause high 
fluorescent background and speckling, place the gel in a volume of fixative solution equal 
to the amount of dye that was used in Step 2. Agitate on a reciprocal or orbital shaker 
table for 30 minutes. 


4. Visualize the proteins using a 300-nm transilluminator, a blue light transilluminator, or a 
laser scanner. SYPRO Ruby has excitation maxima at ~300 and ~420 nm and an emission 
maximum at ~618 nm. 


The polyester backing of some precast gels is highly fluorescent. For maximum sensitivity 
using a UV transilluminator, place the gel polyacrylamide side down and use an emission 
filter to screen out the blue fluorescence of the plastic. The use of a blue-light 
transilluminator or laser scanner will also reduce the amount of fluorescence from the plastic backing. 


This dye is compatible with imaging systems whose lasers emit at 450, 473, 488, and/or 532
nm. Currently, most major suppliers of laser scanners can provide the optimal filter setting
for this dye.
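The stain-volume guidance in Step 2 can be sketched as a small lookup with a fallback. This is only an illustration: the function names are hypothetical, the three tabulated volumes are the ones quoted in Step 2, and the "~10 times the gel volume" rule is applied here to any gel that is not one of the listed standard sizes.

```python
# Minimum SYPRO Ruby stain volumes quoted in Step 2.
# Keys are (width in cm, height in cm, thickness in mm); values are ml.
STANDARD_STAIN_VOLUMES_ML = {
    (8.0, 10.0, 0.75): 50.0,
    (16.0, 20.0, 1.0): 330.0,
    (20.0, 20.0, 1.0): 500.0,
}


def gel_volume_ml(width_cm, height_cm, thickness_mm):
    """Gel volume in ml (1 cm^3 = 1 ml); thickness is converted from mm to cm."""
    return width_cm * height_cm * (thickness_mm / 10.0)


def min_stain_volume_ml(width_cm, height_cm, thickness_mm):
    """Return the tabulated volume for a standard gel, else ~10x the gel volume."""
    key = (width_cm, height_cm, thickness_mm)
    if key in STANDARD_STAIN_VOLUMES_ML:
        return STANDARD_STAIN_VOLUMES_ML[key]
    # Assumption: the ~10x rule is used for every non-standard gel size.
    return 10.0 * gel_volume_ml(width_cm, height_cm, thickness_mm)


if __name__ == "__main__":
    print(min_stain_volume_ml(8.0, 10.0, 0.75))   # standard minigel
    print(min_stain_volume_ml(25.0, 30.0, 1.0))   # larger, non-standard gel
```

The tabulated values take priority because they are not exact multiples of the gel volume; the fallback is only a rough lower bound.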


PROTOCOL 14 


Phosphoprotein Staining with the GelCode
Phosphoprotein Staining Kit


THE PHOSPHORYLATION STATE OF A PROTEIN HAS AN IMPORTANT ROLE in the regulation of a wide
variety of cellular processes. As a result, there has been a great deal of interest in detecting
phosphorylated proteins. The method presented here uses the GelCode phosphoprotein
staining kit (Pierce Chemical Company). This method depends on the hydrolysis of the
phosphoprotein phosphoester linkage using sodium hydroxide in the presence of calcium ions.
The gel containing the newly formed insoluble calcium phosphate is then treated with
ammonium molybdate in dilute nitric acid. The resultant insoluble nitrophospho-molybdate
complex is stained with Methyl Green. After destaining, the phosphoproteins are colored green to
green-blue. The detection limit is in the nanogram range, but depends on the degree of
phosphorylation of the protein. This method will detect the phosphoproteins phosvitin and
β-casein in the 40-80 ng/band and 80-160 ng/band range, respectively. The method presented
here is for staining minigels. Volumes will need to be increased for larger gels.


MATERIALS


CAUTION: See Appendix 3 for appropriate handling of materials marked with <!>.


Reagents

IMPORTANT: These reagents must be purchased as the GelCode phosphoprotein staining kit from the
Pierce Chemical Company, as the concentrations of the components are proprietary.

Acetic acid (7%) <!>
Ammonium molybdate solution <!>
Ammonium molybdate solution containing nitric acid <!>
Methyl Green solution <!>
NaOH (0.5 N) <!>
Sulfosalicylic acid solution <!>
Sulfosalicylic acid solution containing calcium chloride <!>


Equipment

Glass or plastic tray with a lid
Oven preheated to 65°C
Reciprocal or orbital shaking platform


Protein Sample

Gel containing protein samples (from Protocol 9)


METHOD


1. Place the gel (from Protocol 9) in a dish containing 50 ml of H2O. Agitate the gel on a
reciprocal or orbital shaker table for 10 minutes.

2. Move the gel to a dish containing 25 ml of sulfosalicylic acid solution, and agitate the gel
on a reciprocal or orbital shaker table for 15 minutes.

3. Place the gel in 25 ml of sulfosalicylic acid solution containing calcium chloride, and
agitate the gel on a reciprocal or orbital shaker table for 30 minutes.

4. Rinse the gel rapidly with H2O to remove the calcium chloride adhering to the surface of
the gel.

5. Transfer the gel to 25 ml of 0.5 N NaOH, cover the tray with a lid, and incubate it for 20
minutes at 65°C.

6. Place the gel in 25 ml of ammonium molybdate solution, and agitate the gel on a
reciprocal or orbital shaker table for 10 minutes.

7. Repeat Step 6.

8. Transfer the gel to 25 ml of ammonium molybdate solution containing nitric acid, and
agitate the gel on a reciprocal or orbital shaker table for 20 minutes.

9. Place the gel in 25 ml of Methyl Green solution, and agitate the gel on a reciprocal or
orbital shaker table for 20 minutes.

10. To destain, transfer the gel to 25 ml of sulfosalicylic acid solution, and agitate the gel on
a reciprocal or orbital shaker table for 15 minutes.

11. Repeat Step 10. At this time, green spots of phosphoproteins should be seen.

12. To completely destain the gel, place it in 25 ml of 7% acetic acid, and agitate the gel on a
reciprocal or orbital shaker table overnight. During the destaining, change the acetic acid
once, after the solution has become green in color.

13. Following visualization of the phosphoproteins, stain the total proteins on the gel using a
Coomassie-Blue-staining method.


PROTOCOL 15 


Tank Transfer of 2D Gels 


THIS PROTOCOL DESCRIBES A TANK TRANSFER METHOD for blotting proteins, in which a cassette
holds the sandwich of gel, membrane, and blotting paper suspended vertically in buffer. 
Electrode panels are placed parallel to the cassette. Typically, liters of buffer are required in 
the tank to maintain the current and pH throughout the transfer. The advantages of tank sys- 
tems are their efficiency of protein transfer; transfers can be run for extended periods of time 
(allowing more complete transfer) and proteins from several gels can be transferred simulta- 
neously. 


MATERIALS 


CAUTION: See Appendix 3 for appropriate handling of materials marked with <!>.


Reagents
Transfer buffers (Tris-glycine-methanol [Towbin] or CAPS buffer)
Tris-glycine-methanol buffer should be used for standard transfer and detection procedures. CAPS
buffer is used if the proteins will be subjected to amino acid analysis or sequenced from the
membrane. If proteins are to be sequenced from a membrane exposed to Tris-glycine-methanol buffer,
the membrane must be washed extensively to remove excess glycine and Tris, which will both
interfere with the subsequent analysis.


• Towbin transfer buffer

Tris-base             15.1 g
glycine <!>           72.1 g
methanol <!>          500 ml
SDS (optional) <!>    5.0 g
H2O                   to 5 liters


10% methanol is recommended for binding proteins to nitrocellulose membranes. Up to 20%
methanol can be included. Methanol helps remove SDS from the proteins and improves the
binding of the proteins to the membrane. SDS is optional, but it can be included to help improve
the transfer of large proteins.

• CAPS transfer buffer
Add 2.2 g of 3-(cyclohexyl-amino)-1-propanesulfonic acid <!> to ~600 ml of H2O. Adjust the
pH to 11 with concentrated NaOH <!>, and adjust the volume to 1 liter with H2O.
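As a cross-check on the buffer recipes above, the weighed amounts can be converted to concentrations and rescaled to other batch sizes. The sketch below is illustrative only: the molar masses are standard literature values rather than figures from this protocol, the glycine amount is taken as 72.1 g per 5 liters (the value that matches the usual 25 mM Tris/192 mM glycine Towbin formulation), and the function names are hypothetical.

```python
# Approximate molar masses in g/mol (assumed literature values, not from the protocol).
MOLAR_MASS = {"Tris base": 121.14, "glycine": 75.07, "CAPS": 221.32}

# Towbin transfer buffer as listed, per 5 liters.
TOWBIN_5L = {
    "Tris base (g)": 15.1,
    "glycine (g)": 72.1,
    "methanol (ml)": 500.0,
    "SDS, optional (g)": 5.0,
}


def millimolar(grams, molar_mass, liters):
    """Concentration in mM for a given mass dissolved to a final volume."""
    return grams / molar_mass / liters * 1000.0


def scale_recipe(recipe, from_liters, to_liters):
    """Scale every component linearly to a different final volume."""
    factor = to_liters / from_liters
    return {name: amount * factor for name, amount in recipe.items()}


if __name__ == "__main__":
    print(round(millimolar(15.1, MOLAR_MASS["Tris base"], 5.0), 1))  # about 25 mM Tris
    print(round(millimolar(72.1, MOLAR_MASS["glycine"], 5.0), 1))    # about 192 mM glycine
    print(round(millimolar(2.2, MOLAR_MASS["CAPS"], 1.0), 1))        # about 10 mM CAPS
    print(scale_recipe(TOWBIN_5L, 5.0, 1.0))                         # 1-liter batch
```

On the same basis, 500 ml of methanol in 5 liters corresponds to 10% (v/v), and 5.0 g of SDS in 5 liters to 0.1% (w/v).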


Equipment
Blotting paper
Membrane, supported nitrocellulose or PVDF
Nitrocellulose is frequently used for total protein stain methods and immunoblotting, whereas
polyvinylidene difluoride (PVDF) is required for protein sequencing.
Power supply (400 mA for small tank transfers; 1000 mA for large tank transfer units)
Transfer tank apparatus


Protein Sample
Gel containing protein samples (from Protocol 9)


METHOD 


1. Measure the SDS-PAGE gel (from Protocol 9), and cut a transfer membrane to the
appropriate size.

Membranes used for tank transfer can be cut larger or smaller than the actual gel if
necessary.

2. Wet the membrane in H2O (or methanol for PVDF) by slowly lowering the membrane
onto the surface of the liquid. Do this slowly to prevent uneven wetting of the membrane
and trapped air bubbles beneath the membrane.

3. Transfer the membrane from the wetting solution to a dish of the appropriate transfer
buffer. Allow the membrane to equilibrate in the transfer buffer for ~5 minutes.

4. Cut the blotting paper to the appropriate size, according to the transfer tank
manufacturer's instructions.

5. Assemble the transfer sandwich. It is critical to avoid trapping air bubbles between any of
the layers of the sandwich, because the bubbles will block the transfer of proteins at those
sites. It may be best to assemble the sandwich submerged in buffer, as described below in
Steps a-h.


a. Place one side of the transfer cassette in a tray of transfer buffer. 


b. Place a foam sponge onto the cassette and squeeze out any air bubbles trapped in the 
sponge. 


c. Place one sheet of blotting paper on top of the sponge making sure it wets evenly and 
completely. Remove any air bubbles by rolling a pipette or glass rod across the surface 
of the blotting paper. 


d. Place the second-dimension gel onto the surface of the blotting paper aligning the gel 
so that the sponge, blotting paper, and gel fit within the cassette. Remove any air bub- 
bles by rolling a pipette or glass rod across the surface of the gel. Be careful not to 
place too much pressure on the gel. 


e. Place the transfer membrane on top of the gel, being careful to place it properly the 
first time. Moving the membrane across the gel may cause smudged results. Carefully 
remove any air bubbles from between the membrane and the gel by rolling a pipette 
or glass rod across the surface of the membrane. It is critical that there be no air pock- 
ets between the gel and the membrane or this will prevent transfer of proteins. 


f. Layer another piece of blotting paper on top of the membrane, making sure it wets
evenly and completely. Remove any air bubbles by rolling a pipette or glass rod across
the surface of the blotting paper.


g. Place a foam sponge onto the cassette and squeeze out any air bubbles trapped in the
sponge.


h. Close the cassette and snap it shut. 


6. Repeat Step 5 for each gel that is to undergo electrophoretic transfer.

7. Fill the transfer tank approximately two-thirds full with the appropriate transfer buffer.

8. Place the transfer cassette into the tank.

EXPERIMENTAL TIP: Make sure to orient the assembly so that the electric current will
transfer the proteins from the gel to the membrane, and not from the gel to the blotting
paper and into the solution!

9. Finish filling the transfer unit to the recommended buffer level, making sure that all the
cassettes are submerged in the buffer.

10. Place the lid on the transfer unit so that the red or anode lead is closer to the membrane
than the gel.

During electrophoresis, proteins migrate from the negative cathode (black) toward the
positive anode (red).

11. If transfers are carried out for more than 1 hour, attach a recirculating water bath to the
heat exchanger and set the temperature to 15°C.

Tank transfer units are used under very high current conditions, and without adequate
cooling, damage to the apparatus can occur. A recirculating water bath will help maintain the
temperature during extended transfer times. Quick-fit connectors are recommended to
prevent leakage of the coolant when the recirculating water bath is disconnected.

12. Transfer according to the times and currents recommended by the manufacturer of the
transfer apparatus.

13. When the transfer is complete, stop the recirculating water bath, turn off the power
supply, remove the transfer lid, and remove the cassette.

14. Disassemble the transfer cassette. Rinse the sponges in deionized H2O and dry in the air
for reuse. Dispose of the blotting papers. Use a soft lead pencil to mark the side of the
membrane facing the gel.


Do not use a pen or lab marker, as many inks will solubilize in the buffers used for process- 
ing the membrane. 


15. Stain or process the blot (see Protocols 17-20) or store the membrane dry in the dark
between sheets of blotting paper.


If stored dry, the membrane must be rewet in water (nitrocellulose) or methanol (PVDF) 
prior to processing. 
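The sandwich order built in the assembly step (sponge, blotting paper, gel, membrane, blotting paper, sponge) and the orientation rule given for placing the lid can be expressed as a simple check. This is a hypothetical illustration, not part of any instrument software: the layer names and the function are invented here, and the only rule encoded is that the membrane must lie directly on the anode (red, positive) side of the gel, because proteins migrate from the cathode toward the anode.

```python
# Layers listed from the cathode (black, negative) side to the anode (red, positive) side.
TANK_SANDWICH = [
    "sponge",
    "blotting paper",
    "gel",
    "membrane",
    "blotting paper",
    "sponge",
]


def membrane_on_anode_side(layers_cathode_to_anode):
    """True if the membrane sits immediately on the anode side of the gel."""
    gel_index = layers_cathode_to_anode.index("gel")
    membrane_index = layers_cathode_to_anode.index("membrane")
    return membrane_index == gel_index + 1


if __name__ == "__main__":
    print(membrane_on_anode_side(TANK_SANDWICH))                   # correct orientation
    print(membrane_on_anode_side(list(reversed(TANK_SANDWICH))))   # cassette inserted backward
```

A cassette inserted backward fails the check, which corresponds to proteins being driven into the blotting paper and buffer instead of onto the membrane.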


PROTOCOL 16 


Semidry Blotting of 2D Gels 


SEMIDRY UNITS TRANSFER PROTEINS FROM THE GEL to the membrane when they are sandwiched
between buffer-saturated stacks of blotting paper. The electrical field is applied parallel to the
stack. Compared with tank transfer units, semidry transfer units use much less buffer, and
because the electrode panels are closer together, less voltage is required for transfer.


MATERIALS 


CAUTION: See Appendix 3 for appropriate handling of materials marked with <!>.


Reagents
Transfer buffers (Tris-glycine-methanol [Towbin] or CAPS buffer)
Tris-glycine-methanol buffer should be used for standard transfer and detection procedures. CAPS
buffer is used if the proteins will be subjected to amino acid analysis or sequenced from the
membrane. If proteins are to be sequenced from a membrane exposed to Tris-glycine-methanol buffer,
the membrane must be washed extensively to remove excess glycine and Tris, which will both
interfere with the subsequent analysis.


• Towbin transfer buffer

Tris-base             15.1 g
glycine <!>           72.1 g
methanol <!>          500 ml
SDS (optional) <!>    5.0 g
H2O                   to 5 liters


10% methanol is recommended for binding proteins to nitrocellulose membranes. Up to 20%
methanol can be included. Methanol helps remove SDS from the proteins and improves the
binding of the proteins to the membrane. SDS is optional, but it can be included to help improve
the transfer of large proteins.

• CAPS transfer buffer
Add 2.2 g of 3-(cyclohexyl-amino)-1-propanesulfonic acid <!> to ~600 ml of H2O. Adjust the
pH to 11 with concentrated NaOH <!>, and adjust the volume to 1 liter with H2O.


Equipment
Blotting paper
Membrane, supported nitrocellulose or PVDF
Power supply (400 mA)
Semidry transfer apparatus


Protein Sample
Gel containing protein samples (from Protocol 9)


METHOD


1. Measure the SDS-PAGE gel (from Protocol 9), and cut a transfer membrane and six sheets
of blotting paper to the same size as the gel.

2. Wet the membrane in H2O (or methanol for PVDF) by slowly lowering the membrane
onto the surface of the liquid. Do this slowly to prevent uneven wetting of the membrane
and trapped air bubbles beneath the membrane.

3. Transfer the membrane from the wetting solution to a dish of the appropriate transfer
buffer. Allow the membrane to equilibrate in the transfer buffer for ~5 minutes.

4. Assemble the transfer sandwich. It is critical to avoid trapping air bubbles between any of
the layers of the sandwich, because the bubbles will block the transfer of proteins at those
sites.


a. Wet three pieces of blotting paper in transfer buffer and stack them onto the anodic 
electrode panel of the transfer unit. Roll a glass rod across the stack to remove any air 
bubbles and to make sure the stack is in uniform contact with the electrode panel. 


b. Place the transfer membrane onto the surface of the blotting paper stack. 


c. Lay the gel on top of the membrane, being careful to place it properly the first time. 
Moving the gel across the membrane may cause smudged results. Roll a glass rod 
across the surface of the gel to remove any air bubbles. 


d. Wet the other three pieces of blotting paper in transfer buffer, and stack them on top
of the gel. Roll a glass rod across the entire stack to remove trapped air. 


e. Place the cathode panel onto the transfer stack and connect the safety interlock to the 
base. 


During electrophoresis, proteins migrate from the negative cathode (black) toward the 
positive anode (red). 


5. Transfer according to the times and currents recommended by the manufacturer of the
transfer apparatus.

6. When the transfer is complete, turn off the power supply and remove the transfer lid.

7. Disassemble the transfer stack. Dispose of the blotting papers. Use a soft lead pencil to
mark the side of the membrane facing the gel.

Do not use a pen or lab marker, as many inks will solubilize in the buffers used for
processing the membrane.

8. Stain or process the blot (see Protocols 17-20) or store the membrane dry in the dark
between sheets of blotting paper.


If stored dry, the membrane must be rewet in water (nitrocellulose) or methanol (PVDF) 
prior to processing. 


INTRODUCTION TO PROTOCOLS 17-20 


Detection of Proteins on Membranes 

THE TOTAL PROTEIN COMPLEMENT BOUND TO A MEMBRANE can be detected using a variety of
staining methods (see Table 4.9). For microsequencing, the membrane can be stained to
detect all of the bound protein, and the desired spots are then excised from the membrane
for analysis.

For the recognition of specific proteins on a membrane, many different antibodies are
available, including radiolabeled antibodies, horseradish-peroxidase-linked or
alkaline-phosphatase-linked antibodies, and biotinylated antibodies. In addition, detection can be carried
out by several different methods, such as autoradiography, chemiluminescence, or
colorimetric procedures. The difficulty of western blotting with 2D electrophoresis is aligning the
spots detected by the bound antibody to an identical sample that has been stained with a total
protein stain. The following procedures are some of the ways in which the total protein
complement on 2D gels can be detected.


TABLE 4.9. Stains for the detection of total protein on a membrane

Stain            Detection limit   Nitrocellulose   PVDF   Comments                     Compatible with immunoblotting   Protocol
Ponceau S        1 µg              +                +      reversible                   yes                              17
Coomassie R250   1 µg              +                +      permanent; high background   yes                              18
India Ink        100 ng            +                +      permanent                    -                                19
Colloidal Gold   3 ng              +                +      permanent                    no                               20
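To show how Table 4.9 might be used when choosing a stain, the sketch below encodes the table rows and filters them by the amount of protein expected per spot. It is an illustration under stated assumptions: the detection limits for Ponceau S and Coomassie R250 are read as 1 µg, the India Ink immunoblotting entry (a dash in the table) is treated as unknown, and the names `STAINS` and `candidate_stains` are hypothetical.

```python
# Rows of Table 4.9; detection limits converted to nanograms.
# immunoblot: True/False as listed in the table, None where the table gives no entry.
STAINS = [
    {"name": "Ponceau S", "limit_ng": 1000, "immunoblot": True, "protocol": 17},
    {"name": "Coomassie R250", "limit_ng": 1000, "immunoblot": True, "protocol": 18},
    {"name": "India Ink", "limit_ng": 100, "immunoblot": None, "protocol": 19},
    {"name": "Colloidal Gold", "limit_ng": 3, "immunoblot": False, "protocol": 20},
]


def candidate_stains(protein_ng, need_immunoblot=False):
    """Names of stains whose detection limit is at or below the protein amount.

    If need_immunoblot is True, keep only stains listed as compatible.
    """
    matches = []
    for stain in STAINS:
        if stain["limit_ng"] > protein_ng:
            continue  # spot would be below this stain's detection limit
        if need_immunoblot and stain["immunoblot"] is not True:
            continue  # listed as incompatible, or compatibility not given
        matches.append(stain["name"])
    return matches


if __name__ == "__main__":
    print(candidate_stains(50))
    print(candidate_stains(2000, need_immunoblot=True))
```

Detection limits in practice vary with the protein and the membrane, so the thresholds should be read as rough guides rather than hard cutoffs.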


PROTOCOL 17 


Staining Membrane-bound Proteins with Ponceau S


BECAUSE PONCEAU S IS RELATIVELY INSENSITIVE (~1 µg of protein), only the most abundant
proteins will be visible. However, it is a reversible stain that can be removed completely with
H2O prior to processing the blots (Salinovich and Montelano 1986). After staining, a soft lead
pencil can be used to record the presence of visible proteins and molecular-weight markers,
which will help when aligning the proteins detected on the membrane by western analysis
with those in a total protein-stained gel or membrane. Ponceau S is compatible with both
nitrocellulose and PVDF membranes.


MATERIALS


CAUTION: See Appendix 3 for appropriate handling of materials marked with <!>.


Reagent
Ponceau S (0.1% w/v)/acetic acid (5% v/v) <!>


Protein Sample
Membrane containing bound proteins transferred in Protocol 15 or 16


METHOD


1. Submerge the transfer membrane in Ponceau S stain solution with gentle agitation for 5
minutes.


2. Decant the stain and rinse the membrane several times with H2O until the protein bands
are visible.


Do not reuse the stain; it will result in nonreproducible results due to depletion of the dye
after the first use.


3. Use a soft lead pencil to mark the major protein spots and molecular-weight markers.


4. Continue rinsing the membrane with H2O with gentle agitation until the Ponceau S is
removed.


5. Proceed with protein characterization or analysis (see Chapters 6, 7, and 8).


PROTOCOL 18 


Staining Membrane-bound Proteins with 
Coomassie Blue R250 


COOMASSIE BLUE R250 PERMANENTLY STAINS MEMBRANE-BOUND PROTEINS and is compatible
with PVDF and nitrocellulose membranes, but incompatible with nylon membranes. This
technique is relatively insensitive, with a detection limit of ~1.5 µg of protein. One drawback
of Coomassie Blue staining is that it produces a high background which can make interpre- 
tation of results difficult. 


MATERIALS 


CAUTION: See Appendix 3 for appropriate handling of materials marked with <!>.


Reagents
Coomassie Blue R250 (0.1% w/v)/methanol (50% v/v) <!>
Methanol (50%) containing 10% acetic acid <!>


Protein Sample
Membrane containing bound proteins transferred in Protocol 15 or 16


METHOD 


1. Submerge the membrane in the Coomassie Blue stain solution for 5 minutes with gentle 
agitation. 


2. Decant the stain, and destain the membrane on a rocker table with several changes of 
50% methanol containing 10% acetic acid. 


Do not reuse the stain; it will result in nonreproducible results due to depletion of the dye 
after the first use. 


3. Rinse the membrane with H2O for 5 minutes with gentle agitation.


PROTOCOL 19


Staining Membrane-bound Proteins with India Ink 


INDIA INK PERMANENTLY STAINS PROTEINS BLACK against the white background of the
membrane. It can be used with both nitrocellulose and PVDF membranes, but it is not
recommended for nylon membranes. This technique is sensitive, detecting down to ~100 ng of
protein.


MATERIALS 


CAUTION: See Appendix 3 for appropriate handling of materials marked with <!>.


Reagents
India Ink stain
Just before use, dilute 1 µl of India Ink (Pelikan 17 black) per 1 ml of Tween-20 solution.


Phosphate-buffered saline (PBS) (pH 7.2)
0.15 M NaCl
0.01 M Na2HPO4/NaH2PO4 <!>
Tween-20 solution (0.3% v/v) in PBS
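The dilutions in the reagent list reduce to simple proportional arithmetic, sketched below for an arbitrary working volume. The function name and returned keys are hypothetical; the only inputs taken from the protocol are 1 µl of India Ink per 1 ml of Tween-20 solution and 0.3% (v/v) Tween-20 in PBS.

```python
def india_ink_stain_recipe(stain_volume_ml):
    """Volumes needed to prepare a given volume of India Ink stain.

    Uses 1 µl of ink per ml of Tween-20 solution and 0.3% (v/v) Tween-20 in PBS.
    The small volume contributed by the ink itself is ignored.
    """
    tween_ml = stain_volume_ml * 0.3 / 100.0   # 0.3% v/v Tween-20
    pbs_ml = stain_volume_ml - tween_ml        # remainder is PBS
    ink_ul = stain_volume_ml * 1.0             # 1 µl ink per ml of solution
    return {"Tween-20 (ml)": tween_ml, "PBS (ml)": pbs_ml, "India Ink (µl)": ink_ul}


if __name__ == "__main__":
    print(india_ink_stain_recipe(50.0))
```

For a 50-ml batch this works out to 0.15 ml of Tween-20, 49.85 ml of PBS, and 50 µl of ink.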


Protein Sample
Membrane containing bound proteins transferred in Protocol 15 or 16 


METHOD 


1. Submerge the membrane in the Tween-20 solution for 10 minutes with gentle agitation.

2. Discard the Tween-20 solution.

3. Repeat Steps 1 and 2 three more times.

4. Submerge the membrane in India Ink stain with gentle agitation. Stain the membrane for
2-18 hours.


5. Discard the stain, and rinse the membrane with Tween-20 solution for 5 minutes with 
gentle agitation. 


PROTOCOL 20 


Staining Membrane-bound Proteins with Colloidal Gold


COLLOIDAL GOLD IS THE MOST SENSITIVE STAINING TECHNIQUE, detecting as little as 1-3 ng of
protein (Moeremans et al. 1985). Protein spots are permanently stained a dark red after
incubation with the colloidal gold solution. Colloidal gold staining can detect proteins on both
nitrocellulose and PVDF membranes, but it is not recommended for nylon membranes.


MATERIALS 


CAUTION: See Appendix 3 for appropriate handling of materials marked with <!>.


Reagents
Colloidal gold stain solution (Amersham Biosciences, Bio-Rad, and Diversified Biotech)
Phosphate-buffered saline (PBS) (pH 7.2)
0.15 M NaCl
0.01 M Na2HPO4/NaH2PO4 <!>
Tween-20 solution (0.3% v/v) in PBS


Protein Sample
Membrane containing bound proteins transferred in Protocol 15 or 16 


METHOD 


1. Submerge the membrane in Tween-20 solution for 45 minutes at 37°C with gentle agita- 
tion. 


2. Wash the membrane with Tween-20 solution for 5 minutes at room temperature with 
gentle agitation. 


3. Discard the wash solution, and repeat Step 2 two more times. 


4. Stain the membrane in the colloidal gold stain solution for 2 hours at room temperature 
with gentle agitation. 


5. Rinse the membrane with H2O for 5 minutes with gentle agitation.


Two-dimensional Electrophoresis Resources
on the World Wide Web


A WEALTH OF INFORMATION EXISTS for dealing with 2D electrophoresis and proteomics on the 
World Wide Web. Below is only a partial list of the Web Sites available with techniques, 
information on meetings, classes, 2D gel databases, and information on the newest equip- 
ment and software for working with 2D electrophoresis. 


Commercial Two-dimensional Electrophoresis and Proteomics Sites 


Amersham Biosciences 
http://proteomics.apbiotech.com 

BioRad Proteomics Workstation 
http://www.proteomeworks.bio-rad.com/ 

Genomic Solutions 
http://www.genomicsolutions.com/proteomics 

Large Scale Biology Home Page 
http://www.lsbc.com


Two-dimensional Analysis Software Sites


Phoretix
http://www.phoretix.com/
Melanie III
http://www.genebio.com/Melanie.html
PDQuest
http://proteomeworks.bio-rad.com/html/tech5.html
Flicker for 2D gel analysis
http://www-lecb.ncifcrf.gov/flicker/
NCI/FCRDC LMMB Image Processing Section (GELLAB software)
http://www.lecb.ncifcrf.gov/lemkin/gellab.html
Compugen (Z3 software)
http://www.2Dgels.com


Two-dimensional Electrophoresis and Proteomics Databases 


Expasy Index to 2D PAGE databases and services 
http://www.expasy.ch/ch2d/2d-index.html

HSC 2-DE Gel Protein Databases list
http://www.harefield.nthames.nhs.uk/nhli/protein/other_sites.html

Argonne Protein Mapping Group Server
http://www.anl.gov/BIO/

Phosphoprotein Database
http://www-lecb.ncifcrf.gov/phosphoLB/

Rat Serum Protein Database
http://linux.farma.unimi.it

Parasite Proteome Maps
http://www.aber.ac.uk/~mpgwww/Proteome/ProtMaps.html

Cambridge Proteomics Facility
http://www.bio.cam.ac.uk/proteomics/index.html

Rice 2D Database
http://semele.anu.edu.au/2d/2d.html

Plant Plasma Membrane 2D Database
http://sphinx.rug.ac.be:8080/ppmdb/index.html

COMPLUYEAST-2DPAGE Database
http://babbage.csc.ucm.es/2d/2d.html

Danish Centre for Human Genome Research 2D PAGE Databases (Aarhus)
http://proteomics.cancer.dk

SIENA-2DPAGE
http://www.bio-mol.unisi.it/2d/2d.html

PMMA-2D PAGE at Purkyne Military Medical Academy, Czech
http://www.pmma.pmfhk.cz/

Embryonal Stem Cells (Immunobiology, University of Edinburgh)
http://www.dur.ac.uk/~dbl0nh1/2DPAGE/

Human Colon Carcinoma Protein Database, Joint ProteomicS Laboratory (JPSL), Ludwig
Institute for Cancer Research, Melbourne, Australia
http://www.ludwig.edu.au/jpsl/jpslhome.html

HP-2DPAGE (Max Delbruck Center, Berlin)
http://www.mdc-berlin.de/~emu/heart/

MitoDat: Mendelian Inheritance and the Mitochondrion
http://www-lmmb.ncifcrf.gov/mitoDat/

SWISS-2DPAGE at Geneva University Hospital
http://www.expasy.ch/ch2d/ch2d-top.html

YPD Yeast Protein Database at Proteome, Inc.
http://www.proteome.com/YPDhome.html

Yeast Proteome Map
http://www.ibgc.u-bordeaux2.fr/YPM/


Sources of Information and Methods on 2D Electrophoresis
and Proteomics 


Australian Proteome Analysis Facility 
http://www.proteome.org.au/ 

The Tubingen Proteome Project 
http://www.uni-tuebingen.de/uni/kxm/Proteome/ 

University of Aberdeen Protein Lab and Proteomics Facility 
http://www.abdn.ac.uk/~mmb023/proteome/index.htm 

The EXPASY Swiss 2D-PAGE site provides methods, reagents, and links to Proteome sites
worldwide.
http://www.expasy.ch/

The British Electrophoresis Society site also has excellent links to relevant sites and databases
worldwide.
http://www.proteinworks.com/bes/

The HSC-2D PAGE 2-DE Gel Protein Databases site at Harefield Hospital in London provides
links to worldwide databases, as well as information pertaining to upcoming meetings, 2D gel
analysis software, and more.
http://www.harefield.nthames.nhs.uk/nhli/protein/

The laboratory of Professor A. Görg in Munich has been instrumental in developing the
technology of isoelectric focusing with immobilized pH gradients. The entire Görg laboratory
manual is available on-line from this site.
http://www.edv.agrar.tu-muenchen.de/blm/deg/


The laboratory of Dr James R. Jefferies, Parasitology Group, Institute of Biological Sciences, 
University of Wales at Aberystwyth, Aberystwyth, Ceredigion, SY23 3DA, Wales, UK. 
http://www.aber.ac.uk/%7Empgwww/Proteome/Tut_2D.html#Section%201 
REVERSED-PHASE CHROMATOGRAPHY, WHICH SEPARATES MOLECULES based on their reversible
interaction with the hydrophobic surface of a chromatographic medium, is valuable for the 
separation of proteins and peptides. It has rapidly become the most widely used branch of 
high-performance liquid chromatography (HPLC) since the mid-1970s, and it has found 
wide application for the separation, purification, and analysis of small molecules such as 
peptides. As a result, a great deal of experience has accumulated concerning the physico- 
chemical phenomena underpinning the mechanism of small-molecule separations. 
Reversed-phase (RP)-HPLC is the method of choice for the final polishing of peptides and is 
ideal for analytical separations, such as peptide mapping (see Chapter 7). 

Since the mid-1980s, RP-HPLC has been used increasingly for the separation of proteins
on both analytical and preparative scales. However, RP-HPLC is not recommended for
protein purification if recovery of activity and return to a correct tertiary structure are required,
because many proteins are irreversibly denatured in the presence of organic solvents.
Chromatography of proteins on reversed-phase supports also suffers from problems
associated with poor recoveries, broad misshapen peaks, and "ghosting" (i.e., reappearance of a
protein peak on subsequent chromatographic runs). Although many of these problems can
be overcome by judicious choice of column usage and careful optimization of extra-column
variables such as sample pretreatment and mobile-phase and hardware considerations
(Hearn 1991b, 1998), the application of this technique to the purification of proteins still
requires careful evaluation.

The following issues must be considered before employing RP-HPLC. The biological
activities of many proteins are sensitive to extremes of pH, exposure to organic solvents, or
high salt concentrations, which lead to unfolding and denaturation. Physical losses of
protein, especially at low concentrations, can also occur by adsorption onto glass and
hydrophobic matrices. However, for smaller proteins (Mr <30,000), some denaturation effects are
minimal (or rapidly reversible), and RP-HPLC can therefore be successfully used to isolate them
in a biologically active form (Rivier and McClintock 1989; Simpson and Nice 1989). For a
discussion on the biological activity of polypeptides after RP-HPLC and attempts to increase
the recovered bioactivity, see Welinder (1988). RP-HPLC is an extremely useful tool for the
micropreparative purification of proteins and peptides prior to microsequencing using the
Edman degradation procedure (Simpson et al. 1989a; Simpson and Nice 1989) (Chapter 6),
mass-spectrometry-based techniques (Chapter 8), and various peptide-mapping procedures
(Chapter 7). In this chapter, the basic separation principles of RP-HPLC are discussed, along
with protocols for


* the packing of RP-HPLC columns,

* separation of large troublesome polypeptides,

* purification of synthetic peptides,

* desalting of protein samples, and

* computer-assisted method development to optimize gradient conditions.


REVERSED-PHASE CHROMATOGRAPHY SEPARATES MOLECULES BASED ON 
HYDROPHOBIC INTERACTIONS 


RP-HPLC separation of proteins is based predominantly on reversible hydrophobic
interactions between amino acid side chains on the protein with the hydrophobic surface of the
chromatographic medium, i.e., the stationary phase (Figure 5.1). The nature of the
hydrophobic binding interaction is generally considered to be the result of favorable entropy
effects. The initial starting conditions of RP-HPLC are primarily aqueous, in which a high
degree of organized water structure surrounds both the protein molecules and the
immobilized ligand (stationary phase). As protein binds to the stationary phase, the amount of
hydrophobic area on the surface of the protein(s) exposed to the solvent is minimized. Thus,
the degree of organized water is decreased with a concomitant favorable increase in entropy
of the system. For this reason, it is advantageous, under these solvent conditions, for proteins
to associate with the stationary phase upon loading onto the RP-HPLC column.
Mobile-phase composition is then subsequently modified so that bound proteins are differentially
eluted (desorbed) back into the mobile phase. The order of protein desorption is based on
their relative hydrophobicity (i.e., least hydrophobic proteins elute first, followed by proteins
in increasing order of their surface hydrophobicity). Because the functional groups on the
column support are very hydrophobic in nature, protein binding is usually very strong and
requires the use of organic solvents and other additives (ion-pairing agents) in the mobile
phase for elution. Column development (elution) is usually performed by increasing the
organic solvent concentration, typically acetonitrile (either in a stepwise fashion or gradient
manner). By these means, proteins, which are concentrated (or trace enriched; see below)
during the binding and separation process, are collected in a purified and concentrated form.
For a detailed discussion of chromatographic theory, see the Further Reading section at the
end of this chapter.


FIGURE 5.1. Principle of RP-HPLC with gradient elution. (Redrawn, with permission, from publ. no.
18-1134-16, Amersham Biosciences 2002.)
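The idea of gradient elution described above can be illustrated with a short numerical sketch. It is a deliberately simplified model, not a description of real retention behavior: each protein is assigned a single hypothetical acetonitrile percentage at which it desorbs, the gradient is assumed to be perfectly linear with no delay volume, and all names and values are invented for illustration.

```python
def percent_b(t_min, start_b, end_b, duration_min):
    """Organic solvent content (%B) at time t for a linear gradient."""
    if t_min <= 0:
        return start_b
    if t_min >= duration_min:
        return end_b
    return start_b + (end_b - start_b) * t_min / duration_min


def elution_time(desorb_b, start_b, end_b, duration_min):
    """Time (min) at which a linear gradient reaches the %B that desorbs a protein."""
    return (desorb_b - start_b) / (end_b - start_b) * duration_min


def elution_order(desorption_points):
    """Protein names sorted from least to most hydrophobic (lowest %B first)."""
    return sorted(desorption_points, key=desorption_points.get)


if __name__ == "__main__":
    # Hypothetical desorption points in % acetonitrile.
    proteins = {"protein A": 45.0, "protein B": 20.0, "protein C": 33.0}
    print(elution_order(proteins))
    for name in elution_order(proteins):
        print(name, elution_time(proteins[name], 0.0, 60.0, 60.0), "min")
```

In this toy model a 0-60% gradient over 60 minutes rises by 1% per minute, so the least hydrophobic protein leaves the column first and the most hydrophobic last.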


STANDARD CHROMATOGRAPHIC CONDITIONS FOR RP-HPLC 


Before embarking on RP-HPLC of proteins, it is useful to consider the optimal chromatographic
support, mobile phase (especially the organic modifier [solvent], mobile-phase pH,
and ion-pairing reagent), and column dimensions. These parameters are discussed in turn
below. A procedure outlining standard chromatographic conditions for RP-HPLC is given in
Protocol 1. A generic column-packing method (slurry packing procedure) for columns of
varying internal diameters is given in Protocol 2. Procedures for chromatography of large
troublesome polypeptides and also synthetic peptides are given in Protocols 3 and 4,
respectively.


Choice of Chromatographic Support 


Silica-based reversed-phase materials are the most widely used packings for HPLC of peptides
and proteins because they are mechanically strong and provide high separation efficiencies.
Of these, large-pore-size, silica-based matrices (≥300 Å) or "macroporous" packings with a
5-10-µm range of particle sizes, containing C3-, C4-, C8 (octyl)-, or C18 (octadecyl)-n-alkyl chains,
provide a good general utility for analytical RP-HPLC. Shorter-chain bonded phases (e.g., C3
and C4) are preferred for more hydrophobic samples, and longer-chain phases (e.g., C18) are
preferred for hydrophilic samples. For large peptides and proteins of ~10-100K Mr, 300-Å
packings seem to offer the best overall performance. Although even larger pore size packings
(1000-4000 Å) have been investigated for very large proteins, they have not gained widespread
usage due to their poor mechanical stability.

If working with complex peptide or protein mixtures, and provided all other column
variables are the same, it is better to use a 300-Å pore-size packing material to ensure that
restricted diffusion or exclusion from the pores is not encountered for very large polypeptides
or proteins. Because theory predicts that chromatographic performance should increase as
particle size goes down, smaller particles are preferred. However, several limitations may be
encountered (namely, cost and high column back pressure) that will influence this decision.
For analytical separations (≤1 mg), most columns have 5-10-µm particles, and marginal
differences in chromatographic efficiency are observed over this range. Although columns with
2-3-µm particles are commercially available, the small gains in performance are often offset
by higher column back pressures, greater susceptibility to column blockage (resulting in
shorter column life), and cost considerations. For preparative applications involving 10-300
mg of sample, columns containing particle sizes of 10-20 µm offer a reasonable compromise
between cost and performance, whereas for large-scale separations (>1 g), columns containing
20-40-µm particles are often the best choice. For a list of some commercially available
macroporous columns suitable for protein separations, see Table 5.1 (see also Table 5.2 for a
list of useful URL addresses for seeking HPLC information such as columns, loose packing
materials, troubleshooting, and column fittings). For a selection of examples of polypeptides
purified by RP-HPLC, along with the column and mobile-phase conditions employed, see
Table 5.3.

With the availability of moderately priced column packing equipment (e.g., Shandon
pumps) and loose macroporous packings (e.g., Vydac C4; The Separations Group, Hesperia,
California), it is possible to pack columns that exhibit excellent separation of proteins and
peptides (Nice et al. 1979). Indeed, simple separations may be obtained even with relatively
inferior column packing technique. For a description of a generic column-packing method
(slurry packing procedure) for columns of varying internal diameters, see Protocol 2.


Choice of Mobile Phase 


In RP-HPLC, the mobile phase is generally composed of three components: (1) an organic
solvent, (2) an aqueous "buffer" component, and (3) an ion-pairing agent to optimize
selectivity. Utmost care must be taken in choosing only the highest grade of reagents for
RP-HPLC, because any contaminants in the mobile phase will be concentrated (trace-enriched)
on the column and will affect the subsequent chromatography by producing unwanted
(spurious) extra peaks or ghost peaks, and thus may contaminate the target protein. All organic
solvents must be of the highest purity (HPLC-grade), the buffering salts and ion-pairing
agents must be of the highest chemical purity (free of metal ions), and highly polished
(deionized) water is a necessary requirement for the preparation of mobile phases. It is
important that a "blank run" (i.e., no sample injected) be performed before running any
samples to monitor the baseline for evidence of any contaminants in the mobile phase (see
Protocol 1).


Organic solvent 


With conventional RP-HPLC, an organic solvent (modifier) is added to the aqueous mobile
phase to lower its polarity, thereby causing proteins to elute from the column. The lower the
polarity of the mobile phase, the greater its eluting strength in reversed-phase chromatography.
Because there is a strong dependence between the relative retention (or capacity factor, k'; see
panel below) of individual proteins and the concentration of organic modifier required for
their elution (Figure 5.2), the separation of a mixture of proteins of slightly different relative
hydrophobicities requires gradient elution.


TABLE 5.1. Representative macroporous reversed-phase packing materials for HPLC

Name                          Supplier                  Support                     Function        Particle size (µm)  Pore diam. (Å)  Comments
RP-300                        Brownlee                  silica                      C8              7                   300
BU-300                        Brownlee                  silica                      C4              7                   300
Phenyl                        Brownlee                  silica                      phenyl          5                   300
Ultrapore-C3                  Beckman                   silica                      C3              5                   300
Ultrapore-C8                  Beckman                   silica                      C8              5                   300
Bakerbond WP-Octadecyl        J.T. Baker                silica                      C18             5                   300
Bakerbond WP-Octyl            J.T. Baker                silica                      C8              5                   300
Bakerbond WP-Butyl            J.T. Baker                silica                      C4              5                   300
Zorbax StableBond 300SB-C18   Agilent                   silica                      C18             3.5, 5, 7           300
Zorbax StableBond 300SB-C8    Agilent                   silica                      C8              3.5, 5, 7           300
Zorbax StableBond 300SB-C3    Agilent                   silica                      C3              3.5, 5, 7           300
Partisil ODS, ODS2, ODS3, C8  Whatman                   silica                      C18, C8         5, 10               350
Vydac 201TP, 218TP, 238TP     Separations Group         silica                      C18             3, 5, 10            300             Available as loose packing
Vydac 218MS                   Separations Group         silica                      C18             5, 10               300
Vydac 208TP                   Separations Group         silica                      C8              5, 10               300             Available as loose packing
Vydac 214TP                   Separations Group         silica                      C4              3, 5, 10            300             Available as loose packing
Vydac 214MS                   Separations Group         silica                      C4              5, 10               300
BioBasic C18                  Thermo Hypersil-Keystone  silica                      C18             5                   300             High-purity silica, base deactivated
BioBasic C8                   Thermo Hypersil-Keystone  silica                      C8              5                   300             High-purity silica, base deactivated
BioBasic C4                   Thermo Hypersil-Keystone  silica                      C4              5                   300             High-purity silica, base deactivated
HyperREZ XP RP-300            Thermo Hypersil-Keystone  polystyrene/divinylbenzene                  5                   300             Rigid bead stable at pH 0-14
Hamilton PRP-3                Hamilton                  polystyrene/divinylbenzene                  3, 5, 10            300
Discovery BIO Wide Pore C18   Supelco                   silica                      C18             3, 5, 10            300
Discovery BIO Wide Pore C8    Supelco                   silica                      C8              5, 10               300
Discovery BIO Wide Pore C5    Supelco                   silica                      C5              5, 10               300
PLRP-S                        Polymer Labs              polystyrene/divinylbenzene                  5, 8                300
Nucleosil C18, C8, C4         Macherey Nagel            silica                      short chain RP  5, 7, 10            300
LiChrospher WP 300 RP-18      Merck                     silica                      C18             5, 12, 15           300
HAISIL 300 C4, C8, C18        Higgins Analytical        silica                      C4, C8, C18     5, 10               300
TSK Octadecyl-NPR             TSK                       silica                      C18             2.5                 nonporous
TSK Phenyl-5PW-RP             TSK                       hydrophilic bonded polymer  phenyl          10

Kindly compiled by Robert L. Moritz (Joint Proteomics Laboratory of the Ludwig Institute for Cancer Research and the Walter and Eliza Hall Institute of
Medical Research, Melbourne, Australia).


TABLE 5.2. Useful URL addresses for HPLC information

Chromatographic column suppliers
ZirChrom Separations, Inc.                                  http://www.zirchrom.com/
Phenomenex                                                  http://www.phenomenex.com
Waters Corporation                                          http://www.waters.com
Agilent Technologies                                        http://we.home.agilent.com
TOSOH-HAAS                                                  http://www.tosohbiosep.com/
Amersham Biosciences                                        http://www.apbiotech.com
Merck                                                       http://www.merck.de
Macherey-Nagel                                              http://www.macherey-nagel.com
Thermo-Hypersil                                             http://www.thermohypersil.com
LC Packings                                                 http://www.lcpackings.com

Purveyors of chromatographic parts and associated equipment
Western Analytical Products, Inc. (Murrieta, California)    http://www.hplcsupply.com
                                                            (Old site http://www.netwizards.net/~wap/)
The Nest Group                                              http://www.nestgrp.com/
Ionsource                                                   http://www.ionsource.com/links/links.htm
                                                            (click on Vendors LC)

Chromatographic part manufacturers and suppliers
Upchurch Scientific                                         http://www.upchurch.com
Alltech                                                     http://www.alltechweb.com


The capacity factor (k'), also referred to as the retention factor, is normalized retention
under isocratic elution conditions. It is a unitless term that is a measure of the retention
behavior (i.e., the degree of retention) for a particular solute on a particular column,
assuming equal flow rates. The capacity factor k' can be calculated by the following equation:
$k' = (t_R - t_0)/t_0 = (V_R - V_0)/V_0$, where k' is the number of column volumes required to
elute a particular solute, $t_R$ and $V_R$ are the retention time and retention volume of the
solute, and $t_0$ and $V_0$ represent the void time and void volume, respectively.
Thus, k' is directly related to the distribution coefficient (or partition coefficient) of a
solute between the mobile and stationary phases (i.e., moles of solute in stationary phase
per moles of solute in mobile phase) and is now well understood in both empirical and
thermodynamic terms.
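
As an illustration of the definition in the panel above, the following minimal Python sketch computes k' from a retention time and a void time (or, equivalently, from the corresponding volumes). The function name `capacity_factor` and the numerical values are hypothetical and are chosen only to show the arithmetic.

```python
def capacity_factor(t_r: float, t_0: float) -> float:
    """Return k' = (t_R - t_0) / t_0 under isocratic conditions.

    t_r: retention time (or retention volume) of the solute
    t_0: void time (or void volume), in the same units as t_r
    """
    if t_0 <= 0:
        raise ValueError("void time/volume must be positive")
    if t_r < t_0:
        raise ValueError("retention cannot be smaller than the void value")
    return (t_r - t_0) / t_0


# Hypothetical example: void time 1.5 min, solute elutes at 6.0 min.
k_prime = capacity_factor(6.0, 1.5)
print(f"k' = {k_prime:.2f}")  # k' = 3.00
```

Because k' is a ratio, times and volumes give the same result provided the flow rate is constant during the run.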


The most common organic solvents used in RP-HPLC, and their order of eluotropic
strength, are 1-propanol > 2-propanol > tetrahydrofuran ~ dioxane ~ ethanol > acetonitrile
~ methanol (Mahoney and Hermodson 1980) (see also Table 5.4). In practice, the most widely
used organic modifiers are acetonitrile and methanol, the most popular choice being
acetonitrile. Although isopropanol (2-propanol) is often used because of its strong eluting
properties, it is limited because of its high viscosity (2.30 cPoise), which results in lower column
efficiencies and high back pressures. However, the lower polarity of isopropanol makes it an
excellent solvent for cleaning the reversed-phase column. Both acetonitrile (0.37 cPoise) and
methanol (0.60 cPoise) are less viscous than isopropanol. Acetonitrile, methanol, and
isopropanol are essentially UV-transparent, which is an essential requirement for RP-HPLC
because column elution is typically monitored by UV absorbance (Table 5.5). UV
transparency is particularly important because most RP-HPLC separations (especially peptides)
TABLE 5.3. Selected examples of polypeptides and proteins purified by RP-HPLC

Protein purified                Column                  Hydrophobic chain Mobile phase                                Reference
acid phosphatase                µBondapak C18           n-octadecyl       0.1% TFA, 12-70% CH3CN                      Witting et al. (1984)
aldolase                        Nucleosil C18           n-octadecyl       0.1% TFA, ethanol/butanol/methoxyethanol    van der Zee and Welling (1982)
bacteriorhodopsin               µBondapak C18           n-octadecyl       5% formic acid, 40-80% ethanol              Gerber et al. (1979)
bacteriorhodopsin               Nucleosil C18           n-octadecyl       50 mM KH2PO4, 10-50% 2-methoxyethanol       Monch and Dehnen (1978)
bovine serum albumin            LiChrospher C8          n-octyl           400 mM Pyr-formate, 0-40% nPrOH             Lewis et al. (1980)
C-apolipoproteins               µBondapak alkylphenyl   n-propyl-phenyl   1% TEAP, 10-50% CH3CN                       Hancock et al. (1981)
carbonic anhydrase              Ultrasphere C8          n-octyl           10 mM TFA, 0-45% nPrOH                      Cooke et al. (1983)
carbonic anhydrase              Ultrasphere C3          n-propyl          155 mM NaCl (pH 2.1), 0-75% CH3CN           Cooke et al. (1983)
chorionic gonadotropin          µBondapak C18           n-octadecyl       0.1% TFA, 0-60% CH3CN                       Putterman et al. (1982)
chymotrypsinogen                LiChrospher C4          n-butyl           10 mM H3PO4, 0-45% nPrOH                    Cohen et al. (1984)
collagen chains                 Bakerbond diphenyl      diphenyl          0.1% TFA, 0-50% CH3CN                       Skinner et al. (1984)
epidermal growth factor         U-ODS                   n-octadecyl       0.2% HFBA, 0-50% CH3CN                      Burgess et al. (1982)
ferritin                        Nucleosil C18           n-octadecyl       50 mM KH2PO4, 10-50% 2-methoxyethanol       Monch and Dehnen (1978)
globin chains                   Pharmacia ProRPC        n-octyl           0.3% TFA, 39-50% CH3CN                      Jeppsson et al. (1984)
growth hormone                  LiChrospher C4          n-butyl           100 mM NH4HCO3, 0-50% CH3CN                 Grego et al. (1984)
histone proteins                µBondapak C18           n-octadecyl       0.2% TFA, 0-50% CH3CN                       Gurley et al. (1984)
human fibroblast interferon     LiChrospher C8          n-octyl           0.8 M Pyr, 1 M formic acid, 0-60% CH3CN     Stein et al. (1980)
inhibin, follicular             Ultrasphere C3          n-propyl          0.1% TFA, 0-50% CH3CN                       Grego and Hearn (1984)
interleukin-2                   LiChroprep RP8          n-octyl           0.9 M Pyr-acetate, 20-60% nPrOH             Wolfe et al. (1984)
leukocyte interferon            W-DP                    n-octadecyl       50 mM KH2PO4, 2-methoxyethanol-PrOH         Herring and Enns (1983)
α2-macroglobulin                N-ODS                   n-octadecyl       33 mM NaOAc, 10-50% ethanol (pH 5.2)        Sottrup-Jensen et al. (1984)
ovalbumin                       Unspecified C8          n-octyl           0.1% TFA, 0-60% iPrOH                       Kopaciewicz and Regnier (1983)
papain                          LiChrospher C4          n-butyl           10 mM H3PO4, 0-45% nPrOH                    Cohen et al. (1984)
parvalbumin                     LiChrospher C18         n-octadecyl       50 mM Tris-HCl, 0.1 mM CaCl2, 0-70%         Berchtold et al. (1982)
phosphorylase                   Ultrasphere C3          n-propyl          155 mM NaCl-HCl, 0-50% nPrOH                O'Hare et al. (1982)
platelet-derived growth factor  LiChrospher C8          n-octyl           0.1% TFA, 0-50% iPrOH                       Chesterman et al. (1983)
rhodopsin                       Various                 various           0.1% TFA, CH3CN-PrOH                        Berchtold et al. (1982)
ribosomal 50S proteins          Ultrapore RPSC          n-octadecyl       0.1% TFA, 10-80% CH3CN                      Nick et al. (1985)
ribosomal proteins              Synchropak RPP          n-octadecyl       0.1% TFA, 15-75% CH3CN                      Nick et al. (1985)
Sendai viral proteins           Toyo Soda               trimethyl         0.2% TFA, 25-75% CH3CN                      van der Zee and Welling (1982)
thyrotropin and subunits        Bakerbond C18           n-octadecyl       10 mM NH4HCO3, 0-50% CH3CN                  Stanton and Hearn (1987)

Reprinted from Hearn (1998).
FIGURE 5.2. Effect of varying organic solvent concentrations on protein retention by a reversed-phase
support (Ultrapore RPSC C3) (75 x 4.6-mm inner diameter). Retention volume is plotted against
% n-propanol (0-40%). (Open box) Ribonuclease; (open circle) cytochrome c; (open triangle) bovine
serum albumin. Protein standards (100 µg) of varying hydrophobicities (ribonuclease, cytochrome c,
and bovine serum albumin) were applied to the column. Data were
obtained under isocratic conditions (0.1% aqueous TFA/n-propanol). A decreasing retention volume (k')
for each protein standard is observed as the concentration of organic modifier is increased. At n-propanol
concentration: A (12%), all proteins are retained; B (18%), ribonuclease elutes, others retained; C (22%),
cytochrome c elutes, bovine serum albumin is still retained; D (28%), all proteins elute. Flow rate is 1
ml/min, and temperature is 45°C. (Reproduced, with permission, from Simpson and Nice 1989.)


are monitored below 220 nm for optimal detection sensitivity of peptides (e.g., proteins and
peptides lacking the aromatic amino acids tryptophan, tyrosine, and phenylalanine can only
be detected by UV absorbance at wavelengths below 225 nm). Because acetonitrile has
a lower UV cutoff (190 nm) than other common solvents, it is used almost exclusively when
separating peptides.
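
The reasoning above, that a solvent is only useful if its UV cutoff lies below the detection wavelength, can be sketched as a simple lookup. The cutoff values are those listed in Table 5.5; the function name `usable_solvents` and the optional safety margin `margin_nm` are hypothetical and serve only as an illustration. A solvent that passes this check may still contribute baseline absorbance close to its cutoff.

```python
# UV cutoff values (nm) as listed in Table 5.5.
UV_CUTOFF_NM = {
    "acetonitrile": 190,
    "dioxane": 215,
    "ethanol": 210,
    "methanol": 205,
    "1-propanol": 210,
    "2-propanol": 210,
    "tetrahydrofuran": 215,
}


def usable_solvents(detection_nm: float, margin_nm: float = 0.0) -> list[str]:
    """Return solvents whose UV cutoff (plus an optional margin) does not
    exceed the detection wavelength, sorted by name."""
    return sorted(
        name
        for name, cutoff in UV_CUTOFF_NM.items()
        if cutoff + margin_nm <= detection_nm
    )


print(usable_solvents(200))  # ['acetonitrile']
print(usable_solvents(214))
```

At a detection wavelength of 200 nm, only acetonitrile remains in the list, which is consistent with its near-exclusive use for peptide separations.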

Minor selective effects can sometimes be obtained with different organic solvents, and in 
some cases, a combination of different solvents has been used to great effect (Mahoney and 
Hermodson 1980). Additional optimization of protein and peptide separations can be 
accomplished by careful manipulation of the flow rate and gradient steepness (Stadalius et al. 
1984; Glajch et al. 1986). 


Mobile-phase pH 


Conventional silica-based reversed-phase packings suffer from two disadvantages: First, they 
have a limited useable pH range, typically pH 2-8. Below pH 2, the bonded phase is suscep- 
tible to hydrolysis. Above pH 8, hydroxide ions can attack and dissolve the silica, which caus- 
es the collapse of the packed bed (resulting in decreased column lifetime) and a catastrophic 
loss of efficiency. Second, basic analytes interact strongly with residual silanols and cause tail- 
ing peaks that are detrimental to resolution as well as to the accuracy and precision of quan- 
titation. 

Reversed-phase separations are thus most often performed at low pH values, generally 
between pH 2 and 4. Further advantages of using a low pH include good sample solubility, 
TABLE 5.4. Effect of organic modifiers on solute retention by short alkyl chain (C3) packings

                        NPA     IPA     THF     CH3CN   Diox    MeEtOH  EG
Trp                     3.8     3.4     4.0     4.3     3.9     3.6     3.6
ACTH                    13.4    14.5    15.9    20.5    18.1    25.2    52
Ribonuclease            17.0    21.3    22.3    30.0    34.0    n.d.    n.d.
Lysozyme                23.4    29.4    32.8    40.0    50.0    68.0    114.0
Prol (rat prolactin)    32.5    44.5    43.0    59.6    69.5    22.0    n.d.

Reprinted from Simpson and Nice (1989).

Data were obtained with a 75 x 4.6-mm I.D. column packed with Ultrapore RPSC using a linear 1%/min gradient between
0.15 M NaCl (pH 2.1) and various organic modifiers. (NPA) 1-Propanol (n-propanol); (IPA) 2-propanol (isopropanol); (THF)
tetrahydrofuran; (CH3CN) acetonitrile; (Diox) dioxane; (MeEtOH) methoxyethanol; (EG) ethyleneglycol. Values are retention
times (minutes). n.d. indicates not determined.


ion suppression of both acidic side chains on the sample and residual silanol groups on the
silica support. Commonly used acids in RP-HPLC include trifluoroacetic acid (TFA),
heptafluorobutyric acid, and ortho-phosphoric acid in the concentration range of 0.05-0.1%
(w/v) or 50-100 mM. Unbuffered 1% (w/v) NaCl is a useful mobile phase for high-sensitivity
RP-HPLC (Nice and O'Hare 1978) because NaCl is transparent to UV below 220 nm, it is
biologically compatible, and it often provides useful alternative selectivities when compared
to TFA. For high-sensitivity work, NaCl is considerably cleaner than phosphate in terms of
background contaminant peaks. Mobile phases containing ammonium acetate (pH 6-7),
ammonium bicarbonate (pH 7), or phosphate salts (e.g., triethylammonium phosphate, pH
6) are suitable for use at pH values closer to neutrality. Note that phosphate buffers and NaCl
are not volatile and care must be taken to ensure that the concentration of organic modifier
does not exceed 60%; otherwise, these salts will precipitate and block the column/flow cell.
Although volatile buffer components can be removed from the eluted sample by centrifugal
lyophilization along with the organic component, this can be a risky step when dealing with
low-microgram amounts of protein/peptide because severe losses are commonly encountered
due to irreversible binding of the sample to the sides of glass/polypropylene tubes. Such
losses can be avoided by the addition of a nonionic detergent (e.g., Tween-20, Pierce
Surfact-Amps 20 grade) to a final concentration of 0.01-0.02% (w/v) prior to the evaporation
step (Simpson et al. 1989a). If nonvolatile salts are used in the mobile phase, they must
be separated from the recovered sample by an additional desalting step such as RP-HPLC
using a volatile salt (e.g., TFA) (see below and Protocol 5) or micro-size exclusion
chromatography.


TABLE 5.5. Solvents used in RP-HPLC

                            Polarity  Boiling     UV cutoff  Viscosity
Solvent                     index     point (°C)  (nm)       (cPoise at 20°C)  Comments
Acetonitrile                5.8       82          190        0.36              more powerful denaturant than alcohols; toxic
Dioxane                     4.8       101         215        1.54
Ethanol                     5.2       78          210        1.20
Methanol                    5.1       65          205        0.60
1-propanol (n-propanol)     4.0       98          210        2.26              viscous
2-propanol (isopropanol)    3.9       82          210        2.30              viscous
Tetrahydrofuran             4.0       65          215        0.55
Water                       9.0       100         <190       1.00
WHY TFA?
TFA is the most commonly used ion-pairing agent in RP-HPLC because

* it is volatile and easily removed by lyophilization,
* it has low absorption within detection wavelengths for peptides and proteins, and
* it has a proven history.

TFA has an absorbance in water different from that in acetonitrile, and thus the TFA
concentration in Buffer B must be made 85-90% of the concentration in Buffer A to avoid a
baseline shift during gradient formation. TFA buffer should be fresh, the solvent reservoir
should be cleaned occasionally, and the inlet filter replaced periodically.
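
The 85-90% rule in the panel above amounts to a one-line calculation. The sketch below is only an illustration of that arithmetic; the function name `buffer_b_tfa_range` is hypothetical, and the 0.1% starting concentration is an example value rather than a prescribed one.

```python
def buffer_b_tfa_range(tfa_in_a_percent: float,
                       low: float = 0.85,
                       high: float = 0.90) -> tuple[float, float]:
    """Return the (minimum, maximum) TFA concentration for Buffer B,
    taken as 85-90% of the TFA concentration in Buffer A."""
    if tfa_in_a_percent <= 0:
        raise ValueError("TFA concentration must be positive")
    return tfa_in_a_percent * low, tfa_in_a_percent * high


# Example: Buffer A contains 0.1% (v/v) TFA.
lo, hi = buffer_b_tfa_range(0.1)
print(f"Buffer B TFA: {lo:.3f}-{hi:.3f}% (v/v)")  # Buffer B TFA: 0.085-0.090% (v/v)
```

In practice, the exact ratio is best confirmed with a blank gradient run, adjusting the Buffer B concentration until the baseline is flat.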

Because of the problems and limitations associated with conventional silica-based
columns (e.g., degradation at high pH values), polymer-based columns have been gaining
popularity. Although organic polymer packings are generally stable from pH 1 to pH 14,
they yield significantly lower efficiencies than silica-based packings and are not as
mechanically strong. Many organic polymer packings also shrink or swell when exposed to
different solvents. More recently, a number of silica-based bonded phases (organic-inorganic
hybrids) with a long column life have been developed (e.g., Zorbax Extend-C18,
Phenomenex Synergi 4 MAX-RP; for further details, see Tables 5.2 and 5.3) that are stable
at high pH (up to pH 11.5, thereby allowing chromatography with ammonium hydroxide
and pyrrolidone buffers). An alternative, non-silica-based stationary phase, which can
be used in the pH range of 3-14, especially at high pH values, is the zirconium-based
packings manufactured by ZirChrom (see Table 5.2).


Ion-pairing agents


The third component usually added to the mobile phase is an ion-pairing agent (Horvath et
al. 1977b; Hancock et al. 1978a; Mant and Hodges 1991a). The addition of millimolar
concentrations of ion-pairing agents has the effect of decreasing the retention time of a protein
sample and improving peak shapes. This effect can be attributed to the ion-pairing agent
binding by ionic interaction to the protein/peptide molecule and changing selectivity. It is
thought that ion-pairing agents suppress the interaction between polar groups on the sample
and silanol groups on the reversed-phase matrix (Hancock and Sparrow 1983; Bennett
1991). At any given pH, proteins and peptides will exhibit ionized amino acid side chains that
are available for ion pairing. For example, over the pH range of 2-8 (the pH range over which
most silica-based columns are used), the basic amino acids (arginine and lysine) are fully
ionized, and these positively charged side chains (along with the primary amino terminus)
associate (or "ion-pair") with anions. Thus, hydrophobic anions such as TFA and alkylsulfonates
form hydrophobic ion pairs with the basic amino acid chains and, in so doing, tend to
increase the affinity of the protein or peptide for the column (thereby resulting in increased
retention times) (for a detailed discussion of ion pairing, see Bennett 1991). The retention
behavior of the sample components may be affected by both the type and concentration of
the ion-pairing agent used (see Figure 5.3).

Useful hydrophobic ion-pairing agents include pentafluoropropionic acid (PFPA) and
heptafluorobutyric acid (HFBA) (for a list of commonly used ion-pairing agents, see Table
5.6). HFBA has been used extensively as an ion-pairing agent under circumstances where the
resolving power of TFA to separate a peptide mixture has been inadequate (Bennett 1983).
Apart from its effectiveness as an ion-pairing agent, it shares with TFA the advantages of
volatility and low UV absorbance (see Table 5.5).
FIGURE 5.3. Comparison of the effectiveness of TFA, pentafluoropropionic acid (PFPA), and
heptafluorobutyric acid (HFBA) as hydrophobic ion-pairing reagents in reversed-phase chromatography (RPC) of
seven natural and synthetic peptides. Peptides were eluted from the Waters Associates µBondapak C18
column (300 x 3.9-mm I.D.) with a linear AB gradient of 20-40% acetonitrile containing 0.01 M
concentrations of each acid during 1 hour (0.33% acetonitrile/min) at a flow rate of 1.5 ml/min. (Top) Elution
behavior of the peptides with 0.01 M (0.13% v/v) HFBA. (Middle) Behavior with 0.01 M (0.10% v/v) PFPA.
(Bottom) Behavior with 0.01 M (0.07% v/v) TFA. Samples (2 µg) of the following peptides were injected
onto the column (number of basic residues at pH 2, including histidine, is shown in parentheses): 1,
methionine enkephalin (1); 2, ACTH1-24 (9); 3, α-MSH (3); 4, human ACTH1-39; 5, somatostatin (3); 6, bovine
insulin (6); 7, human calcitonin (3). (Reproduced, with permission, from Bennett 1991.)


TABLE 5.6. Ion-pairing agents

Ion-pairing agent                     Formula of pairing ion      Comments

Anionic
Trifluoroacetic acid (TFA)            CF3COO-                     low UV absorbance; volatile, low pH
Pentafluoropropionic acid (PFPA)      CF3CF2COO-                  more hydrophobic than TFA; volatile, low pH
Heptafluorobutyric acid (HFBA)        CF3CF2CF2COO-               more hydrophobic than TFA; volatile, low pH
Ammonium acetate                      CH3COO-
Phosphoric acid                       H2PO4-, HPO4(2-), PO4(3-)   less hydrophobic than TFA

Cationic
Tetramethylammonium chloride          +N(CH3)4
Tetrabutylammonium chloride           +N(C4H9)4
Triethylamine                         NH+(C2H5)3

Reprinted from Amersham Biosciences (2002).

Reprinted from Amersham Biosciences (2002). 
It is apparent from Figure 5.3, which shows a comparison of a standard mixture of peptides
in solvent systems containing 0.01 M TFA, PFPA, or HFBA, that retention times (also relative
retention times or order of peptide elution) for the various peptides are different in each
of the solvent systems. In progressing from HFBA to TFA, some peptides increase in retention
more than others (e.g., ACTH1-24 and ACTH1-39); this correlates well with the high
basic amino acid content of these peptides relative to the others.

Although orthophosphoric acid (H3PO4) is nonvolatile, it has proved to be a useful
ion-pairing agent for hydrophobic proteins and peptides (Hancock et al. 1978a,b). Because
proteins elute from reversed-phase columns at significantly decreased concentrations of organic
solvent when H3PO4 is used as an ion-pairing agent in the mobile phase, the possibility of
protein denaturation or precipitation is greatly diminished. Amine phosphates such as
triethylammonium phosphate (TEAP) have also enjoyed great success as ionic modifiers. This
can be attributed to a variety of effects, namely, the relatively high salt concentrations used
(typically 0.1-0.2 M) combined with the use of both polar anions and cations (which can
associate with ionic groups present on the sample); these act to prevent unfavorable
interactions between the sample and the silanol groups on the column matrix (Hancock and
Sparrow 1983). However, with the marked improvements in bonded-phase chemistry and
more effective end-capping procedures for column packings, the chromatographic problems
associated with free silanol groups on column matrices (and the usefulness of TEAP as an
ion-pairing agent) have decreased significantly.

As with other mobile-phase constituents, ion-pairing agents must be sufficiently pure,
have low UV absorbance (<220 nm), and be soluble under the low-polarity conditions
encountered.


MICROCOLUMN REVERSED-PHASE CHROMATOGRAPHY IS FAVORED 
FOR STUDIES OF PROTEOMES 


Microcolumn reversed-phase chromatography is one of the most commonly used techniques
for preparing samples for classical Edman degradation protein microsequencing and the
more recent proteomic methods for protein identification and characterization. Depending
on the internal diameter (I.D.) of the columns, microcolumn chromatography can be
arbitrarily subdivided into microbore (≤2.1 mm I.D.), narrowbore (≤1.0 mm I.D.), or capillary
(≤0.5 mm I.D.) column chromatography.

Microbore column chromatography, first proposed in the late 1970s (Ishii et al. 1977;
Scott and Kucera 1979a,b), was further developed in the 1980s for preparing samples for
gas-phase sequence analysis (Nice et al. 1984, 1985; Simpson and Nice 1987; Simpson et al.
1989a). The success of microbore column chromatography is largely attributable to the
following:

* Concentrated samples, i.e., recovery of samples in low peak volumes of ~20-60 µl compared
  to 500-1500 µl typically found for their large-diameter (4.6 mm I.D.) conventional
  column counterparts.

* Enhanced sensitivity of protein detection (5-10 ng) (Nice et al. 1984). This represents a
  5-fold increase in sensitivity (20-fold for 1-mm I.D. columns) when compared with 4.6-mm
  I.D. columns.

* Rapid sample loading capability. The low back pressures encountered with short columns
  (≤10 cm in length) permit high flow rates for column reequilibration and sample loading.
FIGURE 5.4. Principles of microbore column chromatography: The relationship between column
diameter and eluent peak concentration. It has been demonstrated that the eluent peak volume from a
chromatographic column is inversely proportional to the square of the column diameter and directly
proportional to the column length (Scott and Kucera 1979b). In the example shown, if the column diameter is
reduced from 4.6 mm to 1.5 mm, the flow rate must be reduced by (4.6/1.5)^2, i.e., 9.4-fold (1.0 ml/min to
~0.1 ml/min) to achieve the same linear flow velocity. For concentration-dependent detectors, this will
result in an equivalent increase in detector signal, i.e., a 10-fold increase for the example given. (Reproduced,
with permission, from Simpson and Nice 1987.)


Principles of Microbore Column Chromatography 


To operate microbore columns at linear flow velocities equivalent to those used with
wider-bore columns, the flow rate must be decreased in a manner proportional to the square of the
reduction in column diameter (Figure 5.4). For example, if the optimal flow rate for a
4.6-mm I.D. column is 1 ml/min, then a 1.5-mm I.D. column would be operated at ~100 µl/min
to achieve the same linear flow velocity. If the column efficiencies are maintained (i.e., peak
bandwidths remain constant for a given load of protein), which should be true if a column
is not overloaded, then proteins will elute from microbore columns in proportionally
decreased peak volumes compared with conventional columns (i.e., smaller peak volumes at
increased concentrations) (Scott and Kucera 1979a,b; Kucera 1980). The practical
relationship between the internal diameter of an HPLC column, protein mass, and the sensitivity of
detection is shown in Figure 5.5, which compares the chromatographic performance of
α-lactalbumin on a series of columns of the same length (10 cm) and packed with the same
support (Brownlee RP-300), but with varying internal diameters (1.0, 2.1, and 4.6 mm I.D.).
When compared with a conventional 4.6-mm I.D. column, it is evident that a 20-fold
increase in sensitivity of protein detection is achieved by using the 1-mm I.D. column, and
a 4-fold increase is achieved when using a 2.1-mm I.D. column. All columns were operated at
equivalent linear flow velocities as demonstrated by the constant peak bandwidth as a
function of time. Peak recovery volumes for the 1-, 2.1-, and 4.6-mm I.D. columns were 25, 100,
and 450 µl, respectively.
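
The scaling rule described above (flow rate proportional to the square of the column internal diameter at constant linear velocity) can be sketched as follows. The function names `scaled_flow_rate` and `concentration_gain` are hypothetical, and the calculation is an idealized one that assumes identical column length, packing, and efficiency; measured gains, such as those quoted above, will differ somewhat from these theoretical ratios.

```python
def scaled_flow_rate(flow_ref_ml_min: float, id_ref_mm: float, id_new_mm: float) -> float:
    """Flow rate (ml/min) giving the same linear velocity on a column of
    internal diameter id_new_mm as flow_ref_ml_min gives on id_ref_mm."""
    return flow_ref_ml_min * (id_new_mm / id_ref_mm) ** 2


def concentration_gain(id_ref_mm: float, id_new_mm: float) -> float:
    """Idealized fold increase in peak concentration when moving from a
    column of id_ref_mm to one of id_new_mm at equal linear velocity."""
    return (id_ref_mm / id_new_mm) ** 2


# Reference: 4.6-mm I.D. column operated at 1.0 ml/min.
for diameter in (2.1, 1.5, 1.0):
    flow_ul = scaled_flow_rate(1.0, 4.6, diameter) * 1000.0
    gain = concentration_gain(4.6, diameter)
    print(f"{diameter:.1f} mm I.D.: {flow_ul:.0f} ul/min, ~{gain:.1f}-fold gain")
```

For the 1.5-mm I.D. column, the sketch returns roughly 106 µl/min and a 9.4-fold gain, in line with the ~100 µl/min and 9.4-fold figures given for Figure 5.4.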
FIGURE 5.5. Effect of column internal diameter and flow rate on detector sensitivity. Chromatographic
conditions: column, Brownlee RP-300 (30 nm) 10-µm dimethyloctyl silica; linear 60-minute gradient from
0.15% (v/v) aqueous TFA to 60% acetonitrile/40% H2O containing 0.12% (v/v) TFA. The column
temperature is 45°C. The sample is bovine α-lactalbumin. (a) Column, 100 x 1-mm I.D.; flow, 50 µl/min; load, 1
µg; peak volume, 25 µl. (b) Column, 100 x 2.1-mm I.D.; flow, 200 µl/min; load, 4 µg; peak volume, 100 µl.
(c) Column, 100 x 4.6-mm I.D.; flow, 1 ml/min; load, 20 µg; peak volume, 450 µl. (Reprinted, with
permission, from Simpson and Nice 1987.)


Sample Capacity and Resolution of Microbore Columns 


In general, the total amount of protein that can be loaded onto a column (loadability) is a
function of the chromatographic support, the dimensions of the column, and the inherent
nature of the protein itself. For example, the loadability of a microbore column (e.g., a 30 x 2.1-mm
I.D. column packed with a 30-nm reversed-phase [C8] support) can be as high as 4 mg
using both lysozyme and cytochrome c as standard proteins (Nice et al. 1984). However, the
recovery volumes under these overload conditions are typically in excess of 1 ml because the
optimal loading has been greatly exceeded. The following are the optimal protein loadings for
columns of varying internal diameters, determined by peak volumes (see Table 5.7):


• 2.5-5 µg (1-mm I.D. column)
• 10-20 µg (2.1-mm I.D. column)
• 30-100 µg (4.6-mm I.D. column)


For 2.1-mm I.D. columns of varying lengths (3 cm and 10 cm), similar peak volumes are
obtained under ideal operating conditions regardless of column length. However, band
broadening (column overload) occurs more rapidly for the shorter column. To achieve optimal
performance, 1.0-mm and 2.1-mm I.D. columns should be operated at flow rates of 50
and 100 µl/min, respectively. Under these conditions, sample recoveries in excess of 95% are
routinely achieved. Figure 5.6 illustrates the ability of microbore columns to resolve a
low-nanogram mixture of protein standards, the resolution being identical to that obtained on
conventional columns (4.6 mm I.D.) packed with the same support. Although steeper gradient
rates (e.g., 20% per minute) cause some loss of resolution, samples can be recovered from
the column in volumes less than 20 µl.
TABLE 5.7. Effect of protein load on peak bandwidth for columns of varying internal diameters

                       Peak volume (µl) for column dimensions
                       (mm I.D. x cm length)
Protein load (µg)      1.0 x 10     2.1 x 3      2.1 x 10     4.6 x 10
0.5                    25           -            -            -
1.0                    25           100          100          -
2.5                    -            -            -            -
5.0                    30           100          100          450
10.0                   35           120          120          450
20                     50           180          140          -
30                     -            -            -            -
50                     70           390          190          -
100                    -            470          240          600
500                    -            -            -            1200

Reprinted from Simpson and Nice (1989a).

Support: Brownlee RP-300 (column length 10 cm). Protein standard: bovine α-lactalbumin. Gradient elution (0.15% TFA to
60% CH3CN/0.12% TFA over 60 minutes) was used at equivalent linear flow velocities: 50 µl/min, 200 µl/min, and 1.0 ml/min
for the 1.0-mm, 2.1-mm, and 4.6-mm I.D. columns, respectively.
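
To make the link between peak volume and detection sensitivity concrete, the sketch below computes the average protein concentration in a collected peak (load divided by peak volume) for a few rows transcribed from Table 5.7. The dictionary layout and the function name `peak_concentration` are assumptions made for illustration; only a subset of the table is included.

```python
# Selected peak volumes (ul) transcribed from Table 5.7 for the 10-cm columns.
# Outer key: column dimensions (mm I.D. x cm length); inner key: protein load (ug).
PEAK_VOLUME_UL = {
    "1.0 x 10": {1.0: 25, 5.0: 30, 20.0: 50, 50.0: 70},
    "2.1 x 10": {1.0: 100, 5.0: 100, 20.0: 140, 50.0: 190, 100.0: 240},
    "4.6 x 10": {5.0: 450, 100.0: 600, 500.0: 1200},
}


def peak_concentration(column, load_ug):
    """Average protein concentration (ug/ul) in the recovered peak."""
    return load_ug / PEAK_VOLUME_UL[column][load_ug]


if __name__ == "__main__":
    # Same 5-ug load on each column: smaller I.D. gives a more concentrated peak.
    for column in PEAK_VOLUME_UL:
        conc = peak_concentration(column, 5.0)
        print(f"{column}: {conc:.3f} ug/ul")
```

With these table values, the same 5-µg load is recovered about 15 times more concentrated from the 1.0-mm I.D. column than from the 4.6-mm I.D. column.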


Trace Enrichment (Concentration) and Buffer Exchange of Samples Using RP-HPLC 


During the early phase of their development, microbore columns were restricted to sample
injection volumes of ~1 µl to avoid extra-column band broadening. However, this historical practice
has not limited the practical use of microbore columns for protein purification involving
large sample volumes. This is because proteins exhibit very large capacity factors, k', at solvent
compositions below the critical secondary solvent composition on reversed-phase supports (see Figure
5.2). Hence, it is possible to trace-enrich large volumes containing nanogram amounts of
protein directly onto the microbore column (Nice et al. 1979). Proteins can then be recovered
FIGURE 5.6. Separation of protein standards on a Brownlee RP-300 column (50 x 1.0-mm I.D.).
Chromatographic conditions: linear 60-minute gradient from 0.15% aqueous TFA to 60% acetonitrile/40%
H2O containing 0.12% (v/v) TFA. The column temperature is 45°C, and the flow rate is 50 µl/min. Protein
standards: (A) murine EGF; (B) glucagon; (C) cytochrome c; (D) lysozyme; (E) bovine α-lactalbumin. The
sample load is 25 ng per protein. (Reproduced, with permission, from Simpson and Nice 1987.)
FIGURE 5.7. Desalting of S-carboxymethyl P40 (IL-9) using microbore column RP-HPLC. Sample: Cm-P40
in 50 µl of 6 M Gdn.HCl, DTT, IAA. Column: Brownlee RP-300, 30 x 2.1-mm I.D. Flow rate: load, 2.0 ml/min;
gradient, 100 µl/min. (Reproduced, with permission, from Simpson et al. 1989b.)


from the column by gradient elution (with increasing concentrations of organic solvent) or
stepwise elution in small peak volumes (typically, 25-80 µl). Because column back pressure is
directly proportional to column length, a practical advantage of short columns (<10 cm
length) is that rapid sample loading can be accomplished by using high flow rates (~2 ml/min)
without exceeding instrument pressure limits. For example, in Figure 5.6, the protein standards
(~1 ml total) were loaded via a 2-ml injection loop at a flow rate of 2 ml/min prior to
switching the loop "off-line" and initiating the gradient at a flow rate of 50 µl/min. Because
the resolving capability of short columns (<10 cm length) is comparable to that of long
(>25 cm) columns, and short columns permit the rapid loading of large sample volumes
without exceeding instrument pressure limitations, the use of short microbore HPLC "guard"
columns (30 x 2.1-mm I.D.) has gained popularity for the preparation of samples for protein
microsequencing (Nice et al. 1985). A practical application of trace enrichment (and concomitant
buffer exchange) of a low-abundance cytokine (20 µg purified from 1 x 10" cells) is
given for the ~35-40K Mr glycoprotein P40 (see Figure 5.7) (Uyttenhove et al. 1988; Simpson
et al. 1989b). A procedure for desalting samples using RP-HPLC is given in Protocol 5.
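
The practical benefit of loading at a high flow rate can be illustrated with a small calculation. The sketch below is a hedged illustration, not part of the original procedure: the function names are assumptions, and the back-pressure comparison relies only on the proportionality to column length stated above (same packing, internal diameter, and flow rate assumed).

```python
def loading_time_min(sample_volume_ml, flow_ml_min):
    """Time (minutes) needed to pump a sample volume onto the column."""
    return sample_volume_ml / flow_ml_min


def relative_back_pressure(length_mm, ref_length_mm):
    """Back pressure relative to a reference column of the same packing,
    internal diameter, and flow rate (pressure proportional to length)."""
    return length_mm / ref_length_mm


if __name__ == "__main__":
    # 1 ml of sample loaded at 2 ml/min versus at the 50 ul/min gradient flow.
    print("Load at 2 ml/min:  ", loading_time_min(1.0, 2.0), "min")
    print("Load at 0.05 ml/min:", loading_time_min(1.0, 0.05), "min")
    # A 30-mm guard column compared with a 100-mm column.
    print("Relative back pressure:", relative_back_pressure(30.0, 100.0))
```

Loading 1 ml at 2 ml/min takes 0.5 minute, whereas the same volume pumped at the 50 µl/min gradient flow rate would take 20 minutes.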

In the example shown in Figure 5.7, 20 µg of S-carboxymethyl (Cm) interleukin-9 (IL-9)
in ~300 µl of 7.5 M GdnHCl, 0.2 M Tris-HCl buffer (pH 5.5), 2 mM EDTA, 0.02% Tween-20,
15 mM dithiothreitol (DTT), and 50 mM iodoacetate was concentrated onto a 30 x 2.1-mm
I.D. reversed-phase guard column and simultaneously exchanged into a volatile solvent
(0.125% v/v TFA) suitable for enzymatic digestion and subsequent sequence analysis of the
generated peptides. It is good practice to perform a pilot desalting step (e.g., ~500 ng of
material; see Figure 5.7A) before committing the bulk of the sample (~20 µg) to desalting (Figure
FIGURE 5.8. Repetitive recovery of GM-CSF from a short microbore reversed-phase column. Murine
GM-CSF (300 ng) was chromatographed on a Brownlee RP-300 column (30 x 2.1-mm I.D.) at a flow rate of 100
µl/min using the chromatographic conditions described in Figure 5.7. The peak fraction was recovered
manually, diluted 1:1 with primary solvent (in the injection syringe), and reinjected. Sample recovery was
calculated from peak height measurement. (Reproduced, with permission, from Simpson and Nice 1984.)


5.7B), because some native proteins, upon S-carboxymethylation, are difficult to recover by
RP-HPLC operated under the same chromatographic conditions as the native material (e.g.,
granulocyte-colony-stimulating factor [G-CSF]).


Sample Recovery 


The recovery of proteins from reversed-phase supports is very much dependent on the nature
of the support itself, rather than the dimensions of the column. The excellent recoveries
(≥90%) observed with microbore columns are comparable to those obtained with conventional
4.6-mm I.D. columns packed with the same material. A typical total system recovery
(80%) from repetitive reinjection of a low-Mr growth factor (300 ng of murine
granulocyte-macrophage-colony-stimulating factor [GM-CSF], ~14K Mr) from a microbore column is
shown in Figure 5.8. This recovery value was based on manual collections, dilution of
secondary solvent, and subsequent reinjection (four times).
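
As a rough way of interpreting a cumulative recovery figure, the sketch below estimates the average recovery per reinjection cycle. It assumes, purely for illustration, that the 80% total refers to four cycles and that each cycle loses the same fraction of material; the function name `per_cycle_recovery` is hypothetical and the result is not a value reported in the source.

```python
def per_cycle_recovery(total_recovery, n_cycles):
    """Average fractional recovery per cycle, assuming every cycle
    (collection, dilution, reinjection) recovers the same fraction."""
    if not 0.0 < total_recovery <= 1.0:
        raise ValueError("total_recovery must be a fraction between 0 and 1")
    if n_cycles < 1:
        raise ValueError("n_cycles must be at least 1")
    return total_recovery ** (1.0 / n_cycles)


if __name__ == "__main__":
    per_cycle = per_cycle_recovery(0.80, 4)
    print(f"Average recovery per cycle: {per_cycle:.1%}")
```

Under these assumptions, an 80% total recovery over four cycles corresponds to roughly 95% recovery per cycle.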


WWW Resources 
Tutorials: RP-HPLC for Proteomics 


• Vydac, The handbook of analysis and purification of peptides and proteins by
reversed-phase HPLC (http://www.vydac.com/vydacpubs/brindex.html)


• Analytical Spectroscopy Research Group, University of Kentucky, High-performance
liquid chromatography (HPLC): A user's guide
(http://kerouac.pharm.uky.edu/ASRG/HPLC/hplcmytry.html)


• IonSource, Reversed phase HPLC basics for LC/MS (Andrew Guzetta)
(http://www.ionsource.com/tutorial/chromatography/rphplc.htm)


PROTOCOL 1 




Standard Chromatographic Conditions for 
RP-HPLC of Proteins* 


BEFORE ANALYZING A MIXTURE OF PROTEINS by RP-HPLC, it is extremely important to evaluate
the system, both HPLC hardware and column, before it is applied to a biological problem.
This is especially true when a new column enters the laboratory and applies to both begin- 
ners and more experienced chromatographers, who wish to obtain reproducible peptide 
maps. In this protocol, only nonspecialized columns, mobile phases, and instrumentation 
readily available and easily operated are described. For HPLC troubleshooting, see the 
Further Reading suggestions at the end of this chapter and www.proteinsandproteomics.org. 

Before using a new column, the manufacturer’s instructions and recommendations for 
proper conditioning, use, and storage of the column must be read. Often, subtle operating 
conditions that are crucial for correct column usage are overlooked, resulting not only in irre- 
versible column damage, but also in the possible loss of an important sample. Once the oper- 
ating procedures and column limitations (e.g., operating pressure and stability with respect to 
pH) are understood, a test separation should be performed using a set of standards (e.g., peptide
or protein mixture) that have been designed to evaluate column performance. For this
exercise, it is essential to reproduce the standard set of operating conditions described in this 
protocol before proceeding with an analysis of the test sample (see Figure 5.9). If an equiva- 
lent elution profile cannot be obtained on the new column (see below) using these standards 
and standard operating conditions, then the new column should be considered suspect and 
discussed with the manufacturer. (Column separations that have been carried out with a rec- 
ognized set of standards are more easily returned to a manufacturer than simply telling the 
manufacturer that their column “cannot separate" an ill-defined complex mixture of interest.) 
It is important that these standard operating conditions be used on a regular basis (and carefully
logged) to monitor column performance and the lifetime of the column. Because stationary
phases differ widely from one manufacturer to another (see Table 5.1) in terms of their
physical characteristics, the functional ligand used, ligand density, base support (matrix), par- 
ticle size, and pore size, as well as column dimensions, standard protein/peptide separations 
are a useful diagnostic of these differences (and can be exploited in a purification strategy). If 
samples are to be collected for further characterization, careful attention should be given to 
the "dead volume" of the plumbing between the flow cell and outlet (see below). 


TIPS FOR PREPARING AN HPLC

• Read the manufacturer's instructions for correct column usage.

• Ensure that the HPLC is in good working condition by running a blank (i.e., no column installed
or sample injected).

• Evaluate the chromatographic system by running a set of "known standards."


*This protocol was contributed by Robert L. Moritz, Joint ProteomicS Laboratory (JPSL), Ludwig Institute of
Cancer Research/Walter and Eliza Hall Institute for Medical Research, Melbourne, Australia.


MATERIALS 
FIGURE 5.9. Standard chromatographic conditions for RP-HPLC separation of protein standards.
Chromatographic conditions: Agilent 1100 Binary HPLC with autosampler; Column: Agilent C18 75 x 2.1-mm
I.D.; Buffer A: 0.1% TFA; Buffer B: 0.09% TFA/60% acetonitrile; Temperature: 40°C; Flow rate: 0.1
ml/min; Gradient: 0-100% B in 60 minutes; Sample: 2 µg each of ribonuclease B, cytochrome c, lysozyme,
BSA, carbonic anhydrase, myoglobin, and ovalbumin. The column volume is ~90 µl. Column equilibration
and reequilibration, 20 column volumes.


Reagents


CAUTION: See Appendix 3 for appropriate handling of materials marked with <!>. 


Acetonitrile (HPLC grade) (J.T. Baker, Phillipsburg, New Jersey; Mallinckrodt; or equiva- 
lent) <!> 

H2O (deionized, HPLC grade) (18 megaΩ, Milli-Q, Millipore, or equivalent)

Solvent A (14 mM) (aqueous 0.1% v/v TFA) <!>
Add 1.0 ml of neat TFA (using a Gilson pipette, or equivalent) to 1 liter of H2O in a glass-stoppered
measuring cylinder, mix thoroughly, and then pour into a clean liquid chromatograph/HPLC
reservoir bottle (extensively washed with Milli-Q H2O).


EXPERIMENTAL TIP: It is good practice to set aside a 1-liter cylinder to be used only for mixing
TFA and H2O.


Solvent B (60% v/v acetonitrile/40% H2O containing 0.1% v/v TFA)
Add 600 ml of acetonitrile to a 1-liter glass-stoppered cylinder and adjust to 1 liter with H2O. Add 1
ml of neat TFA (final concentration 0.1% v/v TFA) and mix the solution thoroughly.


Trifluoroacetic acid (TFA) (HPLC grade) (Pierce, Rockford, Illinois, or equivalent) 
SOLVENT PREPARATION


• If both Solvent A and Solvent B are 0.1% (v/v) TFA, then the absorbance at 214 nm of
Solvent B will be slightly higher (20-40 milli-absorbance units [mAU]) than that of
Solvent A, due to the contributing absorbance of acetonitrile. This will result in a rising
baseline during the development of a gradient from 0% to 60% acetonitrile. Although
this does not present a problem when working with 50-100-µg amounts of material
(using a 4.6-mm I.D. column), the rising baseline can become a serious problem when
working at low-microgram levels using microbore columns (≤2.1 mm I.D.): the
baseline will go off-scale, and late-eluting peaks cannot be detected without adjusting the
baseline, which is extremely difficult to accomplish during the course of a
chromatographic run. To overcome this potential problem, it is recommended that
the amount of TFA in Solvent B be slightly less than that in Solvent A (e.g., 0.09%, or 900
µl of TFA/liter, compared with 0.1%, or 1000 µl of TFA/liter). By these means, the amount
of TFA can be adjusted carefully by trial and error to obtain a flat baseline. For
high-sensitivity work (submicrogram levels of peptide/protein), where the expected
peak heights are in the 10-20-mAU range on a 2.1-mm I.D. column (1 µg = 100 mAU and 100
ng = 10 mAU on a 2.1-mm I.D. column), 0.085% or 0.086% (v/v) TFA in Solvent B may
be needed.


• If solvents are not degassed, then bubble formation can occur during a chromatographic
run as a result of out-gassing. Such bubbles can lodge in the flow cell, resulting in an
erratic detector signal (i.e., baseline). To minimize this risk, Solvent B should be a
"percentage buffer," i.e., one containing H2O (e.g., 60% acetonitrile/40% H2O containing 0.1% TFA).
Additionally, solvents should be degassed every 1-2 days with high-purity helium or
constant vacuum over a period of ~30 minutes.


• With alcohol-type solvents (e.g., methanol and ethanol), a slow buildup of esters occurs
at room temperature, resulting in an increased baseline. For this reason, solvents must be
prepared every 3 or 4 days.


• Solvents containing salts. Solvent systems such as unbuffered 1% (w/v) NaCl/acetonitrile
and 0.1 M phosphate (pH 7.4)/acetonitrile are commonly used in RP-HPLC peptide
separations when seeking selectivity other than that achieved with conventional 0.1% (v/v)
TFA/acetonitrile (for an example, see peptide mapping of IL-6 in Figure 7.8, Chapter 7).
When using salt-containing solvents, the percentage of organic solvent in Solvent B
should not exceed 50%; otherwise, there is a high risk that these salts will precipitate and
clog the HPLC tubing (typically, the 7/1000-inch and 10/1000-inch capillary tubing used
in modern HPLC systems), as well as the column. Such damage can be very expensive to
repair! To avoid possible precipitation of salts, it is recommended that salt-containing
solvents be premixed by adding equal volumes of organic solvent to the buffer salt
solution (e.g., 2% w/v aqueous NaCl or 0.2 M phosphate at pH 7.4) to give a final
concentration of 1% (w/v) NaCl/50% organic solvent or 0.1 M phosphate/50% organic solvent.
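
The arithmetic behind these solvent recipes can be sketched as follows. This is an illustrative helper only: the function names are assumptions, and the premix calculation treats volumes as additive, which is only approximately true for mixtures of water and organic solvent.

```python
def tfa_volume_ul(percent_v_v, final_volume_ml):
    """Volume of neat TFA (ul) needed for a given % (v/v) in final_volume_ml."""
    return percent_v_v / 100.0 * final_volume_ml * 1000.0


def premix_concentration(stock_conc, stock_volume, organic_volume):
    """Concentration of a salt or buffer stock after premixing with organic
    solvent, assuming the two volumes are additive."""
    return stock_conc * stock_volume / (stock_volume + organic_volume)


if __name__ == "__main__":
    # TFA needed per liter for Solvent A and for reduced-TFA Solvent B options.
    for pct in (0.1, 0.09, 0.085):
        print(f"{pct}% (v/v) TFA in 1 liter: {tfa_volume_ul(pct, 1000.0):.0f} ul")
    # Equal volumes of a 2% (w/v) NaCl stock and organic solvent.
    print("NaCl after premix:", premix_concentration(2.0, 500.0, 500.0), "% (w/v)")
```

The same premix function applies to a phosphate stock prepared at twice the target concentration and mixed with an equal volume of organic solvent.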
Equipment


HPLC chromatograph system
This system should be equipped with programmed gradient elution, UV detection at 210-220 nm,
acquisition of chromatographic data, and integration of peak areas. The system must be capable of
accurate and reproducible solvent delivery and gradient formation.

Reversed-phase column (100 Å or 300 Å pore size)
5-10-µm particle size packed into a column of 100 x 4.6-mm I.D. or cartridge of 100 x 2.1-mm I.D.
Commonly used columns in this laboratory are Brownlee RP-300 100 x 2.1 mm, 7-µm, 300 Å (C8)
(Alltech) and Vydac 214TPB 100 x 2.1 mm, 3-µm, 300 Å (C4) (Vydac, Hesperia, California) for samples
of 1-100 µg; for samples >100 µg, use the same support, but packed into 4.6-mm I.D. columns.


Biological Sample
Protein standards
hen egg lysozyme (L 6876, Sigma)
ribonuclease B (R 5750, Sigma)
cytochrome c
bovine serum albumin (A 3675, Sigma)
carbonic anhydrase (high-purity grade, C 3934, Sigma)
Dissolve the protein standards (1 mg of each protein/ml) in H2O (can be stored frozen for several
weeks). It is recommended that the 1 mg/ml solution of protein standards be stored in small
aliquots (e.g., 100 µl) and not freeze-thawed more than five times.


CAUTIONARY NOTES


• Routine for new column usage. Flush the HPLC system completely before attaching the
column. Prime both solvent lines with Solvents A and B, respectively, to ensure that there
are no bubbles in the system. Connect the solvent delivery tubing from the injector to
the top end of the column and the tubing from the column outlet to the flow cell. Pass
30 column volumes of Solvent B through the column followed by 30 volumes of Solvent
A. The column is now ready to be evaluated using the procedure described below in Step 1.


• Connection of columns to HPLC plumbing. Before connecting a column to the HPLC
plumbing, make sure that column end-fittings, ferrules, and tubing depths are correctly
matched. Incorrect use of column end-fittings can seriously damage a column, resulting
in solvent leakage and/or high dead-volumes (and hence poor chromatographic
performance). For a technical description and possible problems that ensue from incorrect
fittings, see the panel on COLUMN FITTINGS INTERCHANGEABILITY at the end of this protocol.
METHOD


1. Run a blank or "control chromatogram" (i.e., no sample loaded in Step 1). Typically, this
is the first chromatographic run of the day. The following are the recommended standard
chromatographic conditions for the RP-HPLC system evaluation:


• Linear gradient: From 0% to 100% B, where Solvent A is aqueous 0.1% (v/v) TFA, and Solvent
B is 0.09% (v/v) TFA in 60% aqueous acetonitrile.


• Gradient rate: 1% acetonitrile/min (i.e., 0-100% Solvent B in 60 minutes).
• Flow rate: 1 ml/min, 4.6-mm I.D. column (0.1 ml/min, 2.1-mm I.D. column).


• Temperature: 40-45°C.


                                                             Flow rate
                        Time         Mobile-phase        4.6-mm I.D.    2.1-mm I.D.
Step no.                (minutes)    composition         column         column
1 (sample injection)    0            100% Solvent A      1 ml/min       0.1 ml/min
2                       60           100% Solvent B      1 ml/min       0.1 ml/min
3 (re-equilibration)    -*           100% Solvent A      1 ml/min       0.1 ml/min


*The time taken to pass 20 column volumes of Solvent A through the column. Column volumes are
calculated by measuring column length and internal diameter and applying the equation
$V = \pi r^{2} h$, where V is the column volume, r is the internal radius of the column, and h is the
length of the column.


These conditions were used to produce the elution profile of the blank chromatogram
shown in Figure 5.9.

Before commencing the blank run, pump 20 column volumes of Solvent B through the column,
followed by an equal volume of Solvent A (ensure that the baseline is stabilized, i.e.,
remains flat). Then adjust the flow rate to an operating flow rate of 1 ml/min (or 0.1 ml/min
for a 2.1-mm I.D. column) and wait until the baseline stabilizes (~2 minutes).


2. Run protein standards (control run). In Step 1, inject 100 µl of protein standards (0.1-mg
stock standards/ml) for the 4.6-mm I.D. column (i.e., 10 µg of each protein standard; 10
µg = 100 mAU at a flow rate of 1 ml/min). For a 2.1-mm I.D. column, load 10 µl of 0.1-mg
protein standards/ml, i.e., 1 µg of each protein standard (1 µg = 100 mAU at a flow
rate of 0.1 ml/min).

These conditions were employed to produce the elution profile of the control protein standard
chromatograms shown in Figure 5.9. An alternative to protein standards is a tryptic
digest of cytochrome c (Figure 7.9c, Chapter 7).


3. Run the test peptide digest mixture, using the same conditions used in Step 2. 


4. For sample collection, collect eluting peptide peaks in polypropylene microfuge tubes fitted
with tight-fitting caps and store at -20°C for further study. If sample collection is to
be performed manually, the dead volume of the plumbing from the detector flow cell to
the outlet tubing must be determined accurately.

EXPERIMENTAL TIP: A good approximation of the dead volume (d µl) can be made using
the following equation: $d = \pi r^{2} h$, where h is the length of the tubing from the flow cell to the
outlet (in mm), and r is the internal radius (in mm) of the tubing. The preferred method for
determining the dead volume involves the injection of a dye (aqueous 0.019% w/v Coomassie
Blue) and accurately determining the time (using a stopwatch) between the first appearance
of dye in the flow cell (i.e., off-scale detector signal) and the appearance of blue dye
from the outlet tubing (easily seen by spotting the eluent onto white blotting paper). When
performing this task, disassemble the column from the HPLC equipment and replace it with a
comparable length of tubing that is then connected to the flow cell.


5. For long-term column storage, flush the columns and store with an aqueous organic solvent
(e.g., 50% v/v propanol/50% H2O). For overnight storage, pump the columns with
Solvent B at a very slow flow rate (e.g., 50 µl/min).
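
The volume calculations used in this method (column volume, re-equilibration time, and the dead-volume delay between flow cell and outlet) can be sketched in a few lines of Python. The snippet is an illustration under stated assumptions: it uses the geometric (empty-tube) volume of a cylinder, the function names are introduced here, and the outlet tubing dimensions in the example (0.254-mm I.D. x 300 mm) are hypothetical.

```python
import math


def cylinder_volume_ul(internal_diameter_mm, length_mm):
    """Geometric volume of a tube, V = pi * r^2 * h, in ul (1 mm^3 = 1 ul)."""
    radius_mm = internal_diameter_mm / 2.0
    return math.pi * radius_mm ** 2 * length_mm


def equilibration_time_min(internal_diameter_mm, length_mm, flow_ul_min, n_volumes=20):
    """Time (minutes) to pass n_volumes column volumes at the given flow rate."""
    return n_volumes * cylinder_volume_ul(internal_diameter_mm, length_mm) / flow_ul_min


def dead_volume_delay_s(tubing_id_mm, tubing_length_mm, flow_ul_min):
    """Delay (seconds) between detection in the flow cell and arrival at the outlet."""
    return cylinder_volume_ul(tubing_id_mm, tubing_length_mm) / flow_ul_min * 60.0


if __name__ == "__main__":
    # 100-mm columns at the standard flow rates given in this protocol.
    print(f"4.6-mm I.D.: {equilibration_time_min(4.6, 100.0, 1000.0):.1f} min")
    print(f"2.1-mm I.D.: {equilibration_time_min(2.1, 100.0, 100.0):.1f} min")
    # Hypothetical outlet tubing: 0.254-mm I.D. x 300 mm at 100 ul/min.
    print(f"Collection delay: {dead_volume_delay_s(0.254, 300.0, 100.0):.1f} s")
```

On this geometric basis, 20 column volumes take about 33 minutes on a 100 x 4.6-mm I.D. column at 1 ml/min and about 69 minutes on a 100 x 2.1-mm I.D. column at 0.1 ml/min. The dye method described in the tip remains the preferred way to determine the actual delay.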


COLUMN FITTINGS INTERCHANGEABILITY


There are many different fittings in HPLC, and so many that look alike, that interchangeability
is a real problem. The following explanation is an attempt to clarify the matter.


1. Take a zero dead-volume (ZDV) union, male nut, and ferrule from one manufacturer.
2. Put the male nut and ferrule on a piece of 1/16-inch O.D. stainless steel tubing.


3. Insert the tubing, ferrule, and nut into the ZDV union. Make sure that the tubing bottoms
out in the union.


4. Swage the ferrule onto the tubing by tightening the male nut finger tight, and then make
a three-quarters turn with a wrench.


5. Now remove the male nut, ferrule, and tubing from the fitting. 


6. Measure the distance from the end of the ferrule to the end of the tubing. This is 
Dimension X. 


Dimension X varies from one manufacturer to another, and it can also vary for fittings 
from the same manufacturer due to variation in tolerances. Because of the typical variances 
seen among the array of manufacturers, new fittings, new ferrules, and new connections 
are recommended each time assemblies are connected, although such care is not necessary 
in all cases. For example, Valco, Parker, Swagelok, and Upchurch Scientific zero dead-vol- 
ume fittings are interchangeable at the initial assembly and may be interchangeable in sub- 
sequent assemblies. However, Dimension X is long with a Waters fitting. If a ferrule has 
been swaged in a Waters fitting, and is then inserted into an Upchurch fitting, the ferrule 
will not seat and it will leak. Dimension X is short with an Upchurch fitting. If the ferrule 
has been swaged in an Upchurch fitting and it is inserted into a Waters fitting, the tubing 
will not bottom out and there will be a dead volume/mixing chamber. (Adapted from the
Upchurch Scientific Catalog, 2000 [http://www.upchurch.com].)


PROTOCOL 2 


Packing Capillary Columns for RP-HPLC* 


CAPILLARY OR MICROCOLUMN (<0.5 mm I.D.) HPLC is an extremely powerful technique for
separating small quantities of proteins and peptides. It has distinct advantages over
wider-bore column HPLC, such as increased mass sensitivity and lower flow rates, which make it
particularly attractive for coupling to detection techniques such as mass spectrometry (see
Chapter 8). Although the advantages of column miniaturization (microcolumn [<1 mm I.D.]
HPLC, also known as capillary liquid chromatography) have been known for some years now
(Ishii et al. 1977; Yang 1982; Novotny and Ishii 1985), it is only recently that capillary
column RP-HPLC has become a widely used technique in mass-spectrometry-based proteomics
studies (see Chapter 8). Unlike capillary electrophoresis (for reviews, see Wiktorowicz 1992;
Li 1993), which has gained wide acceptance and for which many commercial instruments are
available, capillary HPLC still remains a specialist technique. It is only recently that
commercial capillary column RP-HPLC instruments and packed capillary columns have become
available (e.g., Agilent Capillary LC, Waters CapLC, LC Packings Ultimate; for further
information, see Table 5.2).

This protocol describes a procedure for adapting conventional HPLC systems to provide the
accurate low flow rates (0.44 µl/min) and gradients required to operate slurry-packed
capillary columns. A key component of this system is a commercial axial-beam longitudinal flow
cell (Chervet et al. 1989) that can be fitted to a number of commercial UV detectors (Moritz
et al. 1994). Procedures are described for the fabrication of 0.32-mm I.D. polyimide-coated
fused-silica columns, slurry-packed with reversed-phase chromatographic supports. The
chromatographic performance of these columns is illustrated using a series of standard
proteins and a mixture of peptides derived from a Staphylococcus aureus V8 protease digest of
recombinant murine IL-6 (see also the alternative "bomb" method in Chapter 8).


MATERIALS


Reagents


CAUTION: See Appendix 3 for appropriate handling of materials marked with <!>.


Methanol (50% aqueous v/v) <!> 
n-propanol <!> 
Reversed-phase silica <!> 
Trifluoroacetic acid (TFA)/acetonitrile solvent system <!>
Solvent A is 0.1% (v/v) aqueous TFA
Solvent B is 60% (v/v) acetonitrile, 40% H2O containing 0.09% (v/v) TFA


*This protocol was contributed by Robert L. Moritz, Joint ProteomicS Laboratory (JPSL), Ludwig Institute of
Cancer Research/Walter and Eliza Hall Institute for Medical Research, Melbourne, Australia.
Equipment


Bath sonicator 


Chromatographic syringe (Microliter 700 series, Hamilton, Reno, Nevada) 
Column frits 
A variety of porous membranes can be used as column frits to retain the column packing. 
Zitex membrane 
Glass fiber filter paper 
Polyvinylidene difluoride, hydrophilic 
End fittings, standard male liquid chromatography (0.5-mm I.D. flowthrough ports, 1/4-inch
column end-fittings)
Epoxy resin (24-hour curing-time type) 
Ferrules (vespel/graphite) (100/0.4-VG1, Alltech) 
Flow cell 
Axial-beam longitudinal capillary flow cells with small internal volumes (26.5 nl) can be obtained
for various UV detectors such as Spectra-Physics, Applied Biosystems Inc., Carlo Erba, Hitachi, and
Waters. The flow cells can be purchased from LC Packings (Amsterdam, The Netherlands; San
Francisco, California).


Flow cell holder (LC Packings, Amsterdam, The Netherlands; San Francisco, California) 
Forward optics scanning detector (Thermo Galactic/Spectra Physics) 
The detector should be equipped with a U-shaped longitudinal capillary flow cell (6-mm path length,
illuminated volume ~26.5 nl).
Heat gun 
Injector 
Rheodyne Model 7520 fitted with 0.2-1.0-µl internal sample rotor
or
Model 8125 injector fitted with 0.5-50-µl injection loops
Liquid chromatography column packing pump (Shandon, Cheshire, UK) 
Microcolumns 
0.8-0.2-mm I.D. microcolumns can be obtained commercially from LC Packings (Amsterdam, The
Netherlands; San Francisco, California), Agilent, Waters, and MicroChrom, or constructed as
described below.


Fused-silica tubing: Fused-silica tubing is used to provide back pressure to the split tee (see Figure
5.10). Various lengths and internal diameters can be used (e.g., 0.050-mm I.D. x 100 mm or 0.100-mm
I.D. x 400 mm). The tubing is held in place at the split tee by sleeving the fused silica with a
10-mm length of snugly fitting Teflon tubing to provide an O.D. of 1/16 inch (see Figure 5.10).
Fused-silica capillary column (Polymicro Technologies Inc., Phoenix, Arizona; Millipore; Selleys
Pty. Ltd., Victoria, Australia; Alltech Associates Inc., Deerfield, Illinois)

• Stationary phase can be purchased loose from many manufacturers or obtained by unpacking
prepacked columns.

• One-piece PEEK nut or one-piece fitting (1/16-inch) (32233, Alltech).

• 1/16-inch O.D. x 0.5-mm I.D. Teflon tubing. Other internal-diameter tubing can be used to
provide a tight fit around the column body depending on the size of fused silica used (20033,
Alltech).

• Polyimide-coated fused-silica tubing (Polymicro Technologies Inc.) of 0.540-mm O.D. x 0.320-mm
I.D., 0.340-mm O.D. x 0.200-mm I.D., and 0.200-mm O.D. x 0.100-mm I.D. cut to lengths
of 300 mm to provide the column body.

• Frits for these columns are punched from sheets of 0.45-µm porosity hydrophilic PVDF (HVLP
04700, Millipore).

• Epoxy glue is used to join the column body and exit tubing together (04-465, Selleys).

• Exit tubing that fits into these column bodies (i.e., 0.275-mm O.D. x 0.075-mm I.D., 0.199-mm
O.D. x 0.050-mm I.D., and 0.090-mm O.D. x 0.050-mm I.D.) cut to lengths of 200 mm.


Nuts, fingertight (1102- LK, Swagelok, Solon, Ohio) 
Polypropylene tube (1.5 ml) 
FIGURE 5.10. Gradient capillary HPLC system. (A) Component parts (indicated by circled numbers): (1)
solvent delivery systems; (2) split tee; (3) fused-silica tubing; (4) Polysil tubing; (5) sample injector; (6)
microcolumn; (7) fused-silica capillary column; (8) Teflon connector; (9) flow cell; (10) UV detectors; (11)
signal data collection. (For more details, see Materials list in this protocol.) (B) An expanded view of the
polyimide-coated fused-silica capillary column. (a) Stationary phase; (b) one-piece PEEK nut; (c) Teflon
tubing 1/16-inch O.D. x 0.5-mm I.D.; (d) polyimide-coated fused-silica tubing; (e) frits; (f) epoxy glue
joining; (g) exit tubing. (Redrawn, with permission, from Moritz et al. 1994.)


Polysil tubing 
Small-bore Teflon-coated, aluminum-clad fused silica (50 x 0.05-mm I.D.) is used to connect the
split tee to the injector. This tubing provides a low dead-volume flow path for the column flow 
(PT50-05, Scientific Glass Engineering Pty. Ltd., Victoria, Australia) 

Preinjection solvent split (1/16-inch) stainless steel tee (see Figure 5.10) 

Safety glasses 

Sample injector
Small internal port injectors with either internal or external loops (0.06-50 µl) are used to inject
samples directly onto the capillary columns. Injectors used in this study are a Rheodyne injector
Model 8125 or 7520 or a Valco injector Model C4W or C14W (Rheodyne, Cotati, California; Valco
Instruments Co. Inc., Houston, Texas).
Sapphire knife (23740U, Supelco, Bellefonte, Pennsylvania)


Signal data collection 
Data collection devices such as strip chart recorders, integrators, or signal processing software can 
be used for data storage and subsequent manipulation (Kipp & Zonen, Amsterdam, The 
Netherlands; Hewlett-Packard, Waldbronn, Germany; Spectra Physics, Fremont, California). 
Silicon rubber septa 


Slurry reservoir 
The reservoir is manufactured from an empty 50 x 2-mm I.D. glass-lined stainless steel tube (SGE
Inc., Ringwood, Victoria, Australia).

Solvent delivery system 
Hewlett-Packard HP1090M (or HP1100 series) liquid chromatograph. 
Almost any conventional liquid chromatograph can be used to provide micro flow rates by operating
the instrument at standard flow rates (i.e., 1 ml/min) and splitting off the microflow required.

Split tee 
Flow split tee junctions with internal ports of 0.5-mm I.D. and 1/16-inch female tubing connections
(U-428, Upchurch Scientific Inc., Oak Harbor, Washington).
All tubing used in the micro flow path, originating at the split tee, incorporating the injector and
microcolumn, and terminating at the detector flow cell, is kept as short as possible to minimize
any flow dispersion.

Teflon connectors 
Teflon tubing of 1/16-inch O.D. and various internal diameters is used to provide tight, leak-free
tubing unions. If an internal diameter close to the size of the outer diameter of the fused-silica tubing
cannot be found, slightly larger tubing can be stretched between two pliers to draw out the Teflon
tubing and decrease its internal diameter until the required size is obtained (Alltech Associates Inc.,
Deerfield, Illinois; Upchurch Scientific Inc., Oak Harbor, Washington).

UV detector 
Detectors suitable for the capillary HPLC system described here can be purchased from Spectra
Physics (Model Spectra-100, Spectra Focus) or Applied Biosystems Inc., Model 783 (Spectra
Physics, Fremont, California; ABI/Perkin Elmer, Foster City, California).


Configuration of Capillary Liquid Chromatography System


The following solvent delivery system was used (see Figure 5.10): A Hewlett-Packard 
HP1090M liquid chromatograph utilized as a "slave" pump and microgradient source. 


Flow Generation


1. For accurate flow rates of 0.05-20 µl/min through the capillary columns and reproducible
gradient formation, install a preinjection solvent split (1/16-inch tee) that directs
>95% of the solvent flow through a length (~100 cm) of 0.10-mm I.D. x 0.26-mm O.D.
fused-silica tubing to waste.


2. Connect capillary columns directly to either a Rheodyne Model 7520 injector fitted with
0.2-1.0-µl internal sample rotor or to a Model 8125 injector fitted with 0.5-50-µl
injection loops.


3. Set the flow through the column (0.4-4 µl/min) accurately either by adjusting the length
of fused-silica tubing going to waste or by varying the pump flow rate (100-500 µl/min).


With this split-flow approach, it is important that the actual flow rate through the column
be monitored on a regular basis (e.g., every day). This is readily achieved by installing a
10-µl chromatographic syringe at the effluent port and accurately timing the advancing
meniscus. Details of the various components used in configuring this system and their
commercial sources are listed in Materials.
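
The daily flow check described above reduces to simple arithmetic. The following is a minimal sketch in Python, not part of the original protocol; the function names and the example reading (2.0 µl of meniscus travel in 75 seconds) are hypothetical and chosen only for illustration.

```python
def column_flow_ul_per_min(volume_ul, elapsed_s):
    """Column flow rate (µl/min) from a timed meniscus advance in the syringe."""
    if elapsed_s <= 0:
        raise ValueError("elapsed time must be positive")
    return volume_ul * 60.0 / elapsed_s


def split_ratio(pump_flow_ul_per_min, column_flow):
    """Pump flow divided by column flow, i.e. the N in a 1:N split."""
    if column_flow <= 0:
        raise ValueError("column flow must be positive")
    return pump_flow_ul_per_min / column_flow


# Hypothetical reading: the meniscus advances 2.0 µl in 75 s
# while the pump delivers 0.50 ml/min (500 µl/min).
flow = column_flow_ul_per_min(2.0, 75.0)
ratio = split_ratio(500.0, flow)
print(f"column flow = {flow:.2f} µl/min, split = 1:{ratio:.0f}")
```

A measured ratio that drifts from day to day would suggest adjusting the length of the waste tubing or the pump flow rate, as described in Step 3.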


UV Detection 


1. Monitor column eluent from both capillary HPLC systems by absorbance at 214 nm 
using a Thermo Galactic/Spectra Physics forward optics scanning detector equipped with 
a U-shaped longitudinal capillary flow cell (6-mm path length, illuminated volume ~26.5
nl) installed in the conventional flow cell holder.


2. Send the data either to a strip chart recorder via the single-wavelength mode or
to a personal computer with Spectra Focus software, or equivalent, installed
and operated in the multiple-wavelength mode (195-340 nm, 5-nm intervals).


3. For capillary liquid chromatography/mass spectrometry (LC/MS), connect the eluent 
from the UV detector directly to the electrospray inlet via a 50-cm length of 0.05-mm I.D.
x 0.19-mm O.D. fused-silica tubing. 


This tubing, which replaces the standard stainless steel electrospray needle, extends to the tip 
of the spray needle assembly. 


4. For nanoflow capillary HPLC, connect the outlet directly to the nanoflow tips (New 
Objective, Inc., Woburn, Massachusetts; http://www.newobjective.com).


Construction of Capillary Columns 


Numerous reports in the literature describe the construction of microcolumns (<0.5 mm
I.D.) from such diverse materials as Teflon tubing (Ishii et al. 1977), glass-lined stainless steel
tubing (Chervet et al. 1989), and polyimide-coated fused-silica tubing (Yang 1982). We prefer
to construct microcolumns from the latter. Compared to the other materials mentioned
above, polyimide-coated fused-silica offers a number of distinct advantages (see Table 5.8).
Uppermost of these are its flexibility and ease of handling, its transparency, its commercial
availability in a wide range of inner and outer diameters, and its relatively low cost.

The following step-by-step procedure for constructing 0.1-0.32-mm I.D. fused-silica
microcolumns uses the construction of a 250 x 0.20-mm I.D. column as an example.


1. Cut two lengths of polyimide fused-silica tubing, one 250-mm length (0.200-mm I.D. x
0.340-mm O.D.) to provide the column body and another 200-mm length (0.050-mm
I.D. x 0.190-mm O.D.) to provide the column exit tubing.

CAUTION: Wear safety glasses and adhere to recommended safety precautions during col- 
umn packing. 
* Thin-walled capillary tubing has a tendency to fracture easily when subjected to high
pressures (~400 bar) during the slurry-packing of chromatographic supports. For
instance, in our experience, fused-silica tubing of 0.320-mm I.D. x 0.540-mm O.D. and
0.200-mm I.D. x 0.340-mm O.D. can withstand packing pressures of up to 450 bar,
whereas tubing of 0.320-mm I.D. x 0.450-mm O.D. fractures readily at these high
pressures. In the latter case, slurry-packing pressures should not exceed 50 bar. Finally, it
should be noted that superior column efficiencies can be achieved by packing at the
higher pressures afforded by using the thick-wall fused-silica tubing.


TABLE 5.8. Fused-silica microcolumn characteristics

Column I.D.   Slave pump flow                 Flow rate (µl/min)     Optimal protein
(mm)          rate (ml/min)     Split ratio   Operating   Loading    load (µg)

0.3           0.50              1:140         3.6         20         r
0.2           0.50              1:310         1.6         5          1.0
0.1           0.50              1:1250        0.4         2          0.2


To avoid jagged ends and to obtain square-cut capillaries, use a sapphire knife instead of 
the more common ceramic squares. Tubing with a jagged end will result in imprecise frit 
positioning. 

* Teflon tubing is used to connect lengths of fused-silica tubing (see Figure 5.10). Because
there are slight variations in the outer diameters of fused-silica tubing from different 
manufacturers, it is often necessary to fabricate such connectors to obtain tight-fitting 
Teflon connectors. This is readily achieved by stretching a slightly oversized inner diam- 
eter piece of Teflon tubing by hand with two pairs of pliers. 


2. Cut the PVDF frit from a large sheet of PVDF membrane using a short length (50-mm) 
of fused-silica tubing (0.32 [or 0.20]-mm I.D. x 0.45 [or 0.34]-mm O.D.) as a disc
cutter. This is accomplished by placing the PVDF sheet on a silicon rubber septum and 
punching the frit. The column exit tubing is used to insert the fabricated PVDF frit into 
the microcolumn body to a depth of ~10 mm. 


A variety of porous membranes can be used as column frits to retain the column packing. 
These include Zitex membrane (Davis and Lee 1992), glass-fiber filter paper (Davis and Lee
1992), and hydrophilic polyvinylidene difluoride (PVDF) (Moritz and Simpson 1992a,b).
For the columns described in this study, a 0.45-µm porosity hydrophilic PVDF membrane
was used. 


3. Position the microcolumn body and the column exit tubing permanently by placing a 
small drop of precured epoxy resin at the junction of the two pieces of tubing. 


The epoxy resin is partly polymerized, or precured, with a heat gun to increase its viscosity 
so that it will not seep into the capillary tubing and block the frit. 


Inline MicroFilter assemblies from Upchurch Scientific, comprising a Micro Fingertight
Fitting (F-125), a Micro End Fitting (M-120), and a Filter Union Body (M-520-01), can be
used as column frits to retain column packing. For instructions on assembling the Inline
MicroFilter, see the Upchurch Scientific Web Site: http://www.upchurch.com.


Column Packing 


Once the PVDF frit has been positioned and the fused-silica capillary column constructed, a 
slurry-packing procedure is used to pack the microcolumn. 


1. Manufacture a slurry reservoir from an empty 50 x 2-mm I.D. glass-lined stainless steel
tube fitted with standard male 1/4-inch liquid chromatography column end-fittings
(0.5-mm I.D. flowthrough ports).
2. Connect the fused-silica microcolumn to the standard 1/4-inch liquid chromatography
end-fitting by vespel/graphite and fingertight nuts.


3. Sonicate a slurry of reversed-phase silica in n-propanol (40 mg/200 µl) for 15 minutes in
a 1.5-ml polypropylene tube.


4. Prior to packing, fill the capillary column with the packing solvent (n-propanol) under
high pressure (400 bar) using a standard liquid chromatography column packing pump
(Shandon).


This step prefills the microcolumn with packing solvent and also allows the system to be
checked for tight seals.


5. Disconnect the slurry reservoir from both the column packer and the microcolumn,
empty it, and reposition it in place. Fill it with 150 µl of the prepared slurry mixture.


6. Pack the column at a constant pressure of 400 bar for 20 minutes, consolidate with 50%
(v/v) aqueous methanol for a further 20 minutes at the same pressure, and then allow the
column to depressurize.


7. Dismantle the column carefully and seal it by inserting both the top and bottom ends of
the column into silicon rubber septa.
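
The volumes involved in the packing steps above can be checked with a short calculation. This is an illustrative sketch only, assuming ideal cylindrical geometry; the function names are hypothetical, and the dimensions are those quoted in the text (a 250 x 0.20-mm I.D. column, a 50 x 2-mm I.D. reservoir, and a 40 mg/200 µl slurry).

```python
import math


def tube_volume_ul(length_mm, inner_diameter_mm):
    """Internal volume of a cylindrical tube; 1 mm^3 equals 1 µl."""
    radius_mm = inner_diameter_mm / 2.0
    return math.pi * radius_mm ** 2 * length_mm


def slurry_mass_mg(volume_ul, mg_per_ul=40.0 / 200.0):
    """Mass of silica contained in a given volume of the 40 mg/200 µl slurry."""
    return volume_ul * mg_per_ul


column_ul = tube_volume_ul(250.0, 0.20)    # empty column body
reservoir_ul = tube_volume_ul(50.0, 2.0)   # slurry reservoir
loaded_mg = slurry_mass_mg(150.0)          # silica in 150 µl of slurry

print(f"column volume    = {column_ul:.2f} µl")
print(f"reservoir volume = {reservoir_ul:.1f} µl")
print(f"silica loaded    = {loaded_mg:.0f} mg")
```

Under these assumptions the 150-µl slurry charge nearly fills the reservoir and supplies a large excess of packing relative to the few microliters of column volume.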


COLUMN OPERATION 


Standard conditions for operating a capillary column RP-HPLC are given in Protocol 1. All
gradient separations were achieved using a standard trifluoroacetic acid/acetonitrile solvent
system, i.e., a linear 60-minute gradient from 0% to 100% Solvent B, where Solvent A
was 0.1% (v/v) aqueous trifluoroacetic acid and Solvent B was 60% (v/v) acetonitrile, 40%
H2O containing 0.09% (v/v) aqueous trifluoroacetic acid. For mass spectrometric analysis,
the trifluoroacetic acid concentration was reduced to 0.05% and 0.045% for Solvents A and
B, respectively.
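
As a rough guide to what the column experiences during this gradient, the mobile-phase composition at any time can be sketched as follows. This is a simplified illustration that ignores system dwell volume and mixing delays; the function names are hypothetical.

```python
def percent_b(t_min, gradient_min=60.0, start_b=0.0, end_b=100.0):
    """Programmed % Solvent B at time t for a linear gradient (clamped at the ends)."""
    fraction = min(max(t_min / gradient_min, 0.0), 1.0)
    return start_b + (end_b - start_b) * fraction


def percent_acetonitrile(t_min, acn_in_b=60.0, **gradient):
    """% (v/v) acetonitrile delivered, given Solvent B is 60% acetonitrile."""
    return percent_b(t_min, **gradient) * acn_in_b / 100.0


for t in (0, 15, 30, 45, 60):
    print(f"{t:2d} min: {percent_b(t):5.1f}% B, "
          f"{percent_acetonitrile(t):4.1f}% acetonitrile")
```

With Solvent B at 60% acetonitrile, the 60-minute gradient corresponds to a nominal slope of 1% acetonitrile per minute.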


PROTOCOL 3 


Purification of Large “Troublesome” Polypeptides 


THE SEPARATION AND PURIFICATION OF LARGE SYNTHETIC, nonpolar polypeptides or polypeptide
fragments of proteins derived from cyanogen bromide cleavage procedures can often
present formidable challenges due to their low solubility or the propensity of these molecules
to self-associate. RP-HPLC provides one avenue to purify such troublesome examples,
provided certain steps and precautions are taken.

Large polypeptides are often troublesome to handle because of their low solubility or the 
presence of secondary structural elements that favor supramolecular self-self assembly. 
Often, these attributes are a direct consequence of the presence in the large polypeptide of a 
higher content of nonpolar amino acids in the sequence, or alternatively, the order of the
amino acids favors the formation of intercalating β-sheets or fibrils, or α-helical coiled-coil
dimers or higher oligomers. Hence, different strategies are needed for their purification. 

Notwithstanding these challenges, HPLC is still the best technique currently for purify- 
ing and analyzing such types of large polypeptides. Due to the vastly different properties of 
these molecules, four different modes of separation can be used: reversed-phase, ion- 
exchange, size-exclusion, and hydrophobic interaction. Inevitably, the latter three modes 
require the use of eluents that contain low concentrations of nonionic or ionic detergents or 
surfactants, such as 0.1% Brij 25, Tween-20, Triton X-100, or even SDS, or alternatively, a
relatively high concentration of a chaotrope such as urea or guanidinium hydrochloride (Hearn
1998). Because of the presence of these additives, ion-exchange, size-exclusion, and
hydrophobic interaction HPLC cannot be considered to be the ultimate purification step. 
Instead, procedures intended to enable other contaminants to be removed selectively from 
the large polypeptide or protein of interest should be considered. However, as a single tech- 
nique, RP-HPLC satisfies most of the requirements for such analyses and purifications, and 
as a consequence, it is the most established and widely used procedure. Thus, RP-HPLC 
methods not only enable concomitant desalting, removal of additives that aid dissolution of 
the polypeptide or protein, but also permit resolution as well as maintenance of a reasonably 
high concentration of the solutes, due to the presence of the (often) low-pH, aquo-organic 
solvent conditions. This protocol addresses the group of large polypeptides that have proved 
to be troublesome to handle due to their low solubility or the presence of secondary struc- 
tural elements that favor supramolecular self-self assembly. As noted above, included in this 
group are polypeptides with amphipathic α-helical or β-sheet structures with extensive runs
of nonpolar (hydrophobic) amino acid side chains or, alternatively, proline-rich sequences. 
Related procedures are equally germane to polypeptides that have been lipidated or subject- 
ed to chemical modifications with nonpolar moieties, as well as to some core cyanogen bro- 
mide fragments of large proteins. In the cases described below, the use of the polar “ion-pair- 
ing” reagent trifluoroacetic acid with the “dipole clustering” solvent, acetonitrile, is described. 
Other ion-pairing reagents (Hearn 1980, 1985), high concentrations of formic acid 
(Heukeshoven and Dernick 1985; Mant and Hodges 1991a), the presence of low concentrations
of anionic or cationic detergents (van der Zee et al. 1983; Hancock 1984; Hearn 2000b),
or organic solvents of lower polarity, such as 2-propanol or tetrahydrofuran (van der Zee et 
al. 1983; Mant et al. 1997), can be used to generate alternative mobile-phase systems. 


This protocol was contributed by Hooi Hong Keah and Milton T.W. Hearn (Centre for
Bioprocess Technology, Department of Biochemistry and Molecular Biology, Monash
University, Clayton, Victoria 3800, Australia).


MATERIALS 


CAUTION: See Appendix 3 for appropriate handling of materials marked with <!>.


Reagents


Buffer A: Add 1 ml of trifluoroacetic acid (TFA) <!> to 1 liter of Milli-Q H2O in a 1-liter
measuring cylinder

Buffer B: Add separately 600 ml of acetonitrile <!>, 400 ml of Milli-Q H2O, and 1 ml of TFA
in a 1-liter measuring cylinder


Use a sintered glass funnel to filter Buffers A and B separately through a 0.2-µm filter (nylon
47-mm filter membrane, Part No. 2034, Alltech Assoc., Deerfield, Illinois) directly into a graduated
flask. Take care at this stage not to over-aspirate or evacuate the flask, because this evaporates
the organic solvent and thus reduces the organic solvent content of Buffer B. Adjust the volumes of
acetonitrile and H2O to make a buffer with a different percentage of acetonitrile when necessary,
but maintain the volume of TFA.


IMPORTANT: TFA used for protein sequencing may not be suitable for the preparation of RP- 
HPLC eluents because it can contain anti-oxidants. 
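
The adjustment of Buffer B to a different acetonitrile percentage, with the TFA volume held constant, can be sketched as a small helper. This is an illustrative calculation only; the function name and the 1-liter default are assumptions, and the components are measured separately as in the recipe above, since acetonitrile/water mixtures do not combine to exactly additive volumes.

```python
def buffer_recipe(percent_acetonitrile, total_ml=1000.0, tfa_ml=1.0):
    """Volumes to measure separately for an acetonitrile/H2O eluent with fixed TFA."""
    if not 0.0 <= percent_acetonitrile <= 100.0:
        raise ValueError("percentage must lie between 0 and 100")
    acetonitrile_ml = total_ml * percent_acetonitrile / 100.0
    return {
        "acetonitrile_ml": acetonitrile_ml,
        "water_ml": total_ml - acetonitrile_ml,
        "tfa_ml": tfa_ml,  # kept constant regardless of acetonitrile content
    }


print(buffer_recipe(60))  # the standard Buffer B of this protocol
print(buffer_recipe(90))  # a stronger eluent for poorly soluble polypeptides
```

Setting the percentage to 0 reproduces Buffer A (1 liter of H2O plus 1 ml of TFA).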


Equipment


Columns 


For Sample Purification: A semipreparative C18 or another appropriate n-alkylsilica sorbent of
suitable particle size (i.e., between 5 and 25 µm), porosity (i.e., between 150 and 1000 Å), and
phase ratio packed in a preparative column of appropriate dimensions, e.g., 10 µm, 300 Å, 300
x 21.5-mm I.D. The TSK-ODS-120T (Tosoh Corp.) C18 column (300 x 21.5-mm I.D., particle
size 10 µm), protected by a guard column, represents one suitable option.


For Sample Analysis: An analytical C18 or another appropriate n-alkylsilica sorbent of suitable
particle size (i.e., between 1.5 and 10 µm), porosity (i.e., between 150 and 1000 Å), and phase
ratio packed in an analytical column of the appropriate dimensions, e.g., 5 µm, 300 Å, 150 x
4.6-mm I.D. The TSK-ODS-120T (Tosoh Corp.) C18 column (150 x 4.6 mm, particle size 5 µm)
represents a suitable option or, alternatively, the Agilent SP-C18, Supelco Discovery C18, Hypersil
ODS C18, or one of the available types of reversed-phase columns from the 1500 or so vendors.


HPLC system 


Binary/quaternary solvent delivery pumps, such as a Waters Model 601 system or equivalent. 
Gradient controller, such as a Waters 600 controller or equivalent. 

Sample injector (automated injector for analysis, manual injection for separation), such as a 
Waters Wisp 712 or equivalent. 

Fraction collector, such as the Amersham Biosciences Frac100 or equivalent. 

Variable wavelength or photodiode array UV detector, such as a Waters 486 detector or
equivalent.

Computer system for data collection, and subsequent analysis and presentation, such as the
NT-based Millennium software operating on a PC (Pentium system) of adequate RAM (≥64 Mbyte)
and hard disk memory (>1 GB) or equivalent.
Lyophilizer

Microfuge 

Modular or dedicated HPLC instrumentation (e.g., Beckman System Gold, Agilent 1100 
Series HPLC) 


This instrumentation can be used in analogous fashion depending on the vendor preference of the 
investigator. 


Nitrogen gas <!> 


Biological Sample
Large nonpolar polypeptides or proteins for purification 


METHOD 


Sample Preparation of Large Nonpolar Polypeptides or Proteins 
for Purification by RP-HPLC 


1. Prepare the polypeptide sample (between 30 and 50 mg if derived from solid-phase 
chemical synthesis methods, although significantly smaller sample amounts, i.e., down to 
microgram quantities, can be similarly used if the sample is derived from other sources) 
by dissolving in the elution buffer (Buffer B, ideally in 1 ml if 30-50 mg of sample is avail- 
able or in smaller volumes as appropriate to the sample size) in a microfuge tube. 


The acetonitrile content (v/v) needed in the buffer depends on the hydrophobicity of the
protein or large polypeptide: the more hydrophobic it is, the higher the acetonitrile content
required. Generally, 60% (v/v) of acetonitrile in H2O with 0.1% (v/v) TFA is sufficient for this
purpose. With polypeptides that are poorly soluble, a solvent of 90% (v/v) acetonitrile-H2O
containing 0.1% (v/v) TFA may be required. It may be necessary to use a detergent-based,
urea-based, or guanidine-hydrochloride-based solvent or a 60% (v/v) formic-acid-based solvent.
Detailed investigations (Hearn 1991a) have shown that the optimal concentration for maximal
resolution of many large polypeptides or small proteins in RP-HPLC often occurs near
15 mM TFA.


2. Centrifuge crude samples in a microfuge at 2000g for 5 minutes to ensure that all insolu- 
ble material and/or precipitates are removed before injection. 


For additional details on procedures that can be adopted to achieve suitable high-resolution 
conditions, see Protocols 4, 5, and 6. 
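
The note under Step 1 quotes TFA both as 0.1% (v/v) and as an optimum near 15 mM. The two scales can be related with a short conversion, sketched below. The density (about 1.489 g/ml at 20°C) and molar mass (114.02 g/mole) of TFA are general reference values assumed here, not figures taken from this protocol, and the function names are hypothetical.

```python
TFA_DENSITY_G_PER_ML = 1.489   # assumed reference value at about 20 °C
TFA_MOLAR_MASS_G_PER_MOL = 114.02  # assumed reference value


def tfa_percent_to_mM(percent_v_v):
    """Convert a TFA concentration in % (v/v) to millimolar."""
    ml_per_liter = percent_v_v * 10.0            # 1% (v/v) is 10 ml per liter
    grams_per_liter = ml_per_liter * TFA_DENSITY_G_PER_ML
    return grams_per_liter / TFA_MOLAR_MASS_G_PER_MOL * 1000.0


def tfa_mM_to_percent(millimolar):
    """Convert a TFA concentration in millimolar to % (v/v)."""
    grams_per_liter = millimolar / 1000.0 * TFA_MOLAR_MASS_G_PER_MOL
    return grams_per_liter / TFA_DENSITY_G_PER_ML / 10.0


print(f"0.1% (v/v) TFA is about {tfa_percent_to_mM(0.1):.1f} mM")
print(f"15 mM TFA is about {tfa_mM_to_percent(15.0):.3f}% (v/v)")
```

On these assumptions, the standard 0.1% (v/v) TFA eluent (about 13 mM) lies close to the 15 mM optimum cited above.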


Analysis of Large Nonpolar Polypeptides or Proteins 
A small quantity of the sample (50 µl) is required for analysis.


1. Prepare ~5-10 aliquots for evaluation of the separation conditions with different types of 
RP-HPLC sorbents, under various elution conditions, i.e., different flow rates, tempera- 
tures, or gradient slope parameters. 


2. Inject ~5 µl of the aliquot for these evaluations.


Purification of Large Nonpolar Polypeptides


1. Sparge the buffers by bubbling nitrogen through the buffers for 20 minutes at 100 ml/min
per reservoir.


This step is done to exclude any air bubbles before running through the HPLC system. When
the system is running, sparging can be slowed down to 20 ml/min per reservoir.


It is important to ensure that no air bubbles are in the lines of the HPLC system. Air bubbles
in the system will result in inconsistent buffer flow. The pressure transducer on
the pump should indicate stable back-pressure if the system is properly purged and primed,
i.e., all of the lines are filled with the new buffer. This is done by pulling some buffer through
each pump and checking the consistency of flow at the outlet before the column is connected.


2. Connect the column. Make sure that the column is free of any impurities by running
elution buffer through it for 15 minutes at 6 ml/min. Wash the column with buffer for an
additional 20 minutes. The system should now be ready for injection.


3. Start the software for data collection and check that the detector is calibrated to the wave- 
length required. 


For polypeptides containing aromatic side-chain amino acids, 254 nm can be used; other- 
wise, separation can be carried out at 230 nm. Photodiode array detectors provide a flexible 
option for obtaining difference, derivative, and higher-order spectra, as well as utilizing peak 
track algorithmic approaches. 


4. Set up solvent gradients for elution on the computer. 


A gradient of 2-98% Buffer B (elution buffer) for 60 minutes is commonly used. Very shal- 
low gradients are normally used to separate peaks that run very close together. 


5. Use a manual injector to inject the sample for separation while maintaining the flow rate 
of the loading buffer (Buffer A) at 6 ml/min. 


6. Collect small fractions (~1-2 ml) in test tubes for analysis as the various peaks elute, as
shown for the first case study by the high absorbances of the chromatogram as seen in 
Figure 5.11. 


CASE STUDY 1: PURIFICATION OF A CRUDE SAMPLE OF SYNTHETIC FS-1 POLYPEPTIDE


In this case study, the purification of a crude sample of the synthetic FS-1 polypeptide,
related to the carboxy-terminal region of the activin-binding protein follistatin-315
(FS315) and implicated as a binding domain, was achieved with a TSK-ODS-120T C18
column of dimensions 300 x 21.5 mm with an n-octadecyl silica sorbent of average particle
size of 10 µm using a linear gradient of 0.1% TFA in H2O to 100% 0.1% TFA in
H2O/acetonitrile (10:90 v/v) for 1 hour at a flow rate of 6 ml/min, with detection at 230 nm
(Figure 5.11). Despite the fact that it contains several polar amino acid residues, the FS-1
polypeptide has a relatively low solubility in aqueous solutions and tends to precipitate at
higher pH values.
FIGURE 5.11. Purification of crude FS-1 polypeptide was achieved with a TSK-ODS-120T C18 column
(300 x 21.5 mm, particle size 10 µm) using a linear gradient of Buffer A (0.1% TFA in H2O) to 100% Buffer
B (0.1% TFA in H2O/acetonitrile [10:90 v/v]) for 1 hour at a flow rate of 6 ml/min, with detection at 230
nm.


7. Dispense the collected fractions into sample vials for subsequent analysis. 


RP-HPLC analysis, high-performance capillary electrophoresis (HPCZE), capillary
electrochromatographic (CEC), or mass spectroscopic (ESI-MS or MALDI-MS) analytical
procedures can be routinely employed. Generally, 50 µl per sample is reserved for evaluation of
peak purity, etc., and an injection of 30 µl is usually sufficient for all of these analyses.


8. For the analysis by RP-HPLC of the fractions collected, prepare the HPLC system by
repeating Steps 1-6, but in this case, reduce the flow rate to 1 ml/min with the detector
wavelength set at 214 nm and a suitable analytical column connected instead of the 
preparative column. 


For rapid analytical separations, the gradient time can be reduced to 25 minutes for analysis,
followed by 10-15 minutes of reequilibration time to ensure that the system is ready
for the next analysis.


CASE STUDY 2: CHROMATOGRAM OF CRUDE SAMPLE CONTAINING THE FS-1 POLYPEPTIDE AND
THE PURIFIED FS-1 POLYPEPTIDE


Figure 5.12, A and B, shows the corresponding analytical chromatograms of the crude sam- 
ple containing the FS-1 polypeptide and the purified FS-1 polypeptide collected from the 
preparative run described above (H. Keah and M. Hearn, unpubl.). In this case study, the 
analytical RP-HPLC separations were carried out using TSK C18 columns (150 x 4.6 mm,
particle size 5 µm) with detection at 214 nm and with elution achieved using a linear
gradient of Buffer A (0.1% TFA in H2O) to 100% Buffer B (0.1% TFA in H2O/acetonitrile
[10:90 v/v]) for 25 minutes at a flow rate of 1 ml/min, followed by 5 minutes of column
wash in 100% Buffer B prior to reequilibration with Buffer A.
FIGURE 5.12. RP-HPLC profiles of crude (A) and purified (B) FS-1 peptides. The analytical RP-HPLC
separations were carried out on TSK C18 columns (150 x 4.6 mm, particle size 5 µm) with samples eluted using
a linear gradient of Buffer A (0.1% TFA in H2O) to 100% Buffer B (0.1% TFA in H2O/acetonitrile [10:90 v/v])
for 25 minutes at a flow rate of 1 ml/min, followed by 5 minutes in 100% Buffer B with detection at 214 nm.


G2 LABEXPERT


Application of such analytical procedures to the resolution and purification of "difficult"
polypeptides and proteins is particularly amenable to the recently developed real-time
knowledge-based approach, called G2 LabExpert (I et al. 2002), allowing intelligent
automation of HPLC method development. Unlike other computerized method development
systems, the G2 LabExpert operates in real time, using an artificial intelligence system
and design engine to provide experimental decisions as well as to control the
instrumentation, column selection, and mobile-phase choice. The G2 LabExpert has been
programmed to control every input parameter to an HPLC data station, such as an Agilent
series 1100 HPLC Chemstation system, including operation of the diode array detector and
quaternary pump system equipped with degasser, autosampler, and software. It evaluates
each output parameter of the HPLC data station as part of its decision process. On the
basis of a combination of inherent and user-defined evaluation criteria, the program uses
a reasoning process, applying chromatographic principles and acquired experimental
observations to provide iteratively a regime for the a priori development of an acceptable
HPLC separation method. Because remote monitoring and control are also functions of
the G2 LabExpert, the system allows full-time use of analytical chromatographic
instrumentation and associated laboratory resources within a peptide synthesis/protein
chemistry laboratory. Extensive sample handling is also avoided, a feature important for the
purification or analysis of large polypeptides that are troublesome to handle because of
their low solubility or propensity to undergo self-self supramolecular assembly due to the
presence of secondary structural elements such as coiled-coil motifs.
FIGURE 5.13. RP-HPLC profiles of the crude and purified activin B, subunit polypeptide, ActB, (12-115).
The analytical RP-HPLC separations were carried out on the TSK C18 column (150 x 4.6 mm, particle size
5 µm) with samples eluted using a linear gradient of 0.1% TFA in H2O (Buffer A) to 100% Buffer B (0.1%
TFA in H2O/acetonitrile [20:80 v/v]) for 25 minutes at a flow rate of 1 ml/min, with detection at 214 nm.
Purification of the crude ActB, (12-115) polypeptide was achieved with a TSK-ODS-120T C18 column (300
x 21.5 mm, particle size 10 µm) using an optimized gradient of Buffer A to 100% Buffer B over 2 hours at
a flow rate of 6 ml/min, with detection at 230 nm. (Inset) ESI-MS transformation spectrum of ActB,
(12-115) polypeptide with an observed molecular mass of 12296.5 (calculated mass: 12289.8). The mass
peak at 12227.1 corresponds to the des monoAcm-ActB, (12-115).


CASE STUDY 3: CHROMATOGRAPHIC PROFILES FOR CRUDE AND PURIFIED SAMPLE OF
ACTIVIN B, SUBUNIT


Figure 5.13 shows the chromatographic profiles for the crude and purified sample of the 
activin B, subunit, ActB, (12-115). This fully synthetic analog of the pleiotropic homo- 
dimeric growth factor activin B,B, contains two large antiparallel B-strand loops, built 
around a catenating cystine knot framework, which creates the potential for self-associa- 
tion under inappropriately selected elution conditions. As evident from the figure, this 
large polypeptide could be easily purified using the protocols outlined above and summa- 
rized in greater detail in the legend to the figure. In particular, the authenticity of this fully 
synthetic 103 mer could be confirmed from the ESI-MS transformation spectrum, which 
showed that the purified ActB, (12-115) polypeptide had an observed molecular mass
of 12296.5 (calculated mass: 12289.8 amu). 
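
The agreement between the observed and calculated masses quoted above can be expressed as an absolute and a relative deviation. The following sketch is illustrative only; the function names are hypothetical, and the input values are those reported for this case study and in the legend to Figure 5.13.

```python
def mass_error_da(observed, calculated):
    """Absolute deviation of the observed from the calculated mass."""
    return observed - calculated


def mass_error_ppm(observed, calculated):
    """Relative deviation in parts per million of the calculated mass."""
    return (observed - calculated) / calculated * 1e6


observed, calculated = 12296.5, 12289.8
print(f"deviation = {mass_error_da(observed, calculated):+.1f} "
      f"({mass_error_ppm(observed, calculated):.0f} ppm)")

# Spacing between the main peak and the minor peak reported at 12227.1
print(f"peak spacing = {mass_error_da(observed, 12227.1):.1f}")
```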


CASE STUDY 4: CHROMATOGRAMS OF CRUDE AND PURIFIED MTPX1 POLYPEPTIDE


Figure 5.14, A and B, illustrates the corresponding chromatograms of the crude and puri- 
fied sample of MTPX1 polypeptide, respectively. The MTPX1 polypeptide represents the 
hydrophobic amino-terminal leader sequence of a CRISP-related mouse sperm tail protein
and was required for studies on sperm protein function and localization (Keah et al. 
2001a,b). This polypeptide very readily forms aggregates, and following lyophilization, it is 
extremely difficult to redissolve in low-pH aqueous buffers of low organic solvent content. 
Generally, an autoinjector instead of a manual injector is used for the analysis. This enables 
the set of samples to be injected and analyzed automatically with minimal time delays for 
the operator. 
FIGURE 5.14. RP-HPLC profiles of the crude (A) and purified (B) MTPX1 polypeptide. The analytical
RP-HPLC was carried out on TSK C18 columns (150 x 4.6 mm, particle size 5 µm) with samples eluted using a
linear gradient of Buffer A (0.1% TFA in H2O) to 100% Buffer B (0.1% TFA in H2O/acetonitrile [10:90 v/v])
for 30 minutes at a flow rate of 1 ml/min with detection at 214 nm. Also illustrated is an important issue
with regard to the control over the mobile-phase composition, temperature, or constancy of flow rate. As
evident from this figure, the retention time of the pure MTPX1 polypeptide following gradient elution (B)
is smaller than that for the separation of the crude MTPX1 sample shown in A. This result arose from a
deliberate (but small) variation in flow rate to simulate either transient gas bubble formation in the pump
head, etc., due to inadequate "degassing" of the mobile phase or pressure fluctuations caused by particulate
matter lodging in the HPLC tubing/valves, etc., due to inappropriate filtration of the sample/solvents
through a 0.2-µm filter, with the appearance of the chromatographic profile and retention time of the desired
polypeptide affected. Similar variations can arise when changes in the temperature control of the buffer
reservoirs or column occur, or decreases in the organic solvent content of Buffer B used for the gradient
elution arise due to evaporation, resulting in a longer retention time for the analytes.


9. When all of the fractions for analysis have been prepared, freeze-dry the samples in the 
test tubes on a lyophilizer overnight. 


10. Carry out the various analytical experiments, and based on the peak profiles and associated
mass spectrometric data, pool the relevant peak fraction tubes.


11. If the recovered lyophilized fractions prove to be difficult to resolubilize, use DMSO to
bring the dried powder back into solution when conjugation to a carrier protein is
planned as part of the preparation of a haptenic immunogen. When the planned studies
involve subsequent aliquoting steps, followed by spectroscopic analysis, etc., use DMF,
formic acid, or a high content of acetonitrile.


12. Store the polypeptides as dry fractions or as dissolved samples at -20°C.

CASE STUDY 5: VARIATION OF FLOW RATE AND RETENTION TIME OF A LARGE, SYNTHETIC,
NONPOLAR POLYPEPTIDE


The above general protocols have been applied with various refinements in this laboratory
for more than 20 years (Hearn 1984; Forage et al. 1986) with large polypeptides as well as
with cyanogen bromide fragments of proteins or with globular proteins themselves. They
have been found to be robust in terms of their reliability and reproducibility. However, as
illustrated in Figure 5.14, a final “case study” is presented. Here, HPLC conditions were
deliberately implemented to demonstrate how the retention time of a large synthetic nonpolar
polypeptide, in this case the MTPX1 polypeptide, will be affected if a variation in
flow rate occurs, with the appearance of the chromatographic profile thus affected. Similar
variations can arise when inadvertent changes in the temperature control of the buffer
reservoirs or column occur, or decreases in the organic solvent content of Buffer B used for
the gradient elution arise due to evaporation, resulting in a longer retention time for the
analytes.


PROTOCOL 4 


Purification of Peptides from Solid-phase Peptide Synthesis
with RP-HPLC 


PURIFICATION OF PEPTIDES DERIVED FROM SOLID-PHASE PEPTIDE SYNTHESIS (SPPS) requires the
removal of deletion peptides resulting from incomplete coupling/deprotection steps, of
by-products arising from racemization or side-chain rearrangement, and of various chemical
substances introduced during the deprotection or cleavage stages of an SPPS procedure. This
protocol details the purification and analysis of many synthetic peptides of 2-65 amino acid
residues. These peptides contain a reasonable number of ionizable or polar side chains, but do
not contain secondary structural elements that favor supramolecular assembly, such as β-sheets,
amphipathic structures with extensive runs of nonpolar side chains, proline-rich sequences, or
conformationally “fragile” motifs that flip between helical and sheet structures in response to
minor changes in mobile-phase composition (Hearn 2000b). Protocol 3 described the purification
and analysis of very large polypeptides that are troublesome to handle due to the presence of
one or more of the above features.

This protocol was contributed by Reinhard I. Boysen and Milton T.W. Hearn (Centre for
Bioprocess Technology, Department of Biochemistry and Molecular Biology, Monash
University, Clayton, Victoria 3800, Australia).


MATERIALS 


CAUTION: See Appendix 3 for appropriate handling of materials marked with <!>.


▶ Reagents
Acetone <!>
Acetonitrile (HPLC grade) <!>
H2O (deionized, from a Milli-Q water purification system or equivalent)
Phosphoric acid (H3PO4) <!>
2-Propanol <!>, methanol <!>, or other suitable organic solvents with excellent transparency
in the UV to 210 nm
Sodium nitrate <!>
Sodium phosphate (NaH2PO4) <!> or other suitable salts
Thiourea <!>
Trifluoroacetic acid (TFA) <!>


▶ Equipment
Analytical HPLC apparatus
This apparatus consists of a pump module, a mixing chamber, spectrophotometer with an analytical
flow cell, an analytical column, a column oven, an autosampler, PC, printer and software,
HPLC system with attendant data management systems and system automation controllers.
Helium gas is required for buffer degassing and nitrogen for autosampler operation. The column is
an analytical C18 or another n-alkylsilica sorbent of suitable particle size (i.e., between 1.5 and
10 µm), porosity (i.e., between 150 and 1000 Å), and phase ratio packed in an analytical column of the
appropriate dimensions (e.g., 5 µm, 300 Å, 150 x 4.6-mm I.D.).
Centrifuge (benchtop, for sample preparation) 
Conical vials (for autosampler)
Glassware
four 1-liter eluent bottles
two 1-liter measuring cylinders
two waste bottles
All glassware should be rinsed three times with Milli-Q H2O.
Hamilton glass syringe (graduated 25 µl, to fit injector port)
Helium gas <!>
Lyophilizer
Mobile-phase filtration facility
vacuum pump
1-liter reservoir
support base with glass frit and integral vacuum connection
funnel
clamp
47-mm membrane filter (0.2-µm PTFE)
Nitrogen gas <!> 
Preparative HPLC apparatus
This apparatus consists of a preparative pump module, mixing chamber, spectrophotometer with a
preparative flow cell, a manual injection valve with preparative (500-1000 µl) sample loop, fraction
collector, PC, printer and software with data management system, and system automation controller
shared with analytical apparatus. Helium gas is shared with the analytical apparatus. The column
is a preparative C18 or another n-alkylsilica sorbent of suitable particle size (i.e., between 5 and
25 µm), porosity (i.e., between 150 and 1000 Å), and phase ratio packed in a preparative column of
appropriate dimensions (e.g., 10 µm, 300 Å, 300 x 21.5-mm I.D.).


Safety glasses 
Stopwatch 
Syringe (1 ml) with shortened (cutoff) needle 


▶ Biological Sample


Peptide or protein for analysis 


Preparation of the Mobile Phase 


1. Prepare Eluent A (weak mobile phase) and Eluent B (strong mobile phase), 1 liter each,
for example:


Eluent A: 0.1% TFA in H2O
Eluent B: 0.09% TFA in 60% acetonitrile/40% H2O (v/v)


IMPORTANT: TFA employed for protein sequencing may not be suitable for the preparation
of RP-HPLC eluents because it can contain anti-oxidants.


The volumes of organic solvent and H2O must be measured in two different cylinders and
then combined because of volume contraction upon mixing, which may be up to 30 ml/liter
prepared solvent (depending on the nature of the organic solvent). Failure to do so can lead
to substantial errors in mobile-phase composition.

EXPERIMENTAL TIP: To compensate for the baseline shift in gradient elution (because
most organic components absorb more light at low wavelengths than H2O) when working
with H2O/organic solvent eluents, decrease the amount of ion-pair reagent (TFA, H3PO4) in
Eluent B (Mant and Hodges 1991a) by 10-15% in comparison with Eluent A, yielding a
flatter baseline (Dolan 1991).
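
The volumes behind the example eluents can be checked with a short calculation. The following is only a sketch: it assumes that all percentages are volume/volume and that the small TFA volume is added on top of the nominal 1 liter, neither of which is stated explicitly in this protocol, and the function name is illustrative. It also confirms that 0.09% TFA in Eluent B is 10% less than the 0.1% in Eluent A, that is, the lower end of the 10-15% range given in the tip.

```python
def eluent_recipe(total_ml, organic_fraction, tfa_percent):
    """Return the volumes (ml) to measure out for one eluent.

    Assumptions (not from the protocol): percentages are v/v, and the TFA
    volume is small enough to be ignored in the nominal total volume.
    Organic solvent and water are reported separately because they must be
    measured in separate cylinders before being combined.
    """
    organic_ml = total_ml * organic_fraction
    water_ml = total_ml - organic_ml
    tfa_ml = total_ml * tfa_percent / 100.0
    return {"organic_ml": organic_ml, "water_ml": water_ml, "tfa_ml": tfa_ml}


# Eluent A: 0.1% TFA in H2O; Eluent B: 0.09% TFA in 60% acetonitrile/40% H2O.
eluent_a = eluent_recipe(1000, 0.0, 0.10)
eluent_b = eluent_recipe(1000, 0.60, 0.09)

# Relative reduction of the ion-pair reagent in Eluent B compared with Eluent A.
reduction_percent = 100.0 * (0.10 - 0.09) / 0.10

print(eluent_a)
print(eluent_b)
print(f"TFA reduced by {reduction_percent:.0f}% in Eluent B")
```
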


2. Mix the solvent either by stirring with a Teflon-coated magnetic flea or by shaking in a
stoppered cylinder (time depends on volume).


WARNING: Do not use Parafilm (plastic wrap) under any circumstances to cover flasks or
cylinders containing the eluents used in RP-HPLC. The organic solvents in combination
with the acidic ion-pair reagent components in the eluents will dissolve components in the
Parafilm, yielding extraneous peaks in the chromatogram.


3. Filter Eluent A first and then Eluent B through a 0.2-µm PTFE filter.


Filtering of eluents increases the column lifetime and contributes to degassing.


4. Seal unused eluent bottles with a stopper to avoid evaporation of the organic solvent. 


Preparation of Peptide Sample 


1. Dissolve the sample with half the target volume of Eluent A (weak mobile phase) to
achieve the desired concentration. If the sample is not soluble, add a small amount
(typically <25% of total volume) of Eluent B (strong mobile phase).


Small amounts of strong mobile phase may cause preelution of the sample depending on the 
sample loop size, column dimensions, and starting mobile-phase composition of the gradient. 


2. Inspect the sample for clearness. Filter the sample through a 0.2-µm PTFE filter if insoluble,
opalescent, or solid particles are present. Alternatively, centrifuge the sample using
the supernatant for injection.


IMPORTANT: Do not inject samples that are not fully dissolved because they can block the
injector and column.


3. Store the sample if not in use at 4°C or -20°C, depending on the planned storage time and
usage. Peptides and protein samples can degrade at room temperature.


Avoid repeated freezing-thawing of the sample. Instead, prepare small aliquots of the peptide
or protein sample, which are kept at -20°C and are used for one task.


Purification of Peptides from Solid-phase Peptide Synthesis with RP-HPLC


1. Test the instrumental HPLC system. The following test allows the evaluation of the column
bed integrity (low integrity will be associated with split, fronting, or tailing peaks)
and column performance (in terms of plate numbers). This test also allows, if repeated at
regular intervals, the monitoring of the performance during the lifetime of a column,
and the assessment of batch-to-batch differences of column fillings (see Protocol 1).


a. Produce a blank run (inject Eluent A) and run a gradient 100% A → 100% B under
the same conditions as intended for the separation of the peptide or protein sample. 
Repeat if “ghost” peaks occur. 

This procedure cleans the column of peptides and proteins from previous separations, 
which have not been removed by the flushing process. 

b. Measure the dead volume of the column with thiourea or sodium nitrate (or any
other noninteractive solute).


c. Test the column performance with a gradient run and an appropriate test mixture. An
example of an RP-HPLC test mixture is

0.15% (w/v) dimethylphthalate
0.15% (w/v) diethylphthalate
0.01% (w/v) diphenyl
0.03% (w/v) o-terphenyl
0.32% (w/v) dioctylphthalate in methanol


d. Test the column performance with a gradient run and peptide standards for column
testing described, for example, in Mant and Hodges (1991b).
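
The two quantities obtained from the system test in step 1, the dead volume and the plate number, can be computed as sketched below. This is a minimal illustration rather than the authors' procedure: it uses the common half-height formula N = 5.54 (tR/w½)², which strictly applies to isocratic peaks, and the retention times and peak widths are hypothetical values.

```python
def dead_volume_ml(t0_min, flow_ml_min):
    """Dead (void) volume from the elution time of a noninteractive solute
    such as thiourea or sodium nitrate."""
    return t0_min * flow_ml_min


def plate_number(retention_time_min, width_half_height_min):
    """Plate number N from the half-height peak width (same time units)."""
    return 5.54 * (retention_time_min / width_half_height_min) ** 2


# Hypothetical readings from a test chromatogram at 1.0 ml/min.
v0 = dead_volume_ml(t0_min=1.5, flow_ml_min=1.0)
n_plates = plate_number(retention_time_min=10.0, width_half_height_min=0.2)

print(f"dead volume: {v0:.2f} ml, plate number: {n_plates:.0f}")
```

Recording these two numbers each time the test is repeated gives a simple log of column performance over its lifetime; a falling plate number or distorted peaks point to loss of bed integrity.
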


2. Perform analytical RP-HPLC of the crude peptide. 


a. Prior to sample analysis, conduct at least one blank run under the same conditions as
intended for the peptide sample. Repeat if peaks occur. 


b. Separate the crude peptide (~100 µg) with an analytical RP-HPLC procedure to assess
the sample in terms of purity, peak profile, and elution conditions with the experimental
conditions as outlined below in the panel.


c. Identify the component to be purified. 


CHROMATOGRAPHIC CONDITIONS FOR THE ANALYTICAL SEPARATION OF CRUDE PEPTIDE AND 
PROTEIN MIXTURES 


The following chromatographic conditions are representative of the type of standard ana- 
lytical protocol that can be employed. Numerous variations in terms of the choice of 
reversed-phase sorbent, gradient slope and duration, flow rate, mobile-phase composition 
(including the choice of different organic solvent modifiers, buffer species, or ion-pairing 
reagents), and pH, temperature, and sample size can be considered. Listed below is a linear 
gradient procedure that represents a robust protocol around which these variations can be 
constructed. 


• Column: For example, C4, C8, C18, etc., reversed-phase sorbent appropriately packed into
a suitable column (5 µm, 300 Å, 150 x 4.6-mm I.D.)

• Sample size: <2 mg peptide/protein

• Sample loop size: 20-200 µl

• Linear A → B gradient

• Eluent A: 0.1% aqueous TFA

• Eluent B: 0.1% TFA in acetonitrile/H2O

• Gradient rate: 1% B/min, for example, 60% acetonitrile/H2O

• Gradient range and time: 0-100% B in 60 minutes

• Flow rate: 1.0 ml/min

• Detection: 214 nm

• Temperature: Room temperature
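
The gradient rate in the panel above can be related to the gradient range and time with a short calculation. The sketch below is one reading of the panel, not a statement from the authors: a 0-100% B gradient in 60 minutes changes the proportion of Eluent B by about 1.67%/min, whereas the quoted rate of 1%/min matches the change in acetonitrile content when Eluent B contains 60% acetonitrile. The function name is illustrative.

```python
def gradient_rates(start_percent_b, end_percent_b, duration_min, organic_percent_in_b):
    """Return (rate in % Eluent B per min, rate in % organic solvent per min).

    Assumes a linear gradient and that Eluent A contains no organic solvent.
    """
    b_rate = (end_percent_b - start_percent_b) / duration_min
    organic_rate = b_rate * organic_percent_in_b / 100.0
    return b_rate, organic_rate


# Analytical conditions: 0-100% B in 60 minutes, Eluent B with 60% acetonitrile.
b_rate, acetonitrile_rate = gradient_rates(0, 100, 60, 60)
print(f"{b_rate:.2f}% B/min, {acetonitrile_rate:.2f}% acetonitrile/min")
```
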


3. Perform preparative RP-HPLC.


a. Separate the crude peptide mixture (~25-100 mg) with a preparative RP-HPLC procedure
using the experimental conditions as outlined in the panel below.


To avoid detector response overloading effects, wavelengths from 240 to 280 nm are
usually chosen. However, small amounts of chemical scavengers (used during the SPPS
procedures) present in the crude peptide solution can absorb very strongly in this wave- 
length range. It may be worthwhile to sacrifice up to a milligram of sample and to per- 
form a preparative separation with detection at 214 nm to determine the retention time 
of the main peptide product unambiguously. 


b. Collect HPLC fractions (3-7.5 ml).


CHROMATOGRAPHIC CONDITIONS FOR THE PREPARATIVE SEPARATION OF PEPTIDES 


The following chromatographic conditions are representative of the type of standard 
preparative protocol that can be employed. Numerous variations can be contemplated in 
terms of the choice of reversed-phase sorbent, gradient slope and duration, flow rate, 
mobile-phase composition (including the choice of organic solvent modifiers, buffer
species, or ion-pairing reagents), and pH, temperature, and sample size. Listed below is a 
linear gradient procedure that represents a robust protocol around which these variations 
can be constructed. 


• Column: For example, C4, C8, C18, etc., reversed-phase sorbent appropriately packed into
a suitable column (10 µm, 300 Å, 300 x 21.5-mm I.D.)

• Sample size: <150 mg peptide/protein

• Sample loop size: 1 ml, multiple injection

• Linear A → B gradient

• Eluent A: 0.1% aqueous TFA

• Eluent B: 0.1% TFA in acetonitrile/H2O

• Gradient rate: 0.66% B/min, for example, 60% acetonitrile/H2O

• Gradient range and time: 0-100% B in 90 minutes

• Flow rate: Activate pumps to deliver 7.5 ml/min

• Detection: 254 nm

• Temperature: Room temperature
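
When setting up the fraction collector for the preparative run, it can help to translate the fraction volumes of step 3b (3-7.5 ml) into collection times at the stated flow rate. The sketch below does this arithmetic; it assumes, purely for illustration, that fractions are collected at fixed volume across the whole 90-minute gradient, whereas in practice collection is often restricted to the region around the product peak.

```python
import math


def fraction_plan(flow_ml_min, run_min, fraction_ml):
    """Return (seconds per fraction, number of tubes) for fixed-volume collection."""
    seconds_per_fraction = 60.0 * fraction_ml / flow_ml_min
    n_fractions = math.ceil(flow_ml_min * run_min / fraction_ml)
    return seconds_per_fraction, n_fractions


# Preparative conditions from the panel: 7.5 ml/min for 90 minutes.
narrow = fraction_plan(7.5, 90, 3.0)   # smallest fraction volume in step 3b
wide = fraction_plan(7.5, 90, 7.5)     # largest fraction volume in step 3b

print(f"3.0-ml fractions: {narrow[0]:.0f} s each, {narrow[1]} tubes")
print(f"7.5-ml fractions: {wide[0]:.0f} s each, {wide[1]} tubes")
```
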


4. Perform analytical RP-HPLC. 
a. Analyze aliquots (30-50 µl) of the collected fraction with an analytical RP-HPLC.


A blank, the crude peptide solution, the fraction of interest, and the two fractions
before and the two fractions after are typically analyzed.


b. Identify the fractions containing the desired protein in sufficient purity (i.e., ≥95%)
by a comparison of the retention time of the purified component with the retention 
time of the target component in the chromatogram of the crude peptide. 


c. Weigh the peptide into microfuge tubes and store them at -20°C for further investi- 
gations. 
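
The selection of fractions in step 4b can be sketched as follows. This is an illustration under stated assumptions, not the authors' method: purity is estimated as the area percentage of the target peak in the analytical chromatogram, the target peak is matched to the retention time seen in the crude sample within a hypothetical tolerance of 0.2 minutes, and all peak data are invented.

```python
def target_purity(peaks, target_rt_min, rt_tolerance_min=0.2):
    """Area-percent purity of the peak nearest the target retention time.

    peaks is a list of (retention_time_min, area) tuples from one analytical
    run. Returns 0.0 when no peak lies within the tolerance window.
    """
    total_area = sum(area for _, area in peaks)
    candidates = [
        (abs(rt - target_rt_min), area)
        for rt, area in peaks
        if abs(rt - target_rt_min) <= rt_tolerance_min
    ]
    if not candidates or total_area <= 0:
        return 0.0
    _, target_area = min(candidates)
    return 100.0 * target_area / total_area


def fractions_to_pool(fractions, target_rt_min, min_purity=95.0):
    """Names of the fractions whose target peak meets the purity threshold."""
    return [
        name
        for name, peaks in fractions.items()
        if target_purity(peaks, target_rt_min) >= min_purity
    ]


# Hypothetical integration results for four neighboring fractions.
fractions = {
    "F41": [(31.9, 120.0), (32.6, 880.0)],
    "F42": [(32.6, 1960.0), (33.4, 40.0)],
    "F43": [(32.7, 1500.0), (33.4, 20.0)],
    "F44": [(32.6, 600.0), (33.4, 400.0)],
}
pooled = fractions_to_pool(fractions, target_rt_min=32.6)
print(pooled)
```

Area percentages at a single wavelength only approximate purity, because components differ in their absorbance; this is one reason the protocol follows up with orthogonal techniques and mass spectrometry.
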

5. Perform further analysis.


a. Freeze-dry fractions containing the peptide with a purity ≥95% as assessed by the
analytical RP-HPLC separation, by high-performance capillary zone electrophoresis
(HP-CZE) (Sitaram et al. 1999; Hearn et al. 2000), by capillary electrochromatography
(HP-CEC) (Walhagen et al. 2000a,b, 2001), or by another appropriate high-resolution
analytical technique.


b. Carry out off-line or on-line electrospray ionization-mass spectroscopy (ESI-MS) of
the purest fraction to confirm the correct synthesis and correct identification of the
component to be purified.


If the received mass differs from the expected mass, it is sometimes worthwhile to
investigate other fractions with ESI-MS.
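
For the comparison of received and expected mass in step 5b, the expected value can be estimated from the sequence. The sketch below is an illustration only: it uses standard average residue masses, treats each acetamidomethyl (Acm) group as adding C3H5NO (about 71.08 Da) to a cysteine, assumes free N and C termini, and applies the calculation to the 26-residue peptide described in the case study that follows. The m/z values assume simple protonation in positive-ion mode.

```python
# Average residue masses (Da) for the amino acids in the example sequence.
RESIDUE_MASS = {
    "G": 57.0519, "P": 97.1167, "V": 99.1326, "C": 103.1388,
    "I": 113.1594, "N": 114.1038, "D": 115.0886, "Q": 128.1307,
    "K": 128.1741, "E": 129.1155, "H": 137.1411, "R": 156.1875,
    "Y": 163.1760,
}
WATER = 18.0153      # H and OH of the free termini
ACM_SHIFT = 71.0788  # assumed net mass added per Cys(Acm): C3H5NO
PROTON = 1.00728


def average_mass(sequence, n_acm=0):
    """Average molecular mass of a linear peptide with n_acm Acm groups."""
    return sum(RESIDUE_MASS[aa] for aa in sequence) + WATER + n_acm * ACM_SHIFT


def mz(mass, charge):
    """m/z of the [M + zH]z+ ion."""
    return (mass + charge * PROTON) / charge


SEQUENCE = "RICDKCKVIRRHGRVYVICENPKHKQ"  # three Cys residues carry Acm
expected = average_mass(SEQUENCE, n_acm=3)
charge_series = {z: round(mz(expected, z), 2) for z in range(3, 7)}

print(f"expected average mass: {expected:.1f} Da")
print(charge_series)
```

A measured mass that is lower by roughly the mass of one residue would point to a deletion peptide, and a difference of about 71 Da per cysteine to a lost Acm group; these interpretations are suggestions, not findings from this protocol.
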


CASE STUDY: RP-HPLC AND PURIFICATION OF PRODUCTS FROM SOLID-PHASE PEPTIDE SYNTHESIS


Examples of the use of RP-HPLC in the analysis and purification of products from solid-phase
peptide synthesis are shown in Figures 5.15 and 5.16. The preparative isolation of the
synthetic 26-mer polypeptide product representing the core of the Thermus thermophilus
ribosomal L36 protein, NH2-RIC(Acm)DKC(Acm)KVIRRHGRVYVIC(Acm)ENPKHKQ-COOH
(Boysen and Hearn 2000), was achieved on a TSK-ODS-120 T column (300 x 21.5-mm I.D.,
120 Å, 10 µm, end-capped) using a flow rate of 7.5 ml/min and a 90-minute gradient
from 0% B to 100% B (A: 0.1% TFA in H2O; B: 60% acetonitrile, 0.09% TFA) at pH 2.1,
as depicted in Figure 5.15, with the major peak eluting at 58.77 min. The acetamidomethyl
protecting groups were left on the peptide to avoid unwanted disulfide bridge formation.

The resolution of the peptide from the crude mixture, as shown in Figure 5.16A, and the
purified product shown in Figure 5.16B with analytical RP-HPLC was achieved with a TSK-ODS-120 T
column (150 x 4.6-mm I.D., 120 Å, 5 µm, end-capped) using a flow rate of 1
ml/min and a 60-minute gradient from 100% A (0.1% TFA in H2O) to 100% B (90% acetonitrile,
0.09% TFA) at pH 2.1. When both the preparative and the analytical run of the
crude peptide are compared, it is apparent that the scavenger from the peptide cleavage,
which has a relatively strong absorbance at 254 nm, elutes between 80 and 100 minutes in the
preparative run (Figure 5.15), while absorbing little in the analytical run at 214 nm (Figure
5.16A). The basis for achieving the obtained high purity of the end product, as seen in Figure
5.16B, was a relatively shallow gradient slope in the preparative RP-HPLC in conjunction
with narrow peak fractioning. Adequate equilibration of the analytical column ensures run-to-run
reproducibility, facilitating peak identification according to retention times.

FIGURE 5.15. Use of RP-HPLC in the preparative isolation of
synthetic polypeptides. The synthetic 26-mer polypeptide product
representing the core of the T. thermophilus ribosomal L36
protein, H-RIC(Acm)DKC(Acm)KVIRRHGRVYVIC(Acm)ENPKHKQ-OH
(Boysen and Hearn 2000) (70 mg), was purified
from the crude mixture obtained from solid-phase peptide
synthesis on a TSK-ODS-120 T column (300 x 21.5-mm I.D.,
120 Å, 10 µm, end-capped) using a flow rate of 7.5 ml/min and
a 90-minute gradient from 0% B to 100% B (A: 0.1% TFA in
H2O; B: 60% acetonitrile, 0.09% TFA) at pH 2.1.

FIGURE 5.16. Use of RP-HPLC in purification of peptides derived from solid-phase peptide synthesis.
Resolution of the crude product (A), H-RIC(Acm)DKC(Acm)KVIRRHGRVYVIC(Acm)ENPKHKQ-OH
(Boysen and Hearn 2000), and the purified synthetic peptide (B) was achieved with a TSK-ODS-120 T
column (150 x 4.6-mm I.D., 120 Å, 5 µm, end-capped) using a flow rate of 1 ml/min and a 60-minute
gradient from 100% A (0.1% TFA in H2O) to 100% B (90% acetonitrile, 0.09% TFA) at pH 2.1.


PROTOCOL 5 


Desalting of Peptides and Protein Mixtures by 
RP-HPLC Techniques* 


THE RP-HPLC TECHNIQUE CAN BE USED TO "DESALT" PEPTIDE or protein samples derived from
extraction procedures, from chemical reactions such as reductive alkylation in the presence 
of urea or guanidine hydrochloride, citraconylation, iodination, or cyanogen bromide cleav- 
age, or recovered from other chromatographic separation modes (Hearn 1998). The peptide 
or protein solution is injected onto a small RP-HPLC column. An aqueous buffer is used to 
elute the salts while the peptides or proteins are concentrated at the top of the column. After 
elution of the salts, monitored by UV detection, the peptides or proteins are eluted with 
H2O-acetonitrile or H2O-2-propanol mobile phases. The loading capacity of an analytical
column (100-300 x 4 mm I.D.) is typically ~8 mg, whereas the loading capacity for a
semipreparative column (30 x 16 mm I.D.) is ~34 mg (Pohl and Kamp 1987).

This protocol was contributed by Reinhard I. Boysen and Milton T.W. Hearn (Centre for 
Bioprocess Technology, Department of Biochemistry and Molecular Biology, Monash 
University, Clayton, Victoria 3800, Australia). 


MATERIALS 


CAUTION: See Appendix 3 for appropriate handling of materials marked with <!>.


▶ Reagents
Acetonitrile (HPLC grade) <!>
H2O (deionized, from a Milli-Q water purification system or equivalent)
2-Propanol <!>
Sodium nitrate <!>
Thiourea <!>
Trifluoroacetic acid (TFA) <!>


▶ Equipment

Analytical HPLC apparatus
This apparatus consists of a pump module, mixing chamber, spectrophotometer with an analytical
flow cell, an analytical column, PC, printer and software, HPLC system with attendant data
management systems, and system automation controllers (see Figure 5.10).

Column
Analytical C4, C8, C18, or another appropriate alkylsilica sorbent appropriately packed into a
suitable column (e.g., 5 µm, 300 Å, 40 x 4-mm I.D.).


Eluent bottles (two, 1-liter) 
Hamilton glass syringe (graduated 25 µl)
Helium for buffer degassing <!>
Measuring cylinder (two, 1-liter)

All glassware should be rinsed three times with Milli-Q H2O.
Microfuge (benchtop) (capable of ~20,000g, for sample preparation) 
Mobile-phase filtration facility 

vacuum pump 

1-liter reservoir

support base with glass frit and integral vacuum connection 

funnel 

clamp 

47-mm membrane filter (0.2-µm PTFE)

Safety glasses 

Stopwatch 

Syringe (1 ml, with truncated needle) 
Vials (conical for autosampler) 
Waste bottle 


▶ Biological Sample
Protein or peptide sample for analysis 


METHOD 


Preparation of the Mobile Phase 
1. Prepare Eluent A (weak mobile phase) and Eluent B (strong mobile phase), 1 liter each, 
for example, 


Eluent A: 0.1% TFA in H2O
Eluent B: 0.09% TFA in 2-propanol 


IMPORTANT: TFA employed for protein sequencing may not be suitable because it can con- 
tain anti-oxidants. 


2. Mix the solvent either by stirring with a Teflon-coated magnetic flea or by shaking a stop- 
pered cylinder (time depends on volume). 


3. Filter Eluent A and then Eluent B through a 0.2-µm PTFE filter.


4. Seal unused eluent bottles with a stopper to avoid evaporation of the organic solvent.


Preparation of Peptide and Protein Samples 


1. Inspect the sample for clearness. 


2. Filter the sample through a 0.2-µm PTFE filter if insoluble, opalescent, or solid particles
are present. Alternatively, centrifuge the sample using the supernatant for injection.




Desalting of Proteins with RP-HPLC 


1. Test the instrumental HPLC system. 


a. Produce a blank run (inject Eluent A) and run the step elution procedure first with
isocratic 100% Eluent A and then with isocratic 100% Eluent B under the same conditions
as intended for the protein sample.


b. Measure the dead volume of the column with thiourea or sodium nitrate (or any
other noninteractive solute, e.g., salt solution) to identify the elution time of the
salt.


2. "Desalt" the proteins. 


a. Inject the salt-containing sample onto the reversed-phase column using 100% Eluent
A. The salt elutes at or near to the column dead (void) time.


Analogous methods can be used to elute urea or guanidine hydrochloride following
reductive alkylation or solubilization/refolding studies, and many low-molecular-weight
chemical compounds present in the sample as a consequence of chemical derivatization,
backbone cleavage, partial reduction reactions, or subunit-subunit dissociation,
or following ion exchange, biospecific affinity, or hydrophobic interaction
chromatographic steps.


b. Elute the protein either with a step of up to 100% Eluent B or, alternatively, by gradi- 
ent elution procedures. 


c. Collect the fraction containing the protein for further analysis. 


CHROMATOGRAPHIC CONDITIONS FOR THE DESALTING OF PEPTIDES AND PROTEINS 

The following chromatographic procedures are representative of the type of standard ana- 
lytical protocol that can be employed. 

• Chemicals: Acetonitrile, 2-propanol, TFA

• Column: For example, C4, C8, C18, or another appropriate alkylsilica sorbent appropriately
packed into a suitable column (10 µm, 300 Å, 300 x 4-mm I.D.)

• Sample size: 8-mg peptide or protein sample

• Sample loop size: 1 ml

• Step elution

• Eluent A: 0.1% aqueous TFA

• Eluent B: 0.1% TFA in acetonitrile or 2-propanol

• Elution conditions: 100% Eluent A for 3 minutes, then 100% Eluent B for 3 minutes

• Flow rate: 2.5 ml/min

• Detection: 230 nm

• Temperature: Room temperature
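
Because the salt elutes at or near the column dead time, a rough estimate of that time helps in deciding whether the 3-minute wash with Eluent A is long enough. The sketch below estimates it from the column geometry; the total porosity of 0.65 is an assumed, typical-order value that does not come from this protocol, and the measured dead volume (thiourea or sodium nitrate) should always take precedence over such an estimate.

```python
import math


def column_void_time_min(length_mm, inner_diameter_mm, flow_ml_min, total_porosity=0.65):
    """Estimate the dead (void) time of a packed column from its geometry.

    total_porosity is the assumed fraction of the empty-tube volume that is
    accessible to the mobile phase (hypothetical default).
    """
    geometric_volume_ml = math.pi * (inner_diameter_mm / 2.0) ** 2 * length_mm / 1000.0
    void_volume_ml = total_porosity * geometric_volume_ml
    return void_volume_ml / flow_ml_min


# 300 x 4-mm I.D. column from the panel, run at 2.5 ml/min.
t0_panel = column_void_time_min(300, 4, 2.5)
# 40 x 4-mm I.D. column listed under Equipment, same flow rate.
t0_short = column_void_time_min(40, 4, 2.5)

print(f"estimated dead time: {t0_panel:.2f} min (300 mm), {t0_short:.2f} min (40 mm)")
```

Under these assumptions the salt front on the 300-mm column would be expected at roughly 1 minute, well inside the 3-minute wash with Eluent A.
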

FIGURE 5.17. Use of RP-HPLC in the desalting of peptide and protein samples isolated by other techniques.
In this example, the 50S ribosomal protein from T. aquaticus on a C, column (40 x 4-mm I.D., 300
Å, 5 µm) was desalted at a flow rate of 2.5 ml/min using a step elution protocol with 3 minutes of elution
with Eluent A (0.1% TFA in H2O) and 3 minutes of elution with Eluent B (0.1% TFA in 2-propanol), pH
2.1. (S) Salt; (P) protein; (A) time of injection.


CASE STUDY: RP-HPLC IN THE DESALTING OF PEPTIDE AND PROTEIN SAMPLES


The example shown in Figure 5.17 illustrates the use of RP-HPLC in the desalting of
peptide and protein samples isolated by other techniques. It shows the step elution of a 50S
ribosomal protein sample derived from a Thermus aquaticus preparation on a C, column (40 x
4-mm I.D., 300 Å, 10 µm) using a flow rate of 2.5 ml/min.


PROTOCOL 6 


Separation of Proteins with RP-HPLC Using a Computer-
assisted Method to Optimize Gradient Conditions 


FOR THE SEPARATION OF DIVERSE COMPONENTS OF A SAMPLE containing peptides or proteins of
unknown composition, with a given solvent system and a given column, separation condi- 
tions that result in different retention times of the various components must be optimized. 
Empirical concepts describe the retention behavior of peptides and proteins with a ligand in 
the presence of different solvent combinations. The most commonly adapted concepts are 
based on the Solvophobic Theory (Horvath et al. 1976, 1977a; Hearn and Zhao 1999; Hearn 
2002) and the Linear Solvent Strength Theory (Snyder 1980; Purcell et al. 1999). These con- 
cepts allow the development of fast, robust, and cost-effective separation methods (Boysen et 
al. 1998). 
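
The Linear Solvent Strength relationship mentioned above is commonly written as log k = log k0 - S·φ, where k is the retention factor, φ is the volume fraction of organic solvent, k0 is the extrapolated retention factor in the absence of organic solvent, and S is the slope. The sketch below fits S and log k0 to a few isocratic measurements by least squares. It is only an illustration of the relationship: the data points are hypothetical and the function names are not taken from any optimization software.

```python
import math


def fit_lss(phis, ks):
    """Least-squares fit of log10 k = log10 k0 - S * phi.

    phis: volume fractions of organic solvent (0-1); ks: retention factors.
    Returns (log10 k0, S).
    """
    ys = [math.log10(k) for k in ks]
    n = len(phis)
    mean_x = sum(phis) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in phis)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(phis, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return intercept, -slope


def predict_k(log_k0, s, phi):
    """Retention factor predicted at organic fraction phi."""
    return 10 ** (log_k0 - s * phi)


# Hypothetical retention factors of one peptide at three acetonitrile fractions.
phis = [0.20, 0.25, 0.30]
ks = [10.0, 10 ** 0.5, 1.0]
log_k0, s_value = fit_lss(phis, ks)

print(f"log k0 = {log_k0:.2f}, S = {s_value:.2f}")
print(f"predicted k at phi = 0.22: {predict_k(log_k0, s_value, 0.22):.2f}")
```

The steep dependence of k on φ that such fits typically show for peptides and proteins is the reason small errors in mobile-phase composition translate into large shifts in retention.
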

This protocol was contributed by Reinhard I. Boysen and Milton T.W. Hearn (Centre for 
Bioprocess Technology, Department of Biochemistry and Molecular Biology, Monash 
University, Clayton, Victoria 3800, Australia). 


MATERIALS 


CAUTION: See Appendix 3 for appropriate handling of materials marked with <!>. 


▶ Reagents
Acetone <!>
Acetonitrile (HPLC grade) <!>
H2O (deionized, from a Milli-Q water purification system or equivalent)
Phosphoric acid (H3PO4) <!>
Sodium dihydrogen phosphate (NaH2PO4) <!> or other suitable salts
Thiourea <!> or sodium nitrate <!>
Trifluoroacetic acid (TFA) <!>


▶ Equipment
Analytical HPLC apparatus
This apparatus consists of a pump module, a mixing chamber, spectrophotometer with an analytical
flow cell, an analytical column, a column oven, a manual injector with sample loop, PC, printer
and software, e.g., Beckman, System Gold; Agilent HP-1090A (or 1100 series) liquid chromatograph,
or Waters 600/486 HPLC system with attendant data management systems and system
automation controllers.


Column
Analytical C18 or another appropriate n-alkylsilica sorbent appropriately packed into a suitable
column (e.g., 5 µm, 300 Å, 250 x 4-mm I.D.).
Conical vials (for autosampler)
Eluent bottles (two, 1-liter)
Hamilton glass syringe (graduated, 25 µl)
Helium gas (for buffer degassing) and nitrogen (for autosampler operation) <!>
Measuring cylinder (two, 1-liter)
All glassware should be rinsed three times with Milli-Q H2O.
Microfuge (benchtop)
Mobile-phase filtration facility
vacuum pump
1-liter reservoir
support base with glass frit and integral vacuum connection
funnel
clamp
47-mm membrane filter (0.2-µm PTFE)
Safety glasses
Stopwatch
Syringe (1 ml, with truncated needle)
Waste bottle 


▶ Biological Molecules
HPLC peptide standards 
See respective sections below. 
Protein sample 


METHOD 


Preparation of the Mobile Phase 


The following considerations are relevant to the preparation of mobile phases to be used with 
samples containing peptides and proteins. 


1. Prepare Eluent A (weak mobile phase) and Eluent B (strong mobile phase), 1 liter each,
for example,


Eluent A: 0.1% TFA in H2O
Eluent B: 0.09% TFA in 60% acetonitrile/40% H2O (v/v)


The volumes of organic solvent and H2O must be measured in two different cylinders and
then combined because of volume contraction upon mixing, which may be up to 30 ml/liter
prepared solvent (depending on the nature of the solvent). Failure to do so can lead to
substantial errors in mobile-phase composition.

To compensate for the baseline shift in gradient elution (because organic components
absorb more light at low wavelengths) when working with H2O/organic eluents, the amount
of ion-pair reagent (TFA, H3PO4) in Eluent B is usually decreased by 10-15% in comparison
with Eluent A, yielding a flat baseline (Dolan 1991).


IMPORTANT: TFA employed for protein sequencing may not be suitable because it can con- 
tain anti-oxidants. 

2. Mix the solvent either by stirring with a Teflon-coated magnetic flea or by shaking a
stoppered cylinder (time depends on volume).


WARNING: Do not use Parafilm under any circumstances to cover the eluents, because the 
organic solvents in combination with acidic ion-pair reagent components in the eluents will 
dissolve components in the Parafilm, yielding extra peaks in the chromatogram. 

3. Filter Eluent A and then Eluent B through a 0.20-µm PTFE filter.


Filtering of eluents increases the column lifetime and contributes to degassing. 


4. Seal unused eluent bottles with a stopper to avoid evaporation of the organic solvent. 


Preparation of the Protein Sample 


1. Dissolve the sample with half the target volume of Eluent A (weak mobile phase). If the
sample is not soluble, add a small amount (typically <25% of total volume) of Eluent B
(strong mobile phase).


Small amounts of strong mobile phase may cause pre-elution of the sample depending on
the sample loop size, column dimensions, and starting mobile-phase composition of the
gradient.


2. Inspect the sample for clearness. Filter the sample through a 0.2-µm PTFE filter if insoluble,
opalescent, or solid particles are present. Alternatively, centrifuge the sample using
the supernatant for injection.


Do not use samples that are not fully dissolved for injections, because they can block the 
injector and column. 


3. Store sample if not in use at 4°C or -20°C, depending on the planned storage time and
usage. Peptides and protein samples can degrade at room temperature. 


Avoid repeated freezing-thawing of the sample. Instead, prepare small aliquots of the pep- 
tide or protein sample, which are kept at -20°C and are used for one task. 


Test the instrumental HPLC System 
The following considerations are relevant to the testing of the HPLC system for use with sam- 
ples containing peptides and proteins. 


1. Produce a blank run (inject Eluent A) and run a gradient 100% A → 100% B under the
same conditions as intended for the peptide or protein sample. Repeat if peaks occur. 


This procedure cleans the column of peptides and proteins from previous separations, 
which have not been removed by the flushing process. 


2. Measure the dead volume of the column with thiourea or sodium nitrate (or any other 
noninteractive solute). 


3. Test the column performance with a gradient run and an appropriate test mixture. 


An example of the RP-HPLC test mixture is 0.1596 (w/v) dimethylphthalate, 0.1596 (w/v) 
diethylphthalate, 0.01% (w/v) diphenyl, 0.03% (w/v) O-terphenyl, 0.32% {wiv) dioctylph- 
thalate in methanol. 
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This test allows the evaluation of the column bed integrity (low integrity will be associated 
with split, fronting, or tailing peaks} and column performance fin terms of plate numbers). 
It also allows (if repeated at regular intervals) monitoring of performance during the life- 
time of a column, and the assessment of batch-to-batch differences of column filings. 


Peptide standards for column testing are described, for example, in Mant and Hodges
(1991b).


4. Measure the gradient delay (dwell volume) of the HPLC system (for another way of going
about this task, see Chapter 7). The following steps must be done without a column con- 
nected to the HPLC instrumentation: 


a. Connect the injector directly to the detector with a union piece (zero-length column).
b. Prepare a special Eluent A and Eluent B, 200 ml each:


Eluent A: acetonitrile 
Eluent B: acetonitrile, 0.2% acetone 


c. Run a gradient of 10-90% Eluent B for 10 minutes at a flow rate of 2.0 ml/min. 
Detection is carried out at 254 nm. 


The measured value of the dwell volume can be influenced by the injection technique.
If after the injection, the valve remains in the inject position, the dwell volume will
include the volume of the sample loop; if the valve is put back in load position, the
dwell volume will not. This effect can produce errors with sample loops >100 µl. The
same consideration is valid if the sample loop is exchanged from an analytical separation
(e.g., a sample loop of 50 µl) to a semipreparative separation (e.g., a sample loop
of 2500 µl) on the same column.


d. Determine the gradient delay and present the results graphically in a format similar 
to that shown as Figure 5.18. 


The dwell volume, $V_D$, is the volume of eluent from the pump heads to the column
inlet (including the mixing chamber volume). The dwell volume values range from 2
ml to 7 ml; autosamplers in particular make a large contribution to the delay volume.
It should be determined with an accuracy of ±0.5 ml. The obtained profile can be used
for diagnostic purposes, because the volume accuracy of the pump delivery is also 
monitored. 


Knowledge of the gradient delay is essential for method development, because it allows 
the accurate calculation of the S and $k_0$ values (loc. cit.). Its determination is particularly
important when establishing segmented gradients (since various errors can accumu-
late here) and when an established HPLC method is transferred from one instrument to 
another instrument. 
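
The arithmetic behind the gradient delay measurement can be sketched as follows. This is a minimal illustration only: it assumes the gradient program starts at injection with no initial hold, and the function name and the observed half-height time of 6.5 minutes are hypothetical values chosen for the example, not data from this protocol.

```python
def gradient_delay(t_half_observed_min, gradient_time_min, flow_ml_per_min):
    """Estimate the gradient delay time (min) and dwell volume (ml).

    t_half_observed_min: time at which the detector trace reaches the
        midpoint between the pregradient and postgradient baselines.
    gradient_time_min: programmed duration of the linear gradient.
    flow_ml_per_min: flow rate used during the measurement.
    """
    # Without any delay, a linear gradient reaches its midpoint at half its run time.
    t_half_theoretical = gradient_time_min / 2.0
    delay_min = t_half_observed_min - t_half_theoretical
    # Converting the delay time to a volume uses the flow rate.
    dwell_volume_ml = delay_min * flow_ml_per_min
    return delay_min, dwell_volume_ml


# Hypothetical reading for a 10-minute gradient run at 2.0 ml/min.
delay, dwell = gradient_delay(6.5, 10.0, 2.0)
print(f"gradient delay = {delay:.2f} min, dwell volume = {dwell:.2f} ml")
```

With these assumed numbers, the result falls inside the 2-7 ml range quoted above; a value far outside that range would suggest rechecking the plumbing or the baseline assignment.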


Perform Analytical RP-HPLC of the Protein Sample 


1. Prior to sample analysis, conduct at least one blank run under the same conditions as 
intended for the protein sample. Repeat if peaks occur. 


2. Separate the peptide or protein sample (100 µg) with two analytical RP-HPLC runs whose
gradient run times differ by a factor of 3.
FIGURE 5.18. Graphical illustration of the approach employed to determine the gradient delay. The
pregradient (10%) and postgradient (90%) baselines are used to determine the time when the 50% value is
reached. This is then used to calculate the gradient delay by subtracting the theoretical half-way time under
consideration of the flow rate as depicted in the formula.


CHROMATOGRAPHIC CONDITIONS FOR THE ANALYTICAL SEPARATION OF CRUDE PEPTIDES AND 
PROTEINS 


• Column: For example, C4, C8, C18, or another appropriate n-alkylsilica sorbent appropriately
  packed into a suitable column (5 µm, 300 Å, 250 x 4.0-mm I.D.)

• Sample size: <2 mg peptide/protein

• Sample loop size: 100-500 µl

• Linear A → B gradient

• Eluent A: 0.9% aqueous TFA <!>

• Eluent B: 0.1% TFA in acetonitrile <!>/H2O

• Gradient rate: 1% B/min, for example, 60% acetonitrile/H2O

• Gradient range and time: 0-100% B in 60 minutes

• Flow rate: 1.0 ml/min

• Detection: 214 nm

• Temperature: Room temperature




Optimization of the Gradient Conditions 


For resolution optimization, the strategy takes advantage of the relationship between the gradient
retention time of a protein (expressed as the median capacity factor, $\bar{k}$) and the median
volume fraction of the organic solvent modifier, $\bar{\varphi}$, in regular RP-HPLC systems based on
the concepts of the Linear Solvent Strength theory (Horváth et al. 1976; Snyder 1980; Hearn
and Grego 1983c; Purcell et al. 1999; Hearn 2002), such that


$$\ln \bar{k} = \ln k_0 - S\bar{\varphi}$$


where $k_0$ is the capacity factor of the solute in the absence of the organic solvent modifier,
and S is the slope of the plot of $\ln \bar{k}$ versus $\bar{\varphi}$. The values of $\ln k_0$ and S can be calculated by
linear regression analysis. The underlying principles of an intuitively performed optimization,
a manually achieved optimization (using Excel spreadsheets to calculate, e.g., the $\ln k_0$ and
S values), or, alternatively, optimization via computer simulation software (e.g., Simplex
methods, multivariate factor analysis programs, and DryLab G/plus) are essentially the same.
However, the outcomes result in different levels of precision. The approaches are collectively
outlined below.

Resolution, $R_s$, of peak zones is optimized through adjustment of $\bar{k}$ by successive change
of the parameters $t_G$ and $\Delta\varphi$ in the gradient elution mode according to the following steps:
1. Initial experiments.
2. Choice of starting conditions.
3. Calculation of $\ln k_0$ and S values from initial chromatograms.
4. Peak tracking and assignment of the peaks.
5. Optimization of gradient run time $t_G$ over the whole gradient range.
6. Determination of new gradient range.
7. Calculation of new gradient retention times $t_g$.
8. Change of gradient shape (optional).
9. Verification of the results.


1. Initial experiments 


The peptide or protein sample in initial experiments is separated under two linear gradient 
conditions differing by a factor of 3 in their gradient run times $t_G$ (all other chromatographic
parameters being held unchanged) (Dolan et al. 1989; Boysen et al. 1998) to obtain the RP-HPLC
retention times of each of the peptides or proteins. Irrespective of what optimization
strategy will then be used, it is advisable to separate any sample with at least two different gra- 
dient run times to identify overlapping peaks. For the optimization of the gradient shape and 
to achieve maximum resolution between adjacent peak zones, the ability to determine the 
retention times of the peptides or proteins and to classify the parameters that reflect the con- 
tributions from the mobile-phase composition and column dimensions (see Table 5.9) is 
essential (Ghrist and Snyder 1988a,b; Ghrist et al. 1988). Although the determination of the 
volume, V, is useful (Stadalius et al. 1984), determination of dead volume and gradient 
delay is essential (Snyder 1980; Boysen et al. 1998).

With these compiled input parameters, various algorithms, including DryLab G/plus, can
generate the relative resolution map (RRM), based on the calculation of the corresponding S
values and $k_0$ values for each component. If DryLab G/plus is not available, the resolution
information can be extracted directly from the vertical distances of the individual $R_s$ versus
TABLE 5.9. Parameters required for the calculation of $\ln k_0$ and S values, for manual optimization, and
optimization with DryLab G/plus


Optimization Computer-assisted 
utilizing basic optimization 
Parameter equations (Excel) (DryLab) 
Gradient delay volume V,, (ml) y y 
Dead volume tn (ml) N - 
Gradient range (% B) y N 
Gradient run time fonio (min) V Y 
Column length L (mm) - y 
Column diameter d, (mm) y 
Flow rate F (ml/min) - N 


Column plate number N - 
Retention times £,, for ail peaks N 
Retention times t» for all peaks 


at et! 


xL 


$t_G$ plots, whereby a crossover of lines reflects a peak overlap as shown in Figure 5.19a,b,c. Over
a small range of $t_G$ values, the S values and $k_0$ values for an individual peptide or protein will
remain essentially constant. It is assumed that no significant conformational change occurs
for the individual peptides or proteins as a consequence of the differences in column residency
time and gradient time (Purcell et al. 1993, 1999). The typical retention behavior of the
peptides and proteins is depicted in Figure 5.19 as plots of $\ln \bar{k}$ versus $\bar{\varphi}$. These plots become
nonparallel, and frequently intersect, as the mobile-phase composition is varied.


2. Choice of starting conditions 


The S values of peptides and proteins can be empirically correlated with their molecular 
weights (Snyder et al. 1983), based on a correlation of 


$$S = 0.48\,(\mathrm{MW})^{0.44}$$


Thus, S values, estimated with this equation or, alternatively, values of similar peptides or 
proteins taken from the literature, can be used to calculate a reasonable starting point for the 
initial experiments according to the following equation: 


$$t_G = \frac{V_m \cdot \Delta\varphi \cdot S \cdot \bar{k}}{0.87 \cdot F}$$


The required gradient run time $t_G$ for a separation of peptides and proteins with expected
S values of ~20 can be calculated for a gradient of 0-100% (60% acetonitrile) at a flow rate
of 1 ml/min, and an ideal $\bar{k}$ value of 5 as follows:

$$t_G = \frac{1.9\ \mathrm{ml} \times 0.6 \times 20 \times 5}{0.87 \times 1\ \mathrm{ml/min}} = 131\ \mathrm{min}$$
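
The starting-point estimate above can be reproduced with a few lines of code. The sketch below is illustrative only: the function names are hypothetical, and the molecular-weight correlation is taken in the form S = 0.48(MW)^0.44, where the exponent 0.44 is an assumption based on the correlation attributed to Snyder and co-workers.

```python
def estimate_s_value(molecular_weight):
    """Rough S value from molecular weight (assumed form: S = 0.48 * MW**0.44)."""
    return 0.48 * molecular_weight ** 0.44


def starting_gradient_time(v_m_ml, delta_phi, s_value, k_bar, flow_ml_per_min):
    """Gradient run time t_G (min) from t_G = V_m * delta_phi * S * k / (0.87 * F)."""
    return (v_m_ml * delta_phi * s_value * k_bar) / (0.87 * flow_ml_per_min)


# Values from the worked example in the text: V_m = 1.9 ml, delta_phi = 0.6,
# S = 20, target median capacity factor = 5, flow rate = 1 ml/min.
t_g = starting_gradient_time(1.9, 0.6, 20.0, 5.0, 1.0)
print(f"estimated gradient run time: {t_g:.0f} min")

# S estimate for a protein of about 12,000 molecular weight (cf. cytochrome c).
print(f"estimated S for MW 12,000: {estimate_s_value(12000):.1f}")
```

An estimate of this kind only fixes the order of magnitude of the run time; the two initial runs differing by a factor of 3 are still needed to obtain experimental S values.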


On the basis of this calculation, the initial gradients from 0% to 100% B for gradient
times of 1- and 3-hour duration can be selected. On the other hand, availability of the S values,
derived from two gradient elution RP-HPLC experiments, can be used as an analytical
criterion early in the separation of the target peptide or protein from other components in
soluble extracts of biological sources to distinguish low- from high-molecular-weight molecules
and, for example, to exclude peptide fragments participating in the optimization
process, without having to resort to SDS-PAGE experiments. The RP-HPLC behavior of small
peptides with molecular weights from 300 to 1000 is approximately correlated (Hearn and
Grego 1981; Snyder 1990) with S values between 3 and 10. For medium-molecular-weight
globular proteins, S values above 20 are expected (Aguilar et al. 1985). As an example,
cytochrome c, with a molecular weight of ~12,000, has an S value of 28.8 on a Nucleosil C18
column (Stadalius et al. 1984).

FIGURE 5.19. (A) Plots of logarithmic capacity factor $\ln \bar{k}$ versus the volume fraction of organic solvent, $\bar{\varphi}$, for six different ribosomal
proteins, denoted 1-6, obtained from T. aquaticus on a C18 column (Molnar et al. 1989a). As evident from this plot, the intersection
of two plots represents complete overlap of the peak zones at the specific gradient run times $t_G$ of 90 and 270 minutes. (B)
Plots of the resolution $R_s$ (resolution taken as the difference of two adjacent peaks) versus the gradient run time $t_G$ of these six different
proteins from T. aquaticus, which is the blueprint for generation of the relative resolution map (RRM). Because the plotted
values of the resolution are absolute values, negative resolution values of peaks that change their elution order are depicted as broken
lines. (C) RRM of the six different proteins from T. aquaticus, depicting only the critical peak pairs, including their change in
retention order. The resolution optimum for this specific set of chromatographic conditions occurs at $t_G$ = 150 minutes.


3. Calculation of $\ln k_0$ and S values from initial chromatograms


The retention times $t_{g1}$ and $t_{g2}$ for a peptide or protein solute separated under conditions of
two different gradient run times ($t_{G1}$ and $t_{G2}$, whereby $t_{G1} < t_{G2}$) can be given by the equations
(Quarry et al. 1986; Ghrist et al. 1988; Purcell et al. 1999):

$$t_{g1} = \frac{t_0}{b_1}\log(2.3\,k_0 b_1) + t_0 + t_D$$

$$t_{g2} = \frac{t_0}{b_2}\log(2.3\,k_0 b_2) + t_0 + t_D$$

with

$$\beta = \frac{b_1}{b_2} = \frac{t_{G2}}{t_{G1}}$$


Here, $t_{G1}$ and $t_{G2}$ are the gradient run time values of $t_G$ for two different gradient runs, resulting
in different values of b ($b_1$, $b_2$) and $t_g$ ($t_{g1}$, $t_{g2}$) for a single solute; $t_{g1}$ and $t_{g2}$ are the gradient
retention times for a single solute in two different gradient runs; $b_1$ and $b_2$ are the gradient steepness
parameters for a single solute and two gradient runs differing only in their gradient times.
Steep gradients correspond to large b values and small $\bar{k}$ values; $k_0$ is the solute capacity factor
at the initial mobile-phase composition; $\beta$ is the ratio of $t_{G2}$ and $t_{G1}$, which is equivalent to
the ratio of $b_1$ and $b_2$; $t_0$ is the column dead time; and $t_D$ is the gradient delay time.

For peptides and proteins, there is an explicit solution (Stadalius et al. 1984; Quarry et al.
1986) for b and $k_0$, namely:

$$b_1 = \frac{t_0 \log\beta}{t_{g1} - (t_{g2}/\beta) - (t_0 + t_D)(\beta - 1)/\beta}$$

$$\log k_0 = \frac{b_1}{t_0}\,(t_{g1} - t_0 - t_D) - \log(2.3\,b_1)$$


From the knowledge of b and $k_0$, the values of $\bar{k}$ and $\bar{\varphi}$ can be calculated (Snyder 1980;
Hearn 1991c).

Here, $\bar{k}$ is the value of k' (capacity factor) for a solute when it reaches the column midpoint
during elution; $\varphi$ is the volume fraction of solvent in the mobile phase; $\Delta\varphi$ is the change in
$\varphi$ for the mobile phase during the gradient elution ($\Delta\varphi$ = 1 for a 0-100% gradient); $\bar{\varphi}$ is the
effective value of $\varphi$ during gradient elution and the value of $\varphi$ at band center when the band
is at the midpoint of the column; and $t_G^{\circ}$ is the normalized gradient time with $t_G^{\circ} = t_G/\Delta\varphi$.

By linear regression analysis, using $\bar{k}$ and $\bar{\varphi}$, the S value (empirically related to the
hydrophobic contact area between solute and ligand) can be derived from the slope of the log
$\bar{k}$ versus $\bar{\varphi}$ plots, and $\ln k_0$ (empirically related to the affinity of the solute toward the ligand)
as the y intercept (Horváth et al. 1976; Hearn 2000a, 2002).

$$S = \frac{\ln k_0 - \ln \bar{k}}{\bar{\varphi}}$$
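
The two-run calculation of b and $k_0$ can be checked numerically. The sketch below is a hedged illustration rather than the procedure of any particular software package: the function names are hypothetical, the logarithms are taken as base 10 (consistent with the factor 2.3), and the expression for $b_1$ is obtained by eliminating $k_0$ from the two retention equations for runs 1 and 2. The numerical inputs are invented for a round-trip test only.

```python
import math


def retention_time(t0, t_d, b, k0):
    """Gradient retention time t_g = (t0/b) * log10(2.3 * k0 * b) + t0 + t_D."""
    return (t0 / b) * math.log10(2.3 * k0 * b) + t0 + t_d


def solve_b1_and_k0(tg1, tg2, t_gradient_1, t_gradient_2, t0, t_d):
    """Recover b1 and k0 from the retention times of two runs.

    beta = t_G2 / t_G1 = b1 / b2; all times must share the same unit.
    """
    beta = t_gradient_2 / t_gradient_1
    denominator = tg1 - tg2 / beta - (t0 + t_d) * (beta - 1.0) / beta
    b1 = t0 * math.log10(beta) / denominator
    log_k0 = (b1 / t0) * (tg1 - t0 - t_d) - math.log10(2.3 * b1)
    return b1, 10.0 ** log_k0


# Round-trip check with invented values: simulate two runs, then recover b1 and k0.
t0, t_d = 1.9, 3.0                     # column dead time and gradient delay (min)
true_b1, true_k0 = 0.3, 1.0e4
t_g_short, t_g_long = 60.0, 180.0      # gradient run times differing by a factor of 3
tg1 = retention_time(t0, t_d, true_b1, true_k0)
tg2 = retention_time(t0, t_d, true_b1 * t_g_short / t_g_long, true_k0)

b1, k0 = solve_b1_and_k0(tg1, tg2, t_g_short, t_g_long, t0, t_d)
print(f"recovered b1 = {b1:.4f}, k0 = {k0:.1f}")
```

A round trip of this kind is a convenient sanity check on a spreadsheet implementation before real retention times are entered.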


4. Peak tracking and assignment of the peaks 


Complex chromatograms that result from reversed-phase gradient elution can often exhibit 
changes in peak order when the gradient steepness is changed. Before $\ln k_0$ and S values are
calculated, or computer simulation is used, the peaks from the two initial runs must be cor- 
rectly assigned. Several approaches to peak tracking have been described, using algorithms 
based on relative retention and peak areas (Glajch et al. 1986; Lankmayr et al. 1989; Molnar 
et al. 1989b) or based on diode-array detection (DAD) (Berridge 1986; Strasters et al. 1989; 
Round et al. 1994). The assignment of peaks can be done in the following steps: 


• Integrate the chromatograms (using the integration software of the HPLC system or an
  integrator) of the initial runs and correct integration due to baseline drift or other instrumental
  causes where necessary.

• Print out both chromatograms including the peak area percent reports.

• Number all relevant peaks in the chromatogram with the better resolution. Relevant peaks
  have, for example, an area greater than 0.5% of the overall peak area.

• Assign the peaks according to their peak areas allowing a reasonable elution window.


The total peak areas, $A_{t1}$ and $A_{t2}$, for initial runs 1 and 2 (whereby $t_{G1} < t_{G2}$) are determined as

$$A_t = \sum_i A_i$$

and their ratio is calculated:

$$R_t = \frac{A_{t1}}{A_{t2}}$$

Consequently, the ratio of the peak areas, $A_1$ and $A_2$, of a baseline-separated single component
in initial runs 1 and 2, respectively, is

$$R_i = \frac{A_1}{A_2}$$

If the peaks are correctly assigned, $R_i = R_t$; difficulties arise when peaks partially or completely
overlap. If $R_i \ll R_t$ and the peak in run 2 is composed of a single component, the peaks are
wrongly matched. If $R_i \gg R_t$, then the peaks might be correctly matched, but the peak area
in run 1 could be enlarged by a hidden additional peak. To resolve these difficulties, the deviation
($\%R_i$) of $R_i$ from $R_t$ can be calculated and taken as a measure of the likelihood that a
peak assignment hypothesis is correct:

$$\%R_i = \frac{R_i}{R_t} \cdot 100 - 100$$

5. Optimization of the gradient time, $t_G$, over the whole gradient range


The capacity factor $\bar{k}$ is a linear function of the gradient run time $t_G$ if $\Delta\varphi$ is kept constant.
Hence,

$$\frac{\bar{k}}{t_G} = \frac{0.87\,F}{V_m \cdot \Delta\varphi \cdot S} = \mathrm{const.} = C$$

The optimized gradient run time $t_{G,\mathrm{new}}$ can be obtained from the RRM or, alternatively,
from the plot of $R_s$ versus $t_G$ (see Figure 5.19), and yields for each peptide or protein the new
value of $\bar{k}_{\mathrm{new}}$ when $t_{G,\mathrm{new}}$ is multiplied by C:

$$C \cdot t_{G,\mathrm{new}} = \bar{k}_{\mathrm{new}}$$
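
As a small worked illustration of this proportionality, the sketch below computes the constant C and the resulting capacity factor for a new gradient run time. The function names are hypothetical, and the parameter values (1.9 ml dead volume, 0.6 change in organic fraction, S of 20, 1 ml/min flow, 150-minute run time) are reused from examples elsewhere in this section purely for demonstration.

```python
def proportionality_constant(flow_ml_per_min, v_m_ml, delta_phi, s_value):
    """C = 0.87 * F / (V_m * delta_phi * S), in units of 1/min."""
    return 0.87 * flow_ml_per_min / (v_m_ml * delta_phi * s_value)


def new_capacity_factor(c_constant, new_gradient_time_min):
    """Median capacity factor after changing only the gradient run time."""
    return c_constant * new_gradient_time_min


c = proportionality_constant(1.0, 1.9, 0.6, 20.0)
k_new = new_capacity_factor(c, 150.0)
print(f"C = {c:.4f} per min, new capacity factor at 150 min = {k_new:.2f}")
```

Because S differs from solute to solute, C must be evaluated separately for each peptide or protein in the mixture.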


6. Determination of the new gradient range 


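If the gradient run time $t_{G,\mathrm{new}}$ is changed in relation to $\Delta\varphi$ with $t_G^{\circ}$ = const., the $\bar{k}$ values do
not change, as can be seen from the following equation:

$$t_G^{\circ} = \frac{t_{G,\mathrm{new}}}{\Delta\varphi} = \frac{V_m \cdot S \cdot \bar{k}}{0.87 \cdot F}$$

where

$$\Delta\varphi = \frac{t_{G,\mathrm{new}}}{t_G^{\circ}}$$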


and the retention time t, of the first peak is greater than (t; + tp) and the retention time t, of 
the last peak is less than Atg,,,. 


7. Calculation of the new gradient retention times, $t_g$


On the basis of knowledge of the S and the $\ln k_0$ values, new gradient retention times can then
be calculated. 


8. Change of gradient shape (optional) 


The multisegmented gradient should only be performed when the gradient delay has been 
measured. With multisegmented gradients, an error in the gradient delay will reoccur at the
beginning and at the end of each gradient step. In addition, the effect of V,,, (determined 
according to the procedures described in Ghrist et al. 1988), which modifies the composition 
of the gradient at the start and end (rounding of the gradient shape), can lead to deviation of 
the experimentally determined retention times from the predicted "ideal" retention times in 


DryLab G/plus simulations. 


9. Verification of the results 


After completion of the optimization process, the simulated chromatographic separation can 
now be verified experimentally using the predicted chromatographic conditions. 
FIGURE 5.20. 50S ribosomal subunit proteins of T. thermophilus were separated by gradient elution RP-HPLC,
using Eluent A (500 mM NaH2PO4, 500 mM H3PO4 at pH 2.1) and Eluent B (200 mM H3PO4 at pH
2.1/acetonitrile [28:75] [v/v]) (Boysen et al. 1998). The initial linear gradients for optimization were from
10% to 80% B of either 3 hours or 9 hours duration with a flow rate of 1.0 ml/min. All experiments were
carried out at 20°C. The location of the zinc finger ribosomal protein L36 of T. thermophilus is indicated by
the arrow. The sequences of all other T. thermophilus 50S ribosomal subunit proteins have been similarly
determined by automated Edman microsequencing following optimization of this RP-HPLC separation.


CASE STUDY: RP-HPLC OF THERMUS THERMOPHILUS 50S RIBOSOMAL PROTEINS


The method of de novo optimization of the resolution of complex protein mixtures using 
the computer-assisted method development was successfully applied to the RP-HPLC of T. 
thermophilus 50S ribosomal proteins (Figure 5.20) (Boysen et al. 1998). The separation 
approach for these basic ribosomal proteins used a mobile phase of high ionic strength to 
suppress silanophilic interactions with a non-end-capped LiChrospher RP-18 sorbent. 
These conditions were found to be a key requirement for achieving good resolution with 
minimal peak-tailing. The retention times of the 50S ribosomal proteins were observed to
be in very close agreement with values predicted by computer simulation procedures based
on linear solvent strength concepts, with an average error of only 0.5%.
IN 1954, FREDERICK SANGER AND HIS COLLEAGUES at the University of Cambridge provided a
landmark in protein chemistry when, after 10 years of intensive work on insulin, they succeeded
in achieving the first complete description of the chemical structure of insulin, a low-
molecular-weight protein molecule (for review, see Thompson 1955). Insulin, the pancreatic
hormone that governs sugar metabolism in the body, is a small protein of just 51 amino acids.
It comprises two chains that are held together by interchain disulfide bonds. “For his work on
the structure of proteins, especially that of insulin,” Sanger was awarded the Nobel prize for
Chemistry in 1958. (Sanger also received a share of the 1980 Nobel prize in Chemistry for his
method of determining the base sequences in nucleic acids.) The ability of the protein 
chemist to determine the covalent structure of protein molecules has been instrumental both 
to the development of three-dimensional structures of proteins and to furthering our under- 
standing of protein structure-function relationships. 

One of the key technologies that enabled Sanger and his team to solve the structure of 
insulin was the DNP (for dinitrophenyl group)-labeling method that they developed for 
covalently modifying the end amino acid in a peptide (Sanger 1945). The DNP group, which 
gives the peptide a distinctive yellow color, acts as a chemical marker that remains attached
to the amino group after the peptide is hydrolyzed with either dilute acid or enzymes. For
example, Sanger first broke each chain of the insulin molecule into small peptides and then
determined the sequence of each of the peptides in turn. A critical factor in this strategy was
to identify overlaps in sequences that would enable the different pieces of the jigsaw puzzle
to be assembled correctly, i.e., build up a contiguous sequence by aligning all of the peptides. 
Analysis of the peptide sequences proceeded as follows. 

First, the peptides were treated with the DNP-labeling reagent 2,4-dinitro-1-fluorobenzene
(DNFB) and then broken down into their constituent amino acids by acid hydrolysis.
α-DNP amino acid derivatives are resistant to acid hydrolysis and can be recovered with
yields ranging from 25% to 90% after boiling in azeotropic hydrochloric acid (5.7 N HCl) or
heating with 12 N HCl for 12 hours at 105°C. The amino-terminal amino acid, say A, was
identified by its yellow color using two-dimensional paper chromatography (Figure 6.1). The
process is repeated with a second sample of the peptide, but this time, the peptide is only partially
hydrolyzed so that two amino acids remain as a dipeptide derivative colored yellow. If
amino acid B is partnered with, for example, amino acid A in this peptide fragment, then one
knows that the sequence must be AB, and the order in the original sample, say a tripeptide, was
therefore ABC.

A second key technology that had an indispensable role in solving the insulin sequence 
was the partition chromatography method (also called filter paper chromatography) for separating
amino acids and peptides, invented by A.J.P. Martin and R.L.M. Synge. “For their
invention of partition chromatography” (the foundation of modern chromatography),
Martin and Synge were awarded the 1952 Nobel prize in Chemistry (for a review, see Moore
and Stein 1951).

During the intervening years since Sanger’s Herculean effort in determining the primary
structure of insulin, the methodology for polypeptide sequencing has developed at a fantastic
rate. This has been largely due to the development of the Edman degradation procedure
whereby amino acids in a polypeptide can be removed in a stepwise fashion from the amino-
terminal end. This subject, along with carboxy-terminal sequencing, is discussed in this chapter.
The strategy of the Edman degradation relies upon the polypeptide having a free α-amino
group at the amino terminus. In many situations, the α-amino group is derivatized or
blocked (e.g., by acetylation and formylation), making the polypeptide refractory to Edman
FIGURE 6.1. Two-dimensional paper chromatogram of a synthetic mixture of DNP amino acids according
to a method described by Levy (1954). Development was first carried out with the “toluene” method (first
dimension) and then with 1.5 M phosphate buffer (second dimension). P1 and P2, respectively, denote the
position of α-DNP-amino-δ-chlorovaleric acid and δ-DNP-amino-α-chlorovaleric acid. (Redrawn, with
permission, from Narita et al. 1975 [©Springer-Verlag].)


degradation. Methods for deblocking are also discussed. The actual day-to-day operation and 
maintenance of automated sequencing instruments require much technical prowess, and it is
thus recommended that these applications be left in the hands of the aficionado. For a 
detailed discussion of protein sequencer maintenance and troubleshooting, see Dunbar 
(1997) and Geisow and Aitken (1989). In this chapter, emphasis is placed on instrument and 
chemical limitations, as well as sample-loading constraints. 


CORE FACILITIES FOR PROTEIN SEQUENCING


To find a laboratory that offers biomolecular analytical services on a commercial basis,
check the searchable yellow pages of the Association of Biomolecular Resource Facilities
(ABRF) Web Site (http://www.abrf.org).


With the advent of mass spectrometric procedures for sequencing small peptides (as 
described in Chapter 8), the use of the automated Edman degradation procedure has 
declined markedly. However, it must be stressed that many biological problems require 
knowledge of the amino or of the carboxyl terminus of a polypeptide and that mass spectrometric
methods for assigning the amino or the carboxyl terminus of a protein or peptide are
limited. 
AMINO-TERMINAL SEQUENCING OF POLYPEPTIDES USING
EDMAN DEGRADATION


The stepwise degradation of peptides was first successfully applied in 1930 by Abderhalden
and Brockmann, who used phenylisocyanate (PIC) as an amino group coupling reagent for
the production of an intermediate that rearranged under acidic conditions, cleaving the
derivatized terminal amino acid from the parent peptide (Abderhalden and Brockmann
1930). This method was extended by the Swedish scientist Pehr Victor Edman, who changed
the coupling agent to phenylisothiocyanate (PITC), a modification that yields a more readily
cyclized intermediate (and hence more easily cleaved amino-terminal amino acid) than
that derived from phenylisocyanate (Edman 1949). For an anecdotal account of the history
of peptide stepwise degradation procedures, see Doolittle (1982).

All current amino-terminal sequencing methods are based on the procedure developed
by Pehr Edman, referred to by the eponym, “Edman degradation” procedure. The characteristics
of this method are that the reaction removes the derivatized amino acid (phenylthiohydantoin
[PTH] amino acid) from the protein, but does not destroy the remaining peptide
chain, so that a sequential degradation of the peptide can be performed. Thus, each amino 
acid of the peptide chain is identified by one cycle of Edman chemistry and one cycle of high- 
performance liquid chromatography (HPLC) to analyze the PTH amino acid. This method 
has been developed in a variety of modes (Edman and Henschen 1975), and eventually in the 
form of an automated sequencing instrument in 1967 (Edman and Begg 1967). Not long 
afterward, the solid-phase sequencer was constructed for the degradation of peptides that 
were covalently attached to a solid support (Laursen 1971). 

The Edman degradation procedure is divided into three steps: coupling, cleavage, and conversion
(Figure 6.2). In the coupling reaction, the Edman reagent PITC chemically modifies the free
amino-terminal α-amino group of a polypeptide to form a phenylthiocarbamyl (PTC)
polypeptide. At pH 9, coupling is favored at α-amino groups and takes place within 15-50 minutes
at a temperature of 40-55°C in a very high yield. Coupling is inhibited by amino-terminal
modifications such as formylation, acetylation, fatty acid acylation, and cyclization of glutamine
residues to form pyroglutamate (Brown and Roberts 1976; Brown 1979; Hirano et al. 1992).

In the cleavage reaction, the PTC amino-terminal residue is rapidly cleaved from the
polypeptide chain with anhydrous acid. This process is facilitated by the proximity of the
nucleophilic sulfur atom of the derivatized amino terminus to the carbonyl carbon of the first
peptide bond to yield a five-membered heterocyclic derivative, an anilinothiazolinone (ATZ)
amino acid, and the n - 1 polypeptide. The shortened n - 1 polypeptide has a reactive amino-
terminal α-amino group, which can undergo another cycle of coupling and cleavage. This
procedure is repeated in an iterative manner.

The solubility of the small, hydrophobic ATZ amino acid is significantly different from
that of the hydrophilic polypeptide and can be extracted selectively by a nonpolar solvent
such as chlorobutane or ethyl acetate. In the third, conversion, step of the cycle, the unstable
ATZ derivative amino acid is converted to a more stable PTH derivative. Conversion is a two-
step reaction and occurs by treatment with aqueous acid. Cleavage and conversion can be
accomplished with just one aqueous acid reaction, but the anhydrous acid optimizes the specific
cleavage of the peptide bond of the amino-terminal amino acid. First, the unstable ring
structure of the ATZ amino acid is opened by aqueous acid and increased temperature
(~60°C) to form a PTC amino acid. Rearrangement then takes place, yielding the more stable
PTH amino acid. It has been suggested (Farnsworth and Steinberg 1993) that the thiohydantoin
is actually formed preferentially when cleavage is carried out in the presence of a
thiol such as dithiothreitol (DTT). The Edman degradation cycle is schematically summarized
in Figure 6.3.
FIGURE 6.2. Procedures for removal of amino- and carboxy-terminal amino acids as thiohydantoins. Both
derivatives may be detected in the UV because of their high absorbance at 268 nm ($\varepsilon_{max}$ ~17,500). Classical
Edman chemistry used phenylisothiocyanate (X = phenyl) for amino-terminal degradations. (Redrawn,
with permission, from Inglis et al. 1995.)


Automation of Edman Degradation Using a Spinning-cup Sequencer 


The repetitive nature of Edman degradation suggested to Edman's technical assistant 
Geoffrey Begg that the procedure could be automated. Edman and Begg produced a proto- 
type protein sequenator in 1961. Automation of the coupling and cleavage steps (Edman and 
Begg 1967; Laursen 1971) enabled researchers to determine the sequence of the first 30-40
amino acid residues in a protein (0.3 µmole) routinely. Figure 6.4 illustrates the instrumental
setup devised by Edman and Begg—the spinning-cup sequencer—which consisted of a sol- 
vent delivery system that delivered solvents and reagents by nitrogen pressure to the reaction 
cartridge (the spinning cup) via an electronically operated valve block. In the spinning-cup 
reaction vessel (Figure 6.5), the protein was held against the inner wall by centrifugal force 
(Edman and Henschen 1975). Reagents and solvents delivered to the spinning cup (by nitro- 
gen pressure) were precisely measured to wet only the protein film. Excess reagents and sol- 
vents were removed by evaporation in vacuo. A pick-up line was installed in an upper groove 
of the cup to remove the waste. One of the early limitations of the Edman sequencer was the 




[Figure 6.3 panel text. Coupling and cleavage: The amino-terminal residue of the protein couples with PITC
to form a PTC amino acid. Acid cleavage removes the amino-terminal residue as an unstable ATZ amino acid.
A shortened peptide with a reactive amino terminus is left. The new amino terminus undergoes the same
cycle. Conversion: Unstable ATZ amino acids are then converted to stable PTH amino acids. Analysis: Amino
acids (e.g., Ala, Leu, Ser, Gly, Thr) are sequentially identified by HPLC.]

FIGURE 6.3. How amino acids are sequentially cleaved in Edman degradation: A conceptual view of the 
Edman degradation chemistry cycle. (Redrawn, with permission, from the handbook, Preparing samples for 
protein sequencing: A newcomer's guide [©The Perkin-Elmer Corporation ].) 


poor performance of short peptides due to sample “wash out” from the reaction vessel. This 
problem was overcome by the introduction of the polymeric quaternary ammonium salt
Polybrene (Klapper et al. 1978; Tarr et al. 1978), which anchored both proteins and peptides, 
thereby allowing the sequencing of even short hydrophobic peptides to completion. The 
Edman spinning-cup sequencer was adapted for a commercial instrument by Beckman Inc. 
in 1969 and was in use until 1980. Edman did not patent his sequenator. 

After coupling and cleavage, the released ATZ amino acids were delivered to a cooled 
fraction collector. ATZ amino acids were manually converted off-line to the PTH derivatives 
in batches. Initially, PTH amino acids were separated and identified by chromatography. For 
this purpose, paper chromatography, thin-layer chromatography (TLC), and partition chro- 
matography, in either a liquid-liquid system or a gas-liquid system, were proposed (Edman 
and Henschen 1975). For a representative TLC separation and identification of PTH amino 
acids, see Figure 6.6. The inherent limitations of the TLC and hydrolytic methods for PTH 
analysis were eventually overcome by the development of gas chromatographic and HPLC 
PTH amino acid separation methods (Zimmerman et al. 1973; Pisano 1975). Later workers 
added devices for automatic conversion of the separated thiazolinones to the PTH amino 
acids (Wittmann-Liebold et al. 1976). These devices were followed by on-line detection of the 
PTHs using HPLC. The performance of the original spinning-cup sequencer (sensitivity 
limit, 0.3 µmole; repetitive yield, 92-95%) is listed in Table 6.1. 
FIGURE 6.4. Diagram of spinning-cup sequencer. (A) Spinning cup; (B) electric motor; (C) reagent (solvent)
reservoir; (D) valve assembly; (E) outlet stopcock assembly; (F) fraction collector; (G) waste container;
(H) nitrogen cylinder; (J) pressure gauges; (K) pressure regulators; (M) 3-way valve; (N) 2-way valve
with bypass; (P) rotary vacuum pump; (Q) bell jar; (R) feed line; (S) effluent line. Gas lines are open, and
liquid lines are filled. (Redrawn, with permission, from Edman and Begg 1967 [©Blackwell Science Ltd.].)


FIGURE 6.5. Spinning-cup reaction chamber. (A) Glass cup; (B) motor shaft extension; (C) cup support;
(D) grub screws; (E) motor support; (F) column; (G) base plate; (H) side tube; (I) feed line; (K) effluent
line; (L) adjustable stand; (M) bell jar; (N) electrodes; (O) rubber padded ring; (P) sealing bolt; (Q) oil
seal; (R) PTFE sleeve; (S) oil reservoir; (T) band heater. A top view of the cup, the feed line, and the
effluent line is shown in the lower right hand corner. (Redrawn, with permission, from Edman and
Begg 1967 [©Blackwell Science Ltd.].)
FIGURE 6.6. Chart showing the TLC of PTH amino acids in systems D (xylene/formamide), E (n-butyl
acetate/propionic acid/formamide), and H (ethylene chloride/acetic acid). PTH derivatives of asparagine
(an), serine (se), methionine sulfone (ms), glycine (g), alanine (a), S-methyl cysteine (mc), valine (v),
proline (p), glutamine (gn), threonine (t), tyrosine (ty), lysine (ly), methionine (m), S-allyl cysteine (ac),
leucine (l), aspartic acid (ap), glutamic acid (gu), tryptophan (tp), S-ethyl cysteine (ec), phenylalanine (ph),
and isoleucine (il). (Redrawn, with permission, from Edman and Henschen 1975 [©Springer-Verlag].)


Microsequencing with a Gas-phase Sequencer 


Automated microsequencing came of age with the release of a commercial instrument, the 
gas-phase sequencer, which used miniaturized components and gaseous coupling and cleav- 
age reactions of the protein immobilized on a glass fiber disc (see Table 6.1) (Hewick et al. 
1981). This instrument revolutionized protein microsequencing by affording an almost 1000- 
fold increase in sensitivity compared with the spinning-cup instrument and, in so doing, 
eliminated most other protein sequencing techniques based on modified Edman reagents 
(see below Alternative Amino-terminal Reaction Methods). Sample wash out was minimized 
by addition of the polymeric quaternary ammonium salt Polybrene to the glass support. The two key reagents, a coupling 
base (triethylamine) and a cleavage acid (trifluoroacetic acid, TFA), which are both capable 
of dissolving proteins/peptides, are delivered to the reaction vessel (e.g., the glass cartridge 
block in Figure 6.7A and the reaction cartridge in Figure 6.7B) in the gaseous phase via a 
stream of argon or nitrogen. By these means, the appropriate pH conditions for coupling and 
cleavage could be achieved without risk of sample wash out from the reaction vessel. Selective 
extraction of reaction by-products and ATZ amino acids (but not protein/peptide) was 
accomplished by judicious choice of organic solvents that were transferred to the reaction 
TABLE 6.1. Advances in amino acid sequence automation

| Instrument | Year | Sample required | Repetitive yield (RY) | Comments and drawbacks |
|---|---|---|---|---|
| Spinning cup (a) | 1970 | 0.3 µmole | 92-95% | Manual conversion only; peptide wash out |
| | 1973 | | | Auto conversion; improved valves |
| | 1978 | | | Polybrene peptide anchor |
| | 1978 | | | HPLC detection of PTH amino acids |
| Gas phase: Applied Biosystems (b) Model 470A | 1981 | 100 pmoles | 92-95% | Miniaturized components; glass fiber support; low-volume reaction vessel; on-line conversion of ATZ amino acids |
| Applied Biosystems Model 477A | 1985 | | | On-line PTH amino acid analysis; 0.5 pmole detection |
| Applied Biosystems Model 49x Procise | | 2 pmoles | ≥94% | x = 1, 2, or 4 depending on the number of cartridges; RY based on L, I, and V with 10-pmole load of β-lactoglobulin; threshold of detection ~250 fmoles; signal/noise (S/N) ratio of 2:1 for PTH alanine; cycle time <35 min; 9-mm reaction cartridge for volumes up to 15 µl |
| Applied Biosystems Model 49x Procise cLC | | 500 fmoles | ≥92% | x = 1, 2, or 4 depending on the number of cartridges; RY based on L, I, and V with 10-pmole load of β-lactoglobulin; threshold of detection ~50 fmoles; S/N ratio of 2:1 for PTH alanine; cycle time <50 min; 6-mm reaction cartridge for volumes ≤7.5 µl |
| Hewlett-Packard Model HPGIOOIA | 1992 | 3-5 pmoles | 97% | Biphasic column reaction vessel; on-line PTH amino acid analysis; 0.5 pmole detection |

(a) Original prototype reported 98% repetitive yield for myoglobin.
(b) Applied Biosystems, Foster City, California. Protocols and other electronic information on protein sequencing from Applied Biosystems can be found
at: http://www.appliedbiosystems.com/apps/techniques.cfm.


vessel in the liquid phase. Extracted ATZ amino acids were automatically converted to their 
stable PTH derivatives following their transfer from the reaction vessel to a conversion flask. 
A combination of the miniaturized glass reaction vessel (cartridge), zero-dead volume and 
inert valves, and especially purified chemicals contributed largely to a user-friendly and high- 
ly reliable instrument that enabled sequencing of less than 100-pmole samples. With the sub- 
sequent addition of an on-line HPLC system and optimized PTH amino acid separation, 
sequence analyses at the 10-pmole level or even lower were readily achievable. With the development
of capillary columns (≤0.32 mm I.D.), the sensitivity of PTH amino acid analysis has
been extended to the low femtomole level (see Figure 6.8) (Moritz and Simpson 1992;
Erdjument-Bromage et al. 1993).


A More-refined Pulsed-liquid-phase Sequencer 


Further refinements of the gas-phase sequencer (simplification of the valve blocks, gas regu- 
lation, software analysis, and reagent/solvent delivery protocols) led to the development of 
the pulsed-liquid-phase sequencer. The basic principles of these two instruments were essen- 
tially the same, the main difference being that the cleavage acid (TFA) was delivered as a liq- 
uid pulse. The amount of acid delivered was precisely controlled, sufficient to wet the sample 
but not enough to wash the sample from the reaction vessel. Liquid-phase cleavage resulted 
in faster cleavage times (i.e., shorter cycle times and accelerated sequencing). Following cleav-
[Figure 6.7B panel text. Conversion: The ATZ amino acid is converted into a stable PTH amino acid.
The PTH amino acid is then automatically injected onto an HPLC. Analysis: Using HPLC, each amino
acid is identified by the elution time of the peak it produces on the chromatogram.]


FIGURE 6.7. Automated Edman sequencing using a gas-phase sequencer. (A) Exploded view of the reaction cartridge system. The 
protein sample is introduced into the reaction cartridge either on a PVDF membrane or glass fiber filter. The membrane/filter is 
sandwiched between the upper and lower glass cartridge blocks, and the entire assembly is placed into the protein sequencer. 
(Redrawn, with permission, from Applied Biosystems Procise Protein Sequencer user's manual.) (B) Coupling, cleavage, and con-
version reactions (see Figure 6.2) all occur automatically within the protein sequencer. The identity and sequence of the cleaved 
amino acids are determined by HPLC. (Reproduced, with permission, from the handbook, Preparing samples for protein sequenc- 
ing: A newcomer's guide [©The Perkin-Elmer Corporation].) (Courtesy of Lynne Zieske, Applied Biosystems.) 
FIGURE 6.8. Separation of 1-pmole amounts of PTH amino acids by reversed-phase capillary liquid
chromatography. Column: 150 x 0.32-mm I.D. Applied Biosystems PTH-C18 (packed by LC Packings).
(Solvent A) 8.3 mM sodium acetate/5% (v/v) tetrahydrofuran (pH 4.1); (Solvent B) acetonitrile. Column
temperature is 55°C and flow rate is 5 µl/min. PTH amino acid notation is shown using the one-letter
code for amino acids; DPTU denotes diphenylthiourea. (Reprinted, with permission, from Moritz and
Simpson 1992 [©Elsevier Science].)

age, the volatile TFA was removed (without loss of sample). With careful optimization of
reaction cycle protocols and reagent/solvent deliveries, cycle times of less than 30 minutes are
achievable (Totty et al. 1992). With the implementation of microcolumn (1.0 mm I.D.) PTH
amino acid analysis technology, these instruments (Applied Biosystems, model Procise
sequencer) can operate in the range of 0.5 pmole and even lower with a repetitive yield of
~95% (see Table 6.1).

Improvements Provided by the Biphasic Column Sequencer 
Although analyses on the gas-phase sequencer showed improved repetitive yields when compared
with the spinning-cup instruments, they did not provide the high repetitive yields
(≥95%) essential for long sequencer runs. To overcome this problem, in the early 1990s,
Hewlett-Packard (now Agilent) introduced an automated sequencer that provided a different
approach for immobilizing protein/peptides for Edman chemistry. The reaction vessel in this
instrument comprises an adsorptive biphasic column: one half of the column contains a
solid hydrophobic support and the other half contains a hydrophilic support (see Figure
6.9A). In a manner analogous to loading a reversed-phase HPLC column, very large volumes
of protein/peptide are applied to the hydrophobic portion of the biphasic sequencer column
under dilute acidic conditions (see Figure 6.9B). The sample is retained on the top of the
column, and any inorganic salts and buffers can be washed away. The hydrophobic and
hydrophilic halves of the column are then reassembled, and the resulting biphasic column is
positioned in the sequencer (see Figure 6.9A, inset). Allowing the flow direction of the
solvents to be reversed minimizes sample wash out from the column. Aqueous solvent flows are
directed toward the hydrophobic half of the column, which immobilizes the sample via
hydrophobic interactions. Alternatively, organic solvent flows are directed toward the
hydrophilic half of the column, which retains the sample when organic solvents are employed
to elute hydrophobic contaminants/reaction products (e.g., diphenylthiourea [DPTU] or the
ATZ amino acid). The addition of nonvolatile alkylamines to the biphasic reactor column in
this sequencer allows state-of-the-art repetitive yields (>98%), presumably by both ensuring
that the protein amino group is uncharged for the coupling reaction and competing with the
amino terminus for blocking species in the reaction system (see Figure 6.2). For a description
of sample loading onto the biphasic column reaction vessel, see Protocol 1.


SEVERAL RATE-LIMITING STEPS ARE ASSOCIATED WITH SEQUENCE ANALYSIS 


A number of considerations must be borne in mind before engaging in protein/peptide 
sequencing, especially when dealing with low-picomole levels of material, including sample 
purity, chemical limitations (i.e., contaminants that either block the amino terminus of a
protein/peptide or inhibit the Edman degradation procedure), and sample loading conditions.


Sample Preparation (Purity) 


Sample purity is one of the rate-limiting steps for successful protein sequencing (for a review,
see Simpson and Nice 1989). Samples should be homogeneous with respect to the polypeptide
chain, i.e., contain one protein component only; sample microheterogeneity due to
differential glycosylation (such samples often electrophorese as a broad band on SDS-PAGE or
a broad peak on reversed-phase HPLC) or phosphorylation will not interfere with amino-terminal
sequence analysis. However, under normal operating conditions, the PTH derivatives
of glycosylated or phosphorylated amino acids are not extracted from the reaction vessel, and
these cycles appear as "blank cycles." Analysis of sites of protein phosphorylation using
Edman degradation is best performed using a solid-phase sequencing approach (see Aitken
and Learmonth 1997 and below for protocols). Likewise, a blank cycle is observed with
cysteine residues unless the sample is alkylated (e.g., using iodoacetic acid, iodoacetamide, or
vinyl pyridine) prior to sequencing; in this situation, cysteine is detected as the S-alkylated
PTH cysteine. In some cases, seemingly homogeneous samples prepared by HPLC or SDS-PAGE
yield ambiguous sequencing results. For example, more than one sequencing signal
will be produced if samples contain more than one protein component or if a single protein
has been proteolytically cleaved during purification but is held together by disulfide bonds.
For this reason, samples should be reduced with DTT and the cysteines alkylated with a
suitable alkylating agent prior to Edman degradation. If the multiple components in the sample
are present in nearly equivalent amounts, the sequence of any one component is almost
FIGURE 6.9. Biphasic sequencer column. (A) Exploded view of the biphasic column. (Inset) Insertion of
the biphasic column into the sequencer. (Redrawn, with permission, from The Agilent model G100A
Protein Sequencer manual.) (B) Protein sample preparation using the biphasic column prior to sequencing.
The protein sample is applied to the top of the hydrophobic component of the reaction cartridge, which
resembles a reversed-phase column. Large volumes can be loaded; the sample is retained on the top of the
column and excipients such as buffer salts, which otherwise interfere with the Edman chemistry, are washed
away. After sample loading, the hydrophobic and hydrophilic components of the reaction cartridge are
reassembled and inserted into the sequencer. For a full description of sample loading onto the biphasic
column sequencer, see Protocol 1. (Redrawn, with permission, from Hewlett-Packard HP G1005A advertising
literature.)
impossible to decipher. However, if the molar ratio of major to minor components is
significantly different, then it is possible to distinguish among the primary, secondary, and even
tertiary components in the mixture (Simpson and Nice 1984).


AMINO-TERMINAL BLOCKING 


Various blocking reagents might render the amino group of a peptide/protein unreactive
to the Edman degradation procedure. Partial amino-terminal blockage of a sample during
purification lowers the initial sequencing yield and hence the amount of sequence that can
be obtained. Important blocking reagents are aldehydes, which tend to form Schiff bases
with amino groups. Precautions must therefore be taken to remove traces of aldehydes,
which occur universally, from solvents and reagents used during purification (and,
especially, the sequencer procedure). Cyanate is another common contaminant that may
react with amino termini of peptides/proteins. (Note: Ammonium cyanate is generated in
significant quantities in concentrated alkaline urea solutions that are often used during
protein purification.) For methods by which amino-terminal modifications can occur, see
http://www.urmc.rochester.edu/research/propep/microseq.html.


Chemical Contaminants 


Chemical contaminants that interfere with the Edman degradation must be removed from the
sample prior to loading onto the sequencer. This is particularly important when using the
pulsed-liquid instruments; in the case of biphasic column instruments, many contaminants can
be readily eliminated during the sample loading process (see below). Sample contaminants that
interfere with the conditions required for efficient Edman degradation include the following:


* Reagents that cause destruction of or side reaction with the Edman reagent PITC. For
example, large amounts of primary amines will compete with the peptide/protein for PITC
and will inhibit or prevent the coupling reaction. PITC is also destroyed by oxidizing
reagents.


* Contaminants that affect the coupling pH (e.g., buffer salts with pKa values of <8.0). Such
reagents disturb the coupling step by lowering the pH. This results in inefficient coupling
and hence poor repetitive yields. Because buffer salts are not very soluble in the organic
solvents used in sequencing instruments, they remain on the sample filter (especially in
pulsed-liquid instruments) throughout the entire sequencing run.


* Reagents that affect the cleavage reaction by neutralizing the cleavage acid (TFA). For
example, large quantities of buffer or other salts in the sample that remain on the sample
support may inhibit the cleavage reaction by neutralizing the cleavage acid TFA. This
results in incomplete cleavage and hence poor repetitive yields.


In addition to chemical contaminants that interfere with the Edman degradation, reagents 
that interfere with the reversed-phase HPLC analysis of PTH amino acids must be avoided, 
including any UV-absorbing (Aj) compounds that can be recovered from the sequencer 
along with the PTH amino acids. Such compounds (e.g., primary amines that react with PITC) 
can complicate the PTH amino acid chromatograms by obscuring the PTH amino acid peaks. 

A list of common reagents that can interfere with the Edman chemistry and that
should be removed from samples prior to sequencing is shown in Table 6.2 (this list is by no
means exhaustive); sample attributes for successful sequencing are given in Table 6.3.

It should be remembered that microsequencing is particularly sensitive to
any factor that causes destruction of PITC, alteration of the pH of either the coupling or
cleavage reaction, or blockage of the α-amino group of the amino-terminal amino acid.
TABLE 6.2. Reagents that interfere with sequencing

| Reagent | Comments |
|---|---|
| Glycerol or sucrose | These reagents are often added to purified samples for storage and handling purposes. |
| SDS | SDS may cause instrument blockage (and hence malfunction) and also loss of sample from the sample support (especially in the pulsed-liquid instrument). |
| Nonionic detergents | Common nonionic detergents such as Triton X-100, Tween, and Brij often contain aldehydes, oxidants, and other contaminants (e.g., UV-absorbing substances) that can interfere with the Edman chemistry and reversed-phase HPLC analysis of PTH amino acids. |
| Ammonium sulfate, other ammonium salts, and guanidine salts | These salts are often used for concentrating and fractionating proteins (see Chapter 4). |
| Buffers and primary amines | Glycine, Tris, and ethanolamine. Tris and glycine buffers are frequently used for protein purification and are common in samples recovered from SDS-PAGE. |

Reagents that interfere with protein sequencing must be avoided or removed from samples prior to sequencing (see Protocol 1 for the biphasic column
sequencing instrument, and Protocol 6 for pulsed-liquid instruments). Where possible, avoid dialysis as a last step in a sample preparation since dialysis
tubing is often a source of contaminants that interfere with the Edman chemistry (also, sample recoveries from dialysis are often unacceptably low). If
dialysis cannot be avoided, use thoroughly cleaned (follow manufacturer's instructions), high-quality tubing (e.g., Spectropor) and perform the dialysis in the
presence of a low-concentration counterion salt or acid (otherwise the contaminants will remain adsorbed to the protein and dialysis membrane). When
a sample is electroblotted onto a PVDF-type membrane for sequencing (see Protocol 2), contaminating reagents can be easily washed away from the
immobilized sample with H2O and 50% aqueous methanol. Similarly, samples that are applied to a biphasic sequencer reaction column or to the sample
membrane prior to analysis using a gas-liquid-phase sequencer are immobilized, which permits removal (by washing) of excipient compounds that otherwise
interfere with the sequence analysis.
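
The entries of Table 6.2 lend themselves to a simple pre-submission check of a sample's buffer composition. The following Python fragment is a minimal sketch only: the dictionary `INTERFERING_REAGENTS`, the function `flag_interfering`, and the substring matching are hypothetical conveniences introduced here for illustration, and the list is no more exhaustive than the table itself.

```python
# Hypothetical lookup table assembled from the entries of Table 6.2.
# Keys are lowercase name fragments; values summarize the stated problem.
INTERFERING_REAGENTS = {
    "glycerol": "storage additive listed in Table 6.2",
    "sucrose": "storage additive listed in Table 6.2",
    "sds": "may block the instrument and wash sample off the support",
    "triton": "nonionic detergent; may contain aldehydes, oxidants, UV absorbers",
    "tween": "nonionic detergent; may contain aldehydes, oxidants, UV absorbers",
    "brij": "nonionic detergent; may contain aldehydes, oxidants, UV absorbers",
    "ammonium": "ammonium salt listed in Table 6.2",
    "guanidine": "guanidine salt listed in Table 6.2",
    "glycine": "primary amine; competes with the sample for PITC",
    "tris": "primary amine; competes with the sample for PITC",
    "ethanolamine": "primary amine; competes with the sample for PITC",
}


def flag_interfering(components):
    """Return {component: reason} for every component matching a table entry.

    Matching is a crude case-insensitive substring test (an assumption of
    this sketch), so unusual reagent names should be checked by hand.
    """
    flagged = {}
    for name in components:
        key = name.strip().lower()
        for fragment, reason in INTERFERING_REAGENTS.items():
            if fragment in key:
                flagged[name] = reason
                break
    return flagged


if __name__ == "__main__":
    sample_buffer = ["Tris-HCl", "acetonitrile", "0.1% SDS"]
    for component, reason in flag_interfering(sample_buffer).items():
        print(f"{component}: {reason}")
```

Note that a flagged component is not necessarily disqualifying; Table 6.3 indicates that small quantities of some detergents can be tolerated.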


TABLE 6.3. Sample attributes for successful sequencing

| Attribute | Requirements |
|---|---|
| Sample purity | Peptide/protein must be amino-terminally homogeneous. The interpretation of PTH amino acid profiles obtained from homogeneous samples is usually straightforward, with a unique PTH amino acid being detected at each cycle. For impure (or partially digested) samples, several PTH amino acid derivatives can be displayed at each cycle, thereby making interpretation of the original (or major) polypeptide chain very difficult, or impossible. Even a sample ratio of 2:1 makes correct amino acid assignment uncertain (this is essentially due to variable yields of many of the PTH amino acids). |
| Sample volume | Ideally, the sample volume should be ≤150 µl. This is particularly important for the pulsed-liquid sequencers where the sample is loaded onto a glass fiber disk that can only absorb 30 µl of liquid (typically, multiple 30-µl aliquots are applied with drying of the disk between each aliquot). Sample volume is not a limiting factor when using the Agilent biphasic column sequencer (see Protocol 1) or the Applied Biosystems ProBlott apparatus in conjunction with the pulsed-liquid sequencer. |
| Sample buffer | The sample should be in a volatile solvent (e.g., H2O, acetic acid, acetonitrile, propanol, formic acid, trifluoroacetic acid, or triethylamine) or buffer (e.g., ammonium bicarbonate if lyophilized repeatedly). The sample can contain a small quantity of detergent (e.g., 0.01-0.02% Tween-20 or ≤0.1% SDS). Care should be taken with SDS since larger amounts can wash the sample out of the reaction vessel and cause disruption of reagent/solvent delivery in the instrument. All solvents and reagents used in protein sequencing should be of the highest purity available (e.g., HPLC-grade solvents, "sequencing-grade" and "electrophoresis-grade" reagents). |
| Sample amount | Although most modern sequencing instruments can sequence 0.5-1.0 pmole of sample, it is recommended that a minimum of 10-50 pmoles of sample be analyzed to ensure that the sequence interpretation is correct or to determine whether the sample is amino-terminally blocked if no sequence is obtained. Invariably, the amount of sequenceable material is underestimated due to sample loss (e.g., irreversibly "sticking" to plasticware during manipulation and storage [a]), inaccurate quantitation, or amino-terminal blockage during isolation/purification. It is always preferable to err on the side of too much sample (e.g., 50% more than required for analysis) than too little sample. Sample quantitation can be estimated from silver- or Coomassie Blue-stained gels, micro BCA, or absorbance. For samples prepared by microbore reversed-phase HPLC, sample amounts can be estimated by their absorbance at 214 nm (e.g., 100 mAU on a 2.1-mm I.D. column is equivalent to ~1 µg and 100 mAU on a 4.1-mm I.D. column is equivalent to ~10 µg). |

(a) It is recommended that liquid samples be stored in the presence of 0.01-0.02% Tween-20 in tightly capped polypropylene tubes (e.g., microfuge tubes)
and samples isolated by reversed-phase HPLC not be dried down, but stored in their collected state (e.g., aqueous TFA/acetonitrile) at -20°C until analyzed.
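
The volume and amount guidelines in Table 6.3 reduce to a few lines of arithmetic. The Python sketch below is illustrative only: the function names are hypothetical, the mAU-to-microgram figures are the rules of thumb quoted in the table, and the assumption that mass scales linearly with peak height is ours, not a calibrated relationship.

```python
import math

# Rule-of-thumb values quoted in Table 6.3: micrograms of protein per
# 100 mAU at 214 nm, keyed by column I.D. in mm (as printed in the table).
UG_PER_100_MAU = {2.1: 1.0, 4.1: 10.0}

# Volume that a glass fiber disk can absorb in one application (Table 6.3).
DISK_CAPACITY_UL = 30.0


def estimate_mass_ug(peak_mau, column_id_mm):
    """Rough protein mass in a peak, assuming linear scaling with mAU."""
    return peak_mau / 100.0 * UG_PER_100_MAU[column_id_mm]


def ug_to_pmol(mass_ug, molecular_mass_da):
    """Convert micrograms to picomoles (1 ug of a 1-Da species = 1e6 pmol)."""
    return mass_ug / molecular_mass_da * 1e6


def aliquots_needed(sample_volume_ul):
    """Number of disk applications, with drying between each aliquot."""
    return math.ceil(sample_volume_ul / DISK_CAPACITY_UL)


if __name__ == "__main__":
    mass = estimate_mass_ug(50, 2.1)      # 50-mAU peak from a 2.1-mm column
    print(mass, "ug")
    print(ug_to_pmol(mass, 25_000), "pmol for a 25-kD protein")
    print(aliquots_needed(150), "aliquots for a 150-ul sample")
```

The resulting picomole figure can then be compared with the recommended 10-50 pmole minimum before committing the sample to the sequencer.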
HOW SEQUENCER PERFORMANCE HAS IMPROVED 


In the 50 years since Edman’s first report of a manual procedure for the amino-terminal 
sequencing of peptides, there has been an almost 10,000-fold increase in sensitivity (com- 
pare the ~10 nmoles starting material required in 1950 to the ~1 pmole required today). 
This has been largely due to four factors: 


1. Automation drastically reduced sample loss, an inherent problem with any manual pro- 
cedure. 


2. Increased sensitivity of PTH amino acid analysis (e.g., compare modern HPLC, especially
the use of microcolumns [see Figure 6.8], with the original TLC method for PTH
amino acid detection that was sensitive to ~1 nmole of PTH derivative in a spot).


3. Miniaturization of instrumentation. 
4. Reagent/solvent quality. 


Collectively, these factors influence the overall quality of sequence analysis. The initial 
yield and the repetitive yield, together with the sensitivity limit of the instrument, deter- 
mine the amount of sequence that can be obtained. 


* Initial yield is a measure of the percentage of the total sample loaded onto the instrument
that can be sequenced (typically 50-80% unless the amino terminus is blocked).
The reasons for incomplete initial yields are poorly understood, but factors such as sample
preparation (e.g., proteins can be amino-terminally blocked by exposure to poor-quality
reagents/solvents that contain aldehydes during purification), the sequencer itself, and
the reaction parameters of the instrument clearly influence initial yield.


* Repetitive yield is the percentage of sequence detected after each cycle of the Edman 
reaction. This value is measured by a linear regression fit to the amount of PTH amino 
acid at each cycle. Extrapolation to cycle 0 gives the initial amount of sequenceable pep- 
tide. Whereas highly skilled technicians using the manual Edman degradation procedure 
achieved repetitive yields of up to 90%, >95% repetitive yields could be readily achieved 
after automation. It can be calculated that an average repetitive yield (i.e., from one cycle 
to the next) of 90% and 95% would allow 20 and 40 residues, respectively, to be identi- 
fied from a 10-pmole peptide before reaching a sensitivity limit of 1 pmole (lowest 
amount of readable sequence). However, with a 100-pmole sample load, 40 and 90 cycles 
are allowed with instruments performing at 90% and 95% repetitive yield, respectively 
(Figure 6.10). Thus, the higher the repetitive yield, the longer the sequences that can be 
determined before background peaks generated by side reactions, spurious cleavage of 
the peptide chain during sequencing (especially acid-labile peptide bonds such as Asp- 
Pro), and out-of-frame sequences (generated by incomplete cleavage or coupling reac- 
tions, referred to as the “lag”) obscure the PTH amino acid peak derived from the 
amino-terminal residue. 

In practice, 20-25 cycles of sequence can be expected in most cases from 1 to 5 ug of 
sample, but occasionally 60 residues or more can be assigned for large peptides using 
instruments performing at >96% repetitive yield (Figure 6.11). 
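
The cycle numbers quoted above follow from simple exponential decay of the signal. As a rough check, the Python sketch below assumes that the signal after n cycles is load x RY^n and ignores initial yield, lag, and background; the function name `readable_cycles` is introduced here for illustration only.

```python
import math


def readable_cycles(load_pmol, repetitive_yield, limit_pmol=1.0):
    """Whole cycles completed before the signal drops below limit_pmol.

    Assumes signal(n) = load_pmol * repetitive_yield**n, i.e., a constant
    repetitive yield and a 100% initial yield (both simplifications).
    """
    if not 0.0 < repetitive_yield < 1.0:
        raise ValueError("repetitive_yield must be a fraction between 0 and 1")
    if load_pmol <= limit_pmol:
        return 0
    # Solve load * RY**n = limit for n and round down to a whole cycle.
    n = math.log(limit_pmol / load_pmol) / math.log(repetitive_yield)
    return math.floor(n)


if __name__ == "__main__":
    for load in (10, 100):
        for ry in (0.90, 0.95):
            cycles = readable_cycles(load, ry)
            print(f"{load} pmol at {ry:.0%} repetitive yield: {cycles} cycles")
```

Under these assumptions, the calculation gives 21 and 44 cycles for a 10-pmole load and 43 and 89 cycles for a 100-pmole load, in line with the rounded figures of 20, 40, 40, and 90 given in the text.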


Repetitive yield (RY), expressed as a percentage, is calculated as follows: $\mathrm{RY} = 10^{M} \times 100$.
For a multiple-point calculation, $M$ is the slope of the linear regression line calculated from a set
of points [log (pmole) vs. cycle].
Calculation of Repetitive Yield (RY)

\[ \mathrm{RY} = 10^{M} \times 100 \]

For a point-to-point calculation without background correction:

\[ M = \frac{\log_{10}(Y_2) - \log_{10}(Y_1)}{X_2 - X_1} \]

For a point-to-point calculation with background correction:

\[ M = \frac{\log_{10}(Z_2) - \log_{10}(Z_1)}{X_2 - X_1} \]

where

$Y_2$ = pmole value (uncorrected) of 2nd point
$Y_1$ = pmole value (uncorrected) of 1st point
$X_2$ = cycle number of 2nd point
$X_1$ = cycle number of 1st point
$Z_2$ = pmole value (background corrected) of 2nd point
$Z_1$ = pmole value (background corrected) of 1st point
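
The repetitive yield calculation can be sketched in a few lines of Python. This is an illustrative implementation under stated assumptions, not the software supplied with any sequencer: the function names are hypothetical, the regression is an ordinary least-squares fit of log10(pmole) against cycle number, and the example data are synthetic.

```python
import math


def repetitive_yield(cycles, pmoles):
    """Multiple-point calculation.

    Fits log10(pmole) vs. cycle by least squares and returns
    (RY in percent, pmole extrapolated to cycle 0).
    """
    n = len(cycles)
    logs = [math.log10(p) for p in pmoles]
    mean_x = sum(cycles) / n
    mean_y = sum(logs) / n
    sxx = sum((x - mean_x) ** 2 for x in cycles)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(cycles, logs))
    slope = sxy / sxx                      # M in the formulas above
    intercept = mean_y - slope * mean_x    # log10(pmole) at cycle 0
    return 10 ** slope * 100, 10 ** intercept


def point_to_point(x1, value1, x2, value2):
    """Point-to-point calculation.

    Pass uncorrected values (Y) or background-corrected values (Z).
    """
    m = (math.log10(value2) - math.log10(value1)) / (x2 - x1)
    return 10 ** m * 100


if __name__ == "__main__":
    # Synthetic yields of a stable residue recovered at four cycles.
    cycles = [2, 5, 9, 14]
    pmoles = [8.0 * 0.95 ** c for c in cycles]
    ry, start = repetitive_yield(cycles, pmoles)
    print(f"RY = {ry:.1f}%, sequenceable peptide at cycle 0 = {start:.1f} pmol")
```

Dividing the extrapolated cycle-0 amount by the amount loaded onto the instrument gives an estimate of the initial yield.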
FIGURE 6.11. Amino acid sequence analysis of heat shock protein 60, from a reducing two-dimensional (2D) gel electrophoresis
of LIM 1215 cells. Protein spots (such as the one indicated by the arrow on the gel) from three identical 2D gel/PVDF electroblots
were combined for sequence analysis using the biphasic-column sequencer. The PTH amino acids are indicated by the one-letter
notation used for amino acids. (Reproduced, with permission, from Ji et al. 1997 [©Wiley-VCH].)


HPLC AND PAGE ARE USED TO PREPARE SAMPLES FOR MICROSEQUENCING 


The two high-resolution techniques that are commonly used to prepare samples for microse- 
quencing, especially trace-abundant proteins obtainable at low-nanogram amounts only, are 
high-performance liquid chromatography (HPLC) and polyacrylamide gel electrophoresis 
(PAGE) (for details of these methodologies, see Chapter 5 and Chapter 4, respectively). 
Whereas reversed-phase (RP)-HPLC is usually considered applicable at any stage in a purifi- 
cation protocol, electrophoresis is usually reserved for the last purification step. 


RP-HPLC 


With the steadily growing variety of reversed-phase, ion-exchange, and affinity chromatog- 
raphy supports available for HPLC, the task of the protein chemist in purifying trace-abun- 
dant proteins is becoming much easier. Of these, RP-HPLC remains the method of choice for 
most separation strategies, particularly for peptides and low-Mr (<40,000) hydrophilic
proteins. The separation of proteins and peptides by reversed-phase supports is based
predominantly on the differential hydrophobic interactions between amino acid side chains on the
protein or peptide and the functional groups on the column support. For details on typical
column packings used in peptide/protein separations, see Chapter 5. The following are
typical chromatographic conditions that could be applied, as an initial approach, for the
separation of proteins and peptides:

* Brownlee RP-300 column, made of dimethyloctyl silica at 10-µm particle size and 300 Å
pore size.
* Linear 60-minute gradient from 0.1% aqueous TFA to 60% acetonitrile/40% H2O
containing 0.089% TFA.
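
For planning fraction collection, it can help to know the nominal acetonitrile concentration at a given time in the gradient described above. The short Python sketch below assumes an ideal linear gradient starting at 0% acetonitrile and ignores the gradient delay volume of the chromatograph; the function name and defaults are illustrative only.

```python
def acetonitrile_percent(t_min, duration_min=60.0, final_percent=60.0):
    """Nominal % acetonitrile (v/v) at time t_min of a linear gradient.

    Assumes the gradient runs from 0% to final_percent over duration_min
    and holds at the end points outside that window.
    """
    t = min(max(t_min, 0.0), duration_min)
    return final_percent * t / duration_min


if __name__ == "__main__":
    for t in (0, 15, 30, 60):
        print(f"{t:>2} min: {acetonitrile_percent(t):.0f}% acetonitrile")
```

With these conditions, the gradient rises by 1% acetonitrile per minute.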
Several options are available for improving RP-HPLC separations, including changing 
the mobile-phase solvent and pH, and, especially in the case of peptide separation, switching 
to a different column packing. Selected examples of these options are given in Chapter 5 and 
Chapter 7. 

Recently, there has been an increasing awareness of the need for faster chromatography,
especially for large-scale, high-volume sample throughput. This has led to the development
of polystyrene divinylbenzene-based polymeric supports such as the perfusive support (see
Chapter 5). These supports have very large pore structures (8000 Å) that transect the
particle and confer two distinct advantages: (1) increased surface area, and hence higher mass
loading, and (2) minimization of slow diffusive flow, which permits rapid partitioning, and
hence resolution, of proteins and peptides within the supports over a wide range of high flow
velocities (1000-9000 cm/hour). It should be noted that for micropreparative-scale
purifications, fast chromatographic analysis of proteins and peptides can also be performed
using conventional silica-based supports and standard liquid chromatographs (Moritz et al.
1994, 1995).
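
To relate the quoted linear velocities to pump settings, the velocity can be multiplied by the cross-sectional area of the column. The sketch below assumes an empty-tube (superficial) velocity, so packing porosity is ignored, and it uses a 4.6-mm I.D. column purely as an illustration; neither assumption comes from the text.

```python
import math


def flow_rate_ml_per_min(velocity_cm_per_h, column_id_mm):
    """Volumetric flow (ml/min) for a superficial linear velocity in a column of given I.D."""
    radius_cm = column_id_mm / 10.0 / 2.0
    area_cm2 = math.pi * radius_cm ** 2
    # cm/h times cm^2 gives cm^3/h (ml/h); divide by 60 for ml/min.
    return velocity_cm_per_h * area_cm2 / 60.0


if __name__ == "__main__":
    for velocity in (1000, 9000):
        rate = flow_rate_ml_per_min(velocity, 4.6)
        print(f"{velocity} cm/hour in a 4.6-mm I.D. column: {rate:.2f} ml/min")
```

For the illustrative 4.6-mm column, the lower end of the range corresponds to a little under 3 ml/min.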


Microbore column chromatography enhances recovery and 
detection of picomole amounts of protein 


The ability to purify and subsequently manipulate (e.g., buffer-exchange, concentrate, reduce
and alkylate, and fragment) material at low-microgram (i.e., picomole) levels, and in
sufficiently small volumes for direct loading onto Edman chemistry sequencing instruments and
mass spectrometers, presents a considerable technical challenge. Moreover, samples purified
by conventional high-resolution procedures, although homogeneous with respect to
proteinaceous material, frequently contain excipients that impede the performance of these
instruments (e.g., buffer salts and detergents). One chromatographic approach for
concentrating (or trace-enriching) samples derived from the most widely used HPLC columns
(i.e., 4.6 mm I.D.) employs short microbore columns (e.g., 30 x 2.1 mm I.D.) (for a review,
see Simpson et al. 1989).

The principles and operation of microbore column chromatography are covered in detail 
in Chapter 5. Microbore HPLC provides two key advantages for preparing samples for 
microsequence analysis: 


* The ability to concentrate and subsequently recover material in small peak volumes suit- 
able for direct loading onto the gas-phase/pulsed-liquid sequencing instruments. 


* An increased level of sensitivity that enhances protein detection, thereby permitting low- 
level analytical studies to be undertaken. 
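
The sensitivity advantage listed above can be estimated from column geometry. The following sketch assumes an idealized case: the same mass is injected, the columns are run at the same linear velocity, and peak width in time is unchanged, so that peak concentration scales with the inverse of the column cross-sectional area. These assumptions, and the function name, are illustrative only.

```python
def concentration_gain(conventional_id_mm, microbore_id_mm):
    """Idealized gain in peak concentration when moving to a narrower column.

    Assumes equal injected mass, equal linear velocity, and equal peak width in
    time, so the peak volume scales with the column cross-sectional area.
    """
    return (conventional_id_mm / microbore_id_mm) ** 2


if __name__ == "__main__":
    gain = concentration_gain(4.6, 2.1)
    print(f"4.6 mm -> 2.1 mm I.D.: about {gain:.1f}-fold higher peak concentration")
```

For the 4.6-mm and 2.1-mm columns mentioned in the text, this idealized estimate is close to fivefold; real gains depend on column efficiency and extracolumn dispersion.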


Microbore column trace enrichment of samples derived from conventional columns
avoids the poor recoveries associated with classical concentration procedures such as
lyophilization and precipitation (e.g., TFA or acetone). RP-HPLC-derived samples usually
contain sufficient quantities of secondary solvent (e.g., acetonitrile) to prevent their
retention on a similar support; thus, samples require severalfold dilution prior to injection.
This is readily accomplished either by diluting the sample two- to threefold with primary
solvent in the sample syringe immediately prior to injection or by repetitive injection of
small aliquots (one tenth of loop volume), where dilution occurs in the precolumn hydraulics
of the instrument.
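
The effect of the two- to threefold dilution on the organic content of the sample is simple arithmetic. The sketch below assumes that the primary solvent contains no organic modifier; the 30% starting value and the 10% target are hypothetical numbers chosen only for illustration.

```python
def organic_after_dilution(organic_percent, fold_dilution):
    """Percent organic solvent after diluting with primary solvent (assumed 0% organic)."""
    if fold_dilution < 1:
        raise ValueError("fold_dilution must be at least 1")
    return organic_percent / fold_dilution


def fold_needed(organic_percent, target_percent):
    """Smallest dilution factor that brings the organic content down to the target."""
    return max(1.0, organic_percent / target_percent)


if __name__ == "__main__":
    start = 30.0  # hypothetical acetonitrile content of a collected fraction (%)
    for fold in (2, 3):
        print(f"{fold}-fold dilution: {organic_after_dilution(start, fold):.1f}% acetonitrile")
    print(f"Dilution needed to reach 10%: {fold_needed(start, 10.0):.1f}-fold")
```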

One of the attractive innovations of the sample-loading procedure for the biphasic column
sequencer is that large-volume samples can be concentrated, and desalted, in a manner
akin to loading a sample onto an RP-HPLC column (see Figure 6.9). Thus, samples containing
excipients such as buffer salts (e.g., 1 ml of 20 mM Tris-HCl buffer containing 1.5 M
NaCl), guanidine, and amine-containing buffers (e.g., 1 ml of buffer consisting of 6 M
guanidine-HCl, 1 M Tris-HCl, and 10 mM EDTA) can be applied directly onto the biphasic
reaction vessel without the need for any prior trace enrichment or desalting; after the
sample is loaded on the sequencer column, these excipients are easily washed away prior to
sequence analysis.


Once an acceptable polyacrylamide-gel-based protocol has been established for separating a
mixture of proteins (see Chapters 2 and 4), a number of critical decisions must be made with
respect to which strategy should be adopted for obtaining sequence information:
amino-terminal sequencing or internal sequencing of peptides (see Figure 6.12). A major
consideration is the losses incurred when electroblotting a protein from a gel onto an
immobilizing matrix such as polyvinylidene difluoride (PVDF). Parameters influencing losses
during electroblotting are the physical characteristics of the membrane (e.g., specific
surface area, pore size distribution, pore volumes, and sequencing solvent/reagent
permeability considerations) (Eckerskorn and Lottspeich 1993). For reviews of commercially
available blotting membranes, the blotting process, and gel electrophoresis-induced protein
modifications, see Simpson et al. (1989), Aebersold (1993), Eckerskorn (1994), Patterson
(1994), and Patterson and Aebersold (1995).

An example of a sequence analysis of a relatively abundant gel-separated protein
(housekeeping protein), heat shock protein 60, following electrotransfer onto a PVDF membrane
(see Protocols 2 and 3) is shown in Figure 6.11. For the above-mentioned reasons, and taking
into consideration the effort and cost involved in obtaining a highly purified protein,
researchers must decide whether to take the risk of attempting direct amino-terminal
sequencing of an electroblotted protein. The alternative strategy outlined in Figure 6.12 is
internal sequence analysis (see Chapters 8 and 9). Although this procedure is more
time-consuming, it is a safer option for a precious sample, especially if it is not known
whether the protein is blocked at the amino terminus. Moreover, an internal sequence strategy
yields higher sequence coverage of the polypeptide chain (cf. amino-terminal sequence
analysis) and increases the likelihood of providing information on posttranslational
modifications.

FIGURE 6.12. Experimental approaches for obtaining sequence information from gel-separated
proteins. For digestion strategies, see Chapter 7; mass-spectroscopic-

[Diagram labels: Electroblot to inert membrane (nitrocellulose or PVDF); Amino-terminal
Sequence Analysis (Edman degradation); Excise stained spot(s) and concentrate by SDS-PAGE;
On-membrane digest; Internal Sequence Analysis (Edman degradation; mass spectrometric
analysis: peptide-mass fingerprinting, tandem mass spectrometry).]


Concentration of gel-separated proteins prior to amino-terminal
sequence analysis 


Empirical observations from a number of proteomics laboratories indicate that concentration
of stained excised gel spots (or bands) prior to electroblotting onto PVDF membranes
is required to achieve high initial sequencing yields. Likewise, concentration prior to in-gel
proteolytic (or chemical) fragmentation (see Chapters 7 and 9) is necessary to obtain high
peptide yields. Preconcentration of protein spots/bands can be accomplished by
reelectrophoresis of the stained gel pieces in a second conventional SDS-polyacrylamide gel
or, alternatively, in a secondary gel matrix in the tip of a conventional Pasteur pipette
(Gevaert et al. 1996) as described in Protocol 4. For this procedure, it is important to
ensure that the total volume of combined gel pieces and equilibration buffer is smaller than
0.5 ml. When handling larger volumes (e.g., 1.2 ml), this procedure can be performed in
400-500-µl batches, and these batches are then combined for further concentration in a single
Pasteur pipette. Alternatively, larger volumes can be handled by similar means using a simple
apparatus described by Gevaert et al. (1995) and Dainese et al. (1997).
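
The batching rule for larger volumes can be sketched as a small calculation. The example below assumes that batches should be equal in size and strictly smaller than 500 µl, which is one reasonable reading of the text; the function name and the equal-split strategy are illustrative assumptions.

```python
def split_into_batches(total_ul, max_batch_ul=500):
    """Split a volume into the fewest equal batches that are each below max_batch_ul."""
    if total_ul <= 0:
        raise ValueError("total_ul must be positive")
    # Integer division plus one guarantees each batch is strictly below the limit.
    n_batches = int(total_ul // max_batch_ul) + 1
    return [total_ul / n_batches] * n_batches


if __name__ == "__main__":
    batches = split_into_batches(1200)  # the 1.2-ml example from the text
    print(f"{len(batches)} batches of {batches[0]:.0f} µl each")
```

For the 1.2-ml example, this gives three batches that fall within the 400-500-µl range recommended above.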

An alternative approach to obtaining sufficient material is to enrich low-abundance
proteins prior to running a 2D gel. For example, this can be achieved by applying a total cell
lysate onto a slab SDS-PAGE gel, excising a region of the gel comprising a narrow Mr range
containing the protein of interest, passively eluting the proteins, and then loading them onto
a high-resolving 2D gel for subsequent separation and microsequence analysis (Ji et al. 1993).
Sequence data can thus be obtained from proteins in the submicrogram range.


PROTEINS THAT WILL NOT SEQUENCE MUST BE DEBLOCKED 


Some proteins naturally undergo posttranslational modification, resulting in amino-terminal
blockage with formyl, acetyl, or pyroglutamyl groups (formation of a pyroglutamyl group
may occur by cyclization of amino-terminal glutamine residues under acidic conditions)
(Brown and Roberts 1976; Brown 1979). This blockage prevents the amino terminus from
coupling with PITC. Empirical estimates from a number of laboratories suggest that 40-50%
of naturally occurring proteins resolved by 2D electrophoresis are amino-terminally blocked.
For example, 50% of the proteins from human myocardial tissue prepared by electroblotting
2D electrophoresis gels were found to be refractory to Edman degradation (Baker et al. 1992),
and ~80% of the soluble proteins from Ehrlich ascites cells are N-acetylated (Brown and
Roberts 1976). If the protein of interest is not naturally blocked, care must be taken to
ensure that chemical blockage of the amino terminus does not occur during sample preparation,
especially during electrophoresis.

TIPS TO AVOID AMINO-TERMINAL BLOCKAGE IN GELS


The following precautions can be adopted to minimize the chemical modification of pro- 
teins in polyacrylamide gels (Moos et al. 1988): 


* Use HPLC-grade glycerol in the sample buffer. Heat sample only to 37°C for 10 minutes.

* Add 0.1-1 mM sodium thioglycolate or 10 mM reduced glutathione to the Laemmli buffer.

* Allow running gel to polymerize overnight and/or preelectrophorese it before adding
the stacking gel (acrylamide monomer can react with free amino termini of proteins).

* Use Towbin buffer (Towbin et al. 1979) if the protein of interest is reduced and
alkylated; do not use a pH 11 electroblotting buffer.


For these reasons, erring on the side of caution is warranted when estimating the quantities
of protein required for sequence analysis by Edman degradation. Although a number of
elegant methods exist for deblocking electroblotted proteins (Wellner et al. 1990; Hirano et
al. 1992, 1993), they have not proven to be very efficient (≤40%) or generally applicable.
Hence, these methods are not recommended for low-abundance proteins.


DEBLOCKING METHODS 

The following deblocking methods are listed by the group(s) they remove. The conditions
apply to proteins in solution or stained protein bands on a PVDF-type membrane (for further
details, see Hirano et al. 1997).

* Formyl group removal. Incubate with 0.6 N HCl for 24 hours at 25°C.

* Amino-terminal acetylserine or acetylthreonine removal. Incubate with TFA for 30
minutes at 60°C in a sealed microfuge tube under nitrogen. Some cleavage of internal
serines and threonines may also occur.

* Amino-terminal pyroglutamic acid removal. Pyroglutamate aminopeptidase (Boehringer
Mannheim or TaKaRa) is reported to remove amino-terminal pyroglutamic acid
from proteins bound to PVDF membranes or adsorbed to glass fiber filters.

* Acetylated amino-terminal amino acid removal. Acylamino-acid-releasing enzyme
(TaKaRa) is reported to remove acetylated amino-terminal amino acids from peptide
fragments recovered in solution from in situ trypsin digestion.

Enzymatic removal of amino-terminal blocking groups is more efficient for some samples
than for others. The efficiency of the deblocking method can sometimes be improved if
the sample is first reduced and alkylated (for reduction/alkylation conditions, see Chapter 9).
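
Because the deblocking methods above are organized by the blocking group they remove, they can be held in a simple lookup table. The following is a minimal sketch; the dictionary keys, the function name, and the abbreviated wording of each entry are illustrative choices, and the conditions themselves are only those listed in the text.

```python
# Conditions are condensed from the list above; keys are hypothetical labels.
DEBLOCKING_METHODS = {
    "formyl": "Incubate with 0.6 N HCl for 24 hours at 25°C.",
    "acetylserine/acetylthreonine": (
        "Incubate with TFA for 30 minutes at 60°C in a sealed microfuge tube "
        "under nitrogen."
    ),
    "pyroglutamic acid": (
        "Treat membrane- or filter-bound protein with pyroglutamate aminopeptidase."
    ),
    "acetylated amino acid": (
        "Treat peptide fragments in solution with acylamino-acid-releasing enzyme."
    ),
}


def suggest_deblocking(blocking_group):
    """Return the listed condition for a blocking group, or a fallback message."""
    key = blocking_group.strip().lower()
    return DEBLOCKING_METHODS.get(key, "No method listed for this blocking group.")


if __name__ == "__main__":
    print(suggest_deblocking("Formyl"))
```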


MICROSEQUENCE ANALYSIS AROUND PHOSPHORYLATION SITES CAN HELP 
DELINEATE SIGNALING PATHWAYS 


Reversible protein phosphorylation is a fundamental intracellular mechanism for transducing
extracellular stimuli to the nucleus. The class of enzymes that phosphorylate proteins (i.e.,
protein kinases) exhibits widely differing preferred substrate specificities (for a review,
see Aitken 1999) and can be broadly classified into groups depending on which amino acid they
phosphorylate (e.g., tyrosine protein kinases and serine/threonine protein kinases). In
general, protein kinases are subject to a variety of regulatory mechanisms such as
phosphorylation/dephosphorylation, second messengers, calcium ions, and the presence of
pseudosequences in their regulatory domain, to name a few (for a review of protein kinases,
see Hunter 1987). To delineate the circuitry of a signaling pathway, it is important to
establish the class of protein kinase(s) responsible for regulating the various protein
components of that pathway. This can be accomplished from knowledge of the amino acid
sequence surrounding a site of phosphorylation.

Analysis of sites of protein phosphorylation (serine, threonine, or tyrosine) can be
accomplished by various procedures such as thin-layer electrophoresis (see Appendix 2),
derivatization of phosphoamino acids to facilitate PTH amino acid identification (e.g.,
phosphoserine to S-ethylcysteine; see Appendix 2), manual Edman degradation of ³²P-labeled
phosphopeptides, mass-spectrometry-based techniques (see Chapter 8), and microsequencing
of phosphopeptides. For a description of these procedures, see Aitken and Learmonth
(1997). Of these methods, phosphopeptide mapping by thin-layer electrophoresis and
solid-phase sequencing of ³²P-labeled phosphopeptides (see Protocol 5) are considered to be
the most sensitive.


THE BEST RECOVERY OF PHOSPHOAMINO ACIDS IS ACHIEVED WITH 
SOLID-PHASE SEQUENCERS 


Solid-phase sequencing is characterized by the removal of the thiazolinone derivatives
of the residues at each cycle of Edman degradation, along with a high percentage of the
³²P-labeled phosphate, by washing with anhydrous TFA. The covalent linkage of the peptide or
protein to the derivatized PVDF membrane supports allows such a strong wash. These membranes
enable the attachment of proteins via side-chain lysines and amino termini (Sequelon-DITC)
or via side-chain carboxyl groups (Sequelon-AA). Sequelon-AA membranes can be used in
sequencing phosphorylated peptides in conventional gas- or pulsed-liquid-phase protein
sequencers. With the use of the Applied Biosystems sequencer, butyl chloride is replaced with
9:1 methanol:H2O (containing 2 mM phosphate), resulting in almost quantitative recovery of
very low levels of radiolabeled phosphate (see Figure 6.13). The procedure for solid-phase
microsequencing of a ³²P-labeled peptide on an Applied Biosystems sequencer is described in
Protocol 5.

FIGURE 6.13. Automated sequencer analysis on PE Biosystems 477 of a phosphopeptide from yeast
14-3-3 labeled with ³²P orthophosphate (plot of ³²P cpm against sequence cycle). The
radioactive protein band from an immunoprecipitate was digested in-gel with trypsin, and the
³²P-labeled phosphopeptide was recovered by microbore HPLC on a 1-mm Vydac C18 column (Dubois
et al. 1997). The material extracted from the sequencer by the aqueous methanol after each
cycle of Edman degradation was measured by Cerenkov counting. The amount of ³²P radioactivity
loaded onto the instrument was ~1000 cpm by Cerenkov counting, and the recovery of 1100 cpm
in the peak by scintillation counting (which gives more than double the efficiency) indicated
a typical recovery of 40-50% at cycle 11. The analysis shows that the threonine residues at
cycles 4 and 7 and the other serines at cycles 8 and 13 are not phosphorylated. (Courtesy of
Alastair Aitken, University of Edinburgh.)
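
The recovery estimate quoted in the caption of Figure 6.13 compares counts made by two methods with different efficiencies. The sketch below shows the arithmetic, assuming that scintillation counting is more efficient than Cerenkov counting by a fixed factor; the factors 2.2 and 2.75 are hypothetical values, chosen only because they bracket the stated 40-50% recovery.

```python
def percent_recovery(loaded_cerenkov_cpm, recovered_scint_cpm, efficiency_ratio):
    """Recovery (%) after converting the Cerenkov load to scintillation-equivalent cpm.

    efficiency_ratio is the assumed scintillation/Cerenkov counting efficiency ratio.
    """
    loaded_scint_equivalent = loaded_cerenkov_cpm * efficiency_ratio
    return 100.0 * recovered_scint_cpm / loaded_scint_equivalent


if __name__ == "__main__":
    for ratio in (2.2, 2.75):  # hypothetical efficiency ratios
        recovery = percent_recovery(1000, 1100, ratio)
        print(f"efficiency ratio {ratio}: recovery about {recovery:.0f}%")
```

An efficiency ratio of exactly 2 would give 55%, so a stated recovery of 40-50% is consistent with scintillation counting being somewhat more than twice as efficient.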


ALTERNATIVE AMINO-TERMINAL REACTION METHODS 


A number of variations on the Edman reagent, PITC, have been proposed, primarily to
increase the analytical sensitivity of the method. Both colored (e.g., dabsyl-isothiocyanate
reagent) (Chang and Creaser 1976) and fluorescent isothiocyanates have been used for specific
purposes, but none have superseded the Edman reagent for routine sequencing purposes,
primarily because they have not provided quantitative coupling, presumably due to either
the increased size of the reagent or deactivation of the isothiocyanate group.


CARBOXY-TERMINAL SEQUENCING METHODS ARE MORE PROBLEMATIC 


Although there were active efforts to develop chemistry for sequential removal of
carboxy-terminal amino acids from a protein some 25 years prior to Edman's first report on
amino-terminal sequencing, advances have lagged badly behind amino-terminal methodology.
Despite a plethora of potentially useful chemical modifications of the carboxy-terminal
amino acid, which include conversions of the carboxyl group to thiohydantoins, carboxylic
acid esters, alcohols, acylureas, isothioureas, azides, and hydrazides, investigators have
found the published procedures difficult (for review, see Inglis 1991). Consequently,
enzymatic methods using carboxypeptidases have provided most of the carboxy-terminal sequence
data obtained over the years. An alternative solution has been to isolate the carboxy-terminal
peptide from a digest of the protein and analyze it using either the amino-terminal
degradation procedure or mass spectrometry.


AUTOMATED CARBOXY-TERMINAL SEQUENCING 


Carboxy-terminal sequence analysis complements mass spectrometry and Edman degradation
for protein characterization. The carboxy-terminal portion of a protein sequence can be
directly analyzed without previous proteolytic cleavage and without identification of the
carboxy-terminal peptide for analysis by mass spectrometry.

The most widely studied method for carboxy-terminal sequencing has been one in which
the thiohydantoin of the carboxy-terminal amino acid is first formed and the adjacent peptide
bond is then cleaved in either acidic or basic medium. This yields an amino acid
thiohydantoin and a shortened protein that may be subjected to a further degradation cycle.
The carboxy-terminal thiohydantoin procedure can be used repetitively, and there are no major
interfering side reactions. This approach has similarities to the Edman reactions for
amino-terminal sequencing, as shown in Figure 6.2. Although the latter employ initial coupling
with "stable" isothiocyanates such as PITC, the carboxy-terminal procedure can use either
thiocyanate salts or organic isothiocyanates.

The reaction mechanisms may differ: the amino group is best coupled at pH 9 (to
remove H+) in a polar environment, whereas the carboxyl group has been found to couple
under anhydrous conditions over a wider range of pH values (from slightly basic to acidic
conditions). Both coupling reactions cause a weakening of the adjacent peptide bond, which
is subsequently cleaved to release relatively stable thiohydantoin derivatives of the terminal
amino acids. The common PTH amino acids (x = phenyl in Figure 6.2) are less polar than
the thiohydantoin amino acids, but they have similar properties, the UV spectra especially
being important because of the high extinction coefficient (ε ~17,500) at ~269 nm that
may be used for identification purposes. This chemistry has been moderately successful in the
commercial carboxy-terminal sequencers developed by Hewlett Packard (now Agilent) and
Perkin Elmer (now Applied Biosystems).


CARBOXY-TERMINAL ANALYSIS CAN BE PERFORMED WITH INCREASED 
SENSITIVITY AND IMPROVED PERFORMANCE 


Bergman et al. (2001) have modified parameters in the protocol for a carboxy-terminal
protein sequencer operated according to the principles of Boyd et al. (1992, 1995). The
methodology has been applied to carboxy-terminal analysis of a broad range of polypeptides
with improved results with respect to standard protocols, particularly regarding sensitivity
and length of degradation, and the ability to sequence through prolines and to perform both
amino- and carboxy-terminal sequence analyses on the same sample.

Dilution of the chemicals and reagents used in the sequencer and during chromatography
(Table 6.4) lowers the background and produces a flatter baseline in the chromatograms
of the alkylated thiohydantoin (ATH) amino acids generated during the degradation cycle.
The sensitivity is thereby increased, allowing the use of 10 pmoles instead of the
manufacturer's recommended 100 pmoles of ATH amino acid standard (Figure 6.14). During
pretreatment with phenylisocyanate (PIC), which blocks the amino groups, 1% PIC in
diisopropylethylamine (DIEA) is used, instead of the recommended 2% PIC. In addition, a
single PIC treatment, instead of the three applications of PIC recommended by the
manufacturer, is sufficient to lower the levels of artifact peaks in the chromatograms and
improve the sensitivity.

For the reversed-phase separation of the ATH amino acids, ethanol is used in both
solvents A and B instead of tetrahydrofuran (THF) (Table 6.4). Ethanol changes the elution
time for most ATH amino acids, and the ethanol concentration is thus a parameter that can be
used to optimize separation and improve resolution. The separation of ATH-Asn and ATH-Gln is
significantly improved, and baseline resolution is achieved for the pairs ATH-Asp/ATH-Glu
and ATH-Trp/ATH-Thr (Figure 6.14) if ethanol replaces THF. Ethanol is also less toxic and
less hazardous than THF (THF generates peroxides and poses a risk of explosion).

Only acetone is used in solvent A to balance the baseline (DIEA is excluded), improving 
chromatographic behavior. Instead of a U-shaped baseline at high sensitivity with DIEA pre- 
sent, a flat baseline is obtained allowing sensitive detection of all ATH amino acids. In addi- 
tion, the column temperature is decreased from 45°C to 40°C, which improves resolution and 
stability of the ATH amino acids, in particular of ATH-Ser. The recovery of ATH-Ser and 
ATH-Thr is further improved by also lowering the temperature of the transfer flask, from 
45°C to 40°C (Table 6.4). 

FIGURE 6.14. Chromatography of ATH standard amino acids (10 pmoles) using the modified protocol
specified in Table 6.4. nmtc denotes naphthylmethylthiocyanate, the major by-product formed during
sequence analysis. Note the resolution for the pairs N/Q, D/E, and W/T that are normally difficult
to separate. (Courtesy of Tomas Bergman, Ella Cederlund, and Hans Jörnvall, Karolinska Institutet.)


TABLE 6.4. Parameters altered with respect to the manufacturer's protocol for improved carboxy-terminal sequencing

Parameter changed                                     Before                                After

Dilution
  (C1*) ATH amino acid standard                       100 pmoles/injection                  10 pmoles/injection
  (C3) N-Methylimidazole/Acetonitrile                 as delivered                          diluted 1:1 with acetonitrile
  (C4) Piperidine thiocyanate/Acetonitrile            as delivered                          diluted 1:1 with acetonitrile
  (C6) Acetic anhydride/Lutidine/Acetonitrile         as delivered                          diluted 1:1 with acetonitrile
  (C8) Bromomethyl-naphthalene/Acetonitrile           as delivered                          diluted 1:1 with acetonitrile
  (C10) Tetrabutylammonium thiocyanate/Acetonitrile   as delivered                          diluted 1:1 with acetonitrile
  (C11) DIEA/Heptane                                  as delivered                          diluted 1:1 with heptane
  2% Phenylisocyanate/DIEA                            according to Applied Biosystems       diluted 1:1 with acetonitrile
                                                      recommendation

Chromatography
  Solvent A                                           3.5% THF in 82.5 mM sodium acetate    7.1% ethanol in 30 mM sodium acetate
                                                      (pH 3.8), DIEA/acetone                (pH 3.8), acetone
  Solvent B                                           18% THF in acetonitrile               7.1% ethanol in acetonitrile

Temperature
  Column                                              45°C                                  40°C
  Transfer flask                                      45°C                                  40°C

*Label according to Applied Biosystems nomenclature.

To minimize the yield of by-products from degradation and alkylation, the sequencer program
is altered to accommodate two additional washing steps with ethyl acetate in each cycle.
With three washes instead of one wash, chromatography is clean with a low background.

A reaction cartridge with a vertical slit for the sequencer membrane, instead of a
horizontal slit, is recommended. The liquid thus flows along the entire length of the
membrane instead of perpendicular to it, resulting in more efficient solvent extraction and
reagent penetration. The increased liquid hold-up time improves the recovery of ATH amino
acids and gives better sensitivity. The results show that this type of cartridge, designed
for amino-terminal sequence analysis of electroblotted samples, further increases the overall
yield (Figure 6.15).

More than 200 proteins and fragments ranging from 10 to 600 residues have been analyzed
using the modified protocol (Bergman et al. 2001). The average length of degradation
is five residues, but extended degradation lengths have also been achieved (Figure 6.16).
Samples are applied to PVDF membranes, either by electroblotting the proteins that have
been separated on polyacrylamide gels or by direct application of polypeptides in solution.
The amounts of protein vary from a few hundred pmoles down to 10 pmoles (Figure 6.17).
The average initial yield is 15% and the average repetitive yield is less than 70%. The
recoveries and lengths of degradation are very much dependent on the protein sequence. In
particular, proline and acidic and hydroxylated residues can stop degradation or lower the
yields. However, using the procedures in Protocol 6, sequencing past proline and
identification of the next residue in the carboxy-terminal sequence are possible and have
been shown with several proline-containing sequences.
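
The quoted yields explain why degradation typically stops after about five residues. The sketch below uses the usual simple model in which the signal at cycle n equals the applied amount times the initial yield times the repetitive yield raised to the power n - 1. The text gives the repetitive yield only as "less than 70%", so using 0.70 here is an optimistic assumption, and the model itself is an idealization that ignores sequence-dependent losses.

```python
def expected_signal_pmol(applied_pmol, cycle, initial_yield=0.15, repetitive_yield=0.70):
    """Idealized amount (pmol) of derivative released at a given degradation cycle."""
    if cycle < 1:
        raise ValueError("cycle numbering starts at 1")
    return applied_pmol * initial_yield * repetitive_yield ** (cycle - 1)


if __name__ == "__main__":
    for cycle in range(1, 7):
        signal = expected_signal_pmol(10, cycle)  # 10 pmoles applied
        print(f"cycle {cycle}: {signal:.2f} pmol")
```

With 10 pmoles applied, the modeled signal falls from 1.5 pmol at cycle 1 to well under 0.5 pmol by cycle 5, which illustrates why the flat, low-background baseline described above matters at this sample level.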


COMBINED AMINO- AND CARBOXY-TERMINAL SEQUENCE ANALYSIS 
CAN MAXIMIZE SEQUENCE INFORMATION 


Chemical sequence analysis of proteins from both ends of each molecule using the same
sample application is an attractive way to maximize the sequence information recovered from
limited amounts of sample, particularly when the sensitivity in carboxy-terminal analysis is
down to the 10-pmole level. This has been tested quite extensively and the following
procedure is recommended. The sample is applied to a PVDF membrane and analyzed in the
amino-terminal sequencer for 5-25 cycles. The membrane is removed from the sequencer
reaction cartridge and treated with PIC to block the α-amino group, followed by transfer to
the reaction cartridge of the carboxy-terminal sequencer. The program begins with an ethyl
acetate wash in the cartridge to remove by-products from the Edman chemistry (mainly
diphenylthiourea). Lysine residues will not be derivatized with PIC since their ε-amino group
is already derivatized with PITC in the Edman degradation, resulting in the formation of
phenylthiocarbamyl (PTC) lysine derivatives. The corresponding ATH-PTC-Lys derivative,
or a breakdown product thereof, elutes close to ATH-Asp and is clearly detected using the
modified protocol. Carboxy-terminal sequence information can regularly be recovered for
four to eight residues (Figure 6.18).

Amino-terminally blocked proteins are common and often acetylated. Sequence analysis
of blocked proteins can be performed using tandem mass spectrometry applied to the
corresponding amino-terminal proteolytic peptide (Jonsson et al. 2000) and by sequencer
degradation after application of deblocking protocols to the intact protein (Gheorghe et al.
1997). In this respect, carboxy-terminal sequence analysis is an attractive alternative,
particularly because carboxy-terminal blocking of proteins is not commonly encountered.




[Figure 6.15: chromatograms for residues 1-4 in panels A and B; x-axis, time (min); the nmtc
by-product peak is labeled.]


FIGURE 6.15. Carboxy-terminal sequence analysis using a standard cartridge with liquid flow perpendicular to the membrane
(A) and a reaction cartridge (intended for amino-terminal analysis of electroblotted samples) with liquid flow along the
membrane (B). The same amount of protein (49 pmoles of bovine serum albumin) was applied from solution in both runs. The
carboxy-terminal sequence is -Thr-Ala-Leu-Ala-OH. (Courtesy of Tomas Bergman, Ella Cederlund, and Hans Jörnvall, Karolinska
Institutet.)

FIGURE 6.16. First ten cycles of carboxy-terminal sequence analysis of a 37-kD protein (400 pmoles) illustrating the improved
chromatography with a flat baseline after application of the modified protocol (Table 6.4). The carboxy-terminal sequence was
analyzed for 15 residues and the part shown is -Asn-Ile-Lys-Lys-Leu-Leu-Phe-Trp-Gln-Lys-OH. (Courtesy of Tomas Bergman, Ella
Cederlund, and Hans Jörnvall, Karolinska Institutet.)

FIGURE 6.17. Carboxy-terminal sequence analysis using 11 pmoles of a 67-kD protein (bovine serum
albumin) illustrating the achievable sensitivity. The sequence shown is -Ala-Leu-Ala-OH. (Courtesy of
Tomas Bergman, Ella Cederlund, and Hans Jörnvall, Karolinska Institutet.)

FIGURE 6.18. Combined amino- and carboxy-terminal sequence analysis of a preparation of an amino-terminally truncated
protein (corresponding to a mixture of 28- and 27-kD polypeptides in amounts of 140 and 60 pmoles, respectively). The sample
first underwent amino-terminal degradation for five cycles (A), followed by carboxy-terminal analysis for an additional five
cycles (B). The combined sequence result is N-Lys-Ser/Pro-Ile/Glu-Gly/Pro-His/Thr- - - - - Glu-Val-Phe-Glu-Asp-OH. The
amino-terminal sequences both had lysine in position 1, whereas positions 2-5 revealed two residues each. For the latter, the
first residue given belongs to the major sequence (28-kD component) and the second residue given belongs to the minor sequence
(27-kD component). (Courtesy of Tomas Bergman, Ella Cederlund, and Hans Jörnvall, Karolinska Institutet.)
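
The mixed amino-terminal result in Figure 6.18 follows a simple notation: at each position, the first residue belongs to the major component and the second to the minor component, and a single residue is shared by both. The following sketch separates such a string into the two sequences; the function name and the string format are illustrative, and assigning residues to the major or minor component in practice relies on the relative signal heights, which this sketch does not model.

```python
def split_mixed_sequence(mixed):
    """Split 'A-B/C-D/E' style notation into (major, minor) sequences."""
    major, minor = [], []
    for position in mixed.split("-"):
        residues = position.split("/")
        # A position with one residue is shared by both components.
        major.append(residues[0])
        minor.append(residues[-1])
    return "-".join(major), "-".join(minor)


if __name__ == "__main__":
    n_terminal = "Lys-Ser/Pro-Ile/Glu-Gly/Pro-His/Thr"
    major, minor = split_mixed_sequence(n_terminal)
    print("28-kD component (major):", major)
    print("27-kD component (minor):", minor)
```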


PROTOCOL Í 


sample Loading onto a Biphasic Column Sequencer 


THE BIPHASIC COLUMN APPROACH FOR IMMOBILIZING PROTEINS and peptides for the Edman
degradation procedure was introduced by Hewlett Packard (now Agilent). The absorptive
biphasic column consists of separate halves: One component is hydrophobic and the other is
hydrophilic. These two components are assembled together to form the reaction cartridge
and are mounted in the sequencer (see Figure 6.9A). The sample is applied to the top of the
disassembled hydrophobic component, which resembles a reversed-phase column (see Figure
6.9B). Thus, as in reversed-phase chromatography (see Chapter 3), very large sample volumes
can be applied; the sample is retained on top of the column and inorganic salts and buffers,
which otherwise interfere with the Edman chemistry, can be readily washed away. After sample
loading, the hydrophobic and hydrophilic column halves are reassembled and inserted
into the sequencer. The biphasic column acts as a reaction cartridge with all Edman reagents
and solvents flowing through. A novel feature of the instrument is that the flow direction of
the solvents can be reversed, helping to prevent sample washout. All aqueous solvent flows
are directed against the hydrophobic half of the column, which retains sample by hydrophobic
interactions. With organic solvent flows, the flow direction is against the hydrophilic half
of the column, which retains the sample, whereas hydrophobic contaminants, by-products,
and products of the Edman procedure (e.g., DPTU and ATZ amino acid derivatives) are eluted
to waste or analysis.


MATERIALS


CAUTION: See Appendix 3 for appropriate handling of materials marked with <!>.


» Reagents

Methanol (HPLC grade) <!>
Trifluoroacetic acid (TFA) (2%) in H2O <!>


» Equipment

Biphasic sequencing column (Agilent)
Nitrogen gas supply (pressure regulator to allow 80 psi) <!>
Polypropylene tube (50 ml) or small beaker
Sample applicator
Sample loading funnel

» Biological Sample 
Sample solution
This solution can be any aqueous solution that allows the sample to bind to a reversed-phase column, i.e., it can
tolerate high salt concentrations but not high concentrations (>10%) of organic solvents.


METHOD 
Wetting the Column


1. Remove the hydrophilic column half (male connector) from the prepared biphasic column
and set it aside.


2. Mate the hydrophobic column half (female connector) to the sample loading funnel.


3. Install the sample loading funnel/column half into the sample loading chamber by pushing
down on the sample loading funnel and turning it clockwise to lock it into the chamber.


4. Fill the sample loading funnel with 1 ml of HPLC-grade methanol.


5. Tighten the sample loader screw top.


6. Turn on the nitrogen (80 psi) long enough to just empty the sample loading funnel, but
not long enough to dry the column packing.


7. Remove the screw top and repeat Steps 4-6, replacing the methanol with 1 ml of 2% TFA
in H2O. If the column packing is allowed to dry, repeat this step.


Preparing and Loading the Sample Solution 


8. Adjust the sample solution to a total volume of 1-5 ml by diluting it with 2% TFA in H2O.
The dilution should be at least 1:1. If the sample solution is 1 ml before dilution, make
sure to save the loading effluent.


9. Place a clean small beaker or 50-ml polypropylene tube under the column to collect the
flowthrough.


10. Remove the screw top from the sample applicator.


11. Fill the sample loading funnel with sample solution. One milliliter of sample solution
should pass through the column within 2 minutes.


Up to 5 ml of sample solution may be loaded at one time, but this will require a longer
loading time.


12. Tighten the screw top and switch on the nitrogen pressure to 80 psi.


EXPERIMENTAL TIP: Save the effluent from Steps 11 and 12 until the sample has been
sequenced.


13. Repeat Steps 10-12 as necessary to load larger sample solution volumes.


14. Once the entire sample has been applied to the column, dry the column with nitrogen at
80 psi for 30 seconds.


15. Remove the screw top. Twist the sample loading funnel counterclockwise to unlock it.


16. Lift out the sample loading funnel and attached column half. Disconnect the funnel from
the hydrophobic column half.


PROTOCOL 2 




Electrotransfer of Proteins from Gels to PVDF Membranes 


ELECTROTRANSFER (ELECTROBLOTTING) OF ELECTROPHORETICALLY RESOLVED PROTEINS to immobilizing
matrices, first described in 1979 by Renart et al. (1979) and Towbin et al. (1979), is a
routine method for immunodetection (western blotting) of proteins (see immunoblotting
section in Spector et al. [1998] and Sambrook and Russell [2001]). For a detailed discussion
of transfer and immobilization optimization methods and protein visualization, see reviews
by Beisiegel (1986) and Tovey et al. (1987). In the mid 1980s, electroblotting was adapted for
the direct amino acid sequence analysis of proteins resolved by one- and two-dimensional (1D
and 2D) PAGE (Vandekerckhove et al. 1985; Aebersold et al. 1986; Bauw et al. 1987; Bergman
and Jörnvall 1987; Matsudaira 1987; Eckerskorn and Lottspeich 1989). The following are key
features of this electroblotting method for direct sequencing using Edman chemistry:


* Sequencing of proteins in complex protein mixtures, especially membrane proteins, is
simplified by coupling to high-resolution protein separation methods such as 1D- and
2D-PAGE.


* Sample handling procedures are minimized due to the direct electrotransfer of acrylamide-
gel-resolved proteins onto an inert membrane.


* Samples can be prepared in a relatively short time, free of excipients such as buffer salts.


* Low-picomole amounts of protein can be handled and sequenced with high yields
(Matsudaira 1987).


In addition to direct sequencing, immobilized samples can be subjected to alternative
characterization methods, such as amino acid composition analysis (Eckerskorn et al. 1988;
Nakagawa and Fukuda 1989; Ploug et al. 1989; Tous et al. 1989), internal amino acid sequence 
analysis following proteolytic cleavage (Aebersold et al. 1987), chemical fragmentation (Scott 
et al. 1988; Jahnen et al. 1990; Patterson et al. 1992), and mass spectrometry (Eckerskorn et 
al. 1992; Strupat et al. 1994). 

The critical step of this method is the quantitative transfer and immobilization of the 
proteins onto a suitable immobilizing matrix (for reviews, see Xu and Shively 1988; Simpson 
et al. 1989; Jungblut et al. 1990; Eckerskorn 1994). This electrotransfer protocol has been 
adapted from Ji et al. (1997). 


MATERIALS 


CAUTION: See Appendix 3 for appropriate handling of materials marked with <!>. 


» Reagents

CAPS (3-[cyclohexylamino]-1-propanesulfonic acid) (2.2% w/v) in H2O
Adjust pH to 11.0 with HCl. <!>

Methanol <!>

Transfer buffer
Mix 1 part 2.2% CAPS, 1 part methanol, and 8 parts H2O. For alternative transfer buffers and tips
on improving electrotransfer of proteins, refer to the panel below.


» Equipment

Electroblotting apparatus
The Trans-Blot (Bio-Rad), XCell II Blot Module (NOVEX, Invitrogen), TE 22 Mighty Small Transphor
Tank Transfer Unit (Hoefer), and Transphor Tank Transfer Unit (Hoefer) are a few examples of
suitable electroblotters.

Plastic containers

Polyvinylidene difluoride (PVDF) membrane

Almost quantitative retention of proteins during electrotransfer from acrylamide gels can be
obtained with PVDF membranes with a high specific surface area (~1500 m²) and narrow pores
(~0.350 µm) such as Immobilon PSQ (Millipore), Trans-Blot (Bio-Rad), and Fluorotrans (Pall,
Dreieich, Germany), whereas PVDF membranes with a relatively low specific area (e.g., Immobilon
P from Millipore, 380 m²) exhibit significantly reduced yields (Eckerskorn and Lottspeich 1993).


» Biological Sample


Polyacrylamide gel (unstained) containing the desired protein samples


Procedures for preparing 1D and 2D polyacrylamide gels, sample loading, and electrophoresis are
described in Chapters 2 and 4, respectively.
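The transfer buffer in the Reagents list is defined by parts (1 part 2.2% CAPS stock, 1 part methanol, 8 parts H2O), so it scales to any batch size. The following is a minimal Python sketch of that arithmetic, not part of the original protocol. The function names are hypothetical, and it assumes that volumes are simply additive on mixing, which is only approximately true for methanol and water.

```python
def transfer_buffer_volumes(total_ml, parts=(1.0, 1.0, 8.0)):
    """Volume (ml) of each component for a batch of total_ml.

    parts = (CAPS stock, methanol, water), i.e., the 1:1:8 recipe.
    """
    if total_ml <= 0:
        raise ValueError("total_ml must be positive")
    caps_p, meoh_p, water_p = parts
    unit = total_ml / (caps_p + meoh_p + water_p)  # volume of one "part"
    return {
        "caps_stock_ml": caps_p * unit,
        "methanol_ml": meoh_p * unit,
        "water_ml": water_p * unit,
    }


def final_concentrations(caps_stock_percent=2.2, parts=(1.0, 1.0, 8.0)):
    """Final CAPS (% w/v) and methanol (% v/v) after mixing by parts."""
    caps_p, meoh_p, water_p = parts
    total = caps_p + meoh_p + water_p
    return {
        "caps_percent_wv": caps_stock_percent * caps_p / total,
        "methanol_percent_vv": 100.0 * meoh_p / total,
    }
```

For a 1-liter batch this gives 100 ml of CAPS stock, 100 ml of methanol, and 800 ml of H2O, corresponding to roughly 0.22% (w/v) CAPS and 10% (v/v) methanol in the working buffer.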


ELECTROTRANSFER BUFFERS 


The choice of electrotransfer buffer is not critical provided ionic strengths similar to those 
of the CAPS buffer in the Materials list are employed. Alternative electroblotting buffers 
include: 


* 50 mM sodium borate (pH 8.0), 0.02% β-mercaptoethanol, 20% methanol (Vandekerckhove
et al. 1985)


* 25 mM N-ethylmorpholine (pH 8.3), 0.5 mM dithiothreitol (Aebersold et al. 1988)
* 25 mM Tris-HCl, 10 mM glycine (pH 8.3), 0.5 mM dithiothreitol (Aebersold et al. 1986)


For the sodium borate buffer system, significantly decreased electrotransfer yields occur if
the ionic strength is reduced below 10 mM or increased above 100 mM (Jungblut et al. 1990).

In SDS-free transfer buffers, high-Mr proteins and hydrophobic proteins such as membrane
proteins are poorly soluble and elute from acrylamide gels in poor yield. The bulk of
these proteins remain precipitated in the gel. Although solubilization of this class
of protein can be improved by the addition of SDS (up to 0.01% final concentration) to the
transfer buffer, this must be balanced against the fact that SDS generally prevents protein
absorption to the immobilizing matrix (especially for hydrophilic and low-Mr proteins).

Because the stability of SDS-protein complexes is influenced by methanol (e.g., such
complexes dissociate more readily at increasing concentrations of methanol, leading to
increased interaction of protein with the immobilizing matrix), varying the amount of
methanol used for different proteins is a useful way to increase the recovery of a target
protein onto a PVDF-type membrane. Typically, PVDF membranes require the presence of
10-20% methanol for optimal protein absorption. For proteins >80,000 daltons, use 5%
methanol; for proteins between 20,000 and 80,000 daltons, use 10% methanol; and for
low-Mr proteins, use 20% methanol.
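The methanol guideline in the panel above amounts to a simple lookup by protein mass. The sketch below expresses it in Python as an illustration only. The function name is hypothetical, the first threshold is read as "greater than 80,000 daltons," "low-Mr" is taken to mean below 20,000 daltons, and the handling of the exact boundary values is an assumption.

```python
def suggested_methanol_percent(mass_da):
    """Suggested methanol (% v/v) in the transfer buffer for a PVDF blot.

    Thresholds follow the panel text; boundary handling is an assumption.
    """
    if mass_da <= 0:
        raise ValueError("mass_da must be positive")
    if mass_da > 80_000:      # large proteins: low methanol aids elution
        return 5
    if mass_da >= 20_000:     # mid-range proteins
        return 10
    return 20                 # low-Mr proteins: high methanol aids binding
```

For example, a 100-kD protein maps to 5% methanol and a 15-kD protein to 20%. These are starting points; the panel itself recommends varying the methanol concentration for each target protein.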
METHOD


IMPORTANT: Wear gloves throughout this procedure to avoid contaminating the gel or mem- 
brane with proteins. 


1. Immediately following gel electrophoresis, transfer the gel to a plastic container.


Gels should not be stained before electroblotting because exposure to methanol/acetic acid
will fix the proteins in the acrylamide gel, thereby significantly reducing the yield of protein
transferred out of the gel during electroblotting.


2. Add sufficient transfer buffer to cover the gel. Allow the submerged gel to stand for 5-10
minutes at room temperature.


3. Cut a piece of PVDF membrane to the size of the gel. Wet the membrane with 100% 
methanol and then transfer it quickly to another plastic container containing transfer 
buffer. Soak the membrane in the transfer buffer for 5 minutes at room temperature. 


4. Remove the gel and the PVDF membrane from the transfer buffer and assemble them in 
the electroblotting apparatus according to the manufacturer's instructions. 


5. Electrophoretically transfer the proteins from the gel to the membrane for 5 hours at 4°C
and 500 mA. Consult manufacturers' instructions for specific conditions. Suggested
transfer conditions for specific equipment are as follows:


If using the Mini Trans-Blot Electrophoretic Transfer Cell (Bio-Rad): Set the voltage at
90-100 V for 1 hour or 30 V overnight.


If using the Trans-Blot Electrophoretic Transfer Cell (Bio-Rad): Transfer time is dependent
on the percentage of the gels and the size of a particular protein and can last from 30 minutes
to 4 hours. Perform the transfer at 4°C, and using Tris-glycine buffer, set the voltage:
up to 100 V produces ~360 mA, 150 V produces ~550 mA, or 200 V produces ~850 mA. 


6. Proceed with Protocol 3 to detect the proteins on the membrane. 
TROUBLESHOOTING GUIDE FOR ELECTROBLOTTING


The PVDF membrane does not wet properly.

* PVDF-type membranes are very hydrophobic.

* Immerse the membranes in 100% methanol until completely wet.
* Quickly place them into electroblotting buffer for equilibration.


Coomassie staining the gel after electroblotting shows that the target protein has not completely left the gel.

* Reduce the concentration of methanol in the transfer buffer.

* Reduce the amount of SDS removed from the gel by incubating the gel in electrotransfer buffer no longer than 5 minutes
prior to transfer.

* Increase the current while keeping the transfer sandwich as close to room temperature as possible. Electroblotting for a
longer time period is not likely to help because the target protein may be unable to leave the gel after the SDS has been
driven away.

* If the separation of the target protein from other proteins in the mixture is sufficient, use a lower acrylamide
concentration in the separating gel; e.g., proteins of ~100 kD usually require 8% acrylamide and proteins of 10-70 kD often
need 12% acrylamide.

* Try an electroblotting buffer with a higher (or lower) pH in case the pI of the target protein happens to be near the pH
of the transfer buffer first tried.


Although the gel contains little or no protein after electroblotting, the stained membrane also shows little or no protein.

* Proteins, especially those with low Mr, may transfer through some PVDF-type membranes (e.g., wide pore). When this
occurs, a second sheet of membrane placed behind the first sheet in the transfer sandwich may bind some of the protein
that has "blown through." To avoid the problem, use PVDF-type membranes with a high binding capacity (e.g.,
ProBlott membranes from Perkin Elmer, Immobilon PSQ from Millipore, Trans-Blot from Bio-Rad, Fluorotrans from
Pall).

* Reduce the SDS concentration, equilibrate the gel in electroblotting buffer for a longer time prior to transfer, and/or
increase the methanol concentration of the electrotransfer buffer.

* Because PVDF membranes are very hydrophobic, they must be immersed in 100% methanol before they are equilibrated
with buffer. Proteins will not bind well to dry PVDF.


Blank patchy areas on the membrane remain after staining. 

* Remove any air bubbles that are trapped between the layers of the transfer sandwich because they can prevent transfer
from occurring in small regions.

* Use only high-quality PVDF-type membranes. Poor-quality membranes may contain irregularities.

* After complete immersion in methanol, the entire membrane must be thoroughly wetted with buffer. Proteins will not
bind well to areas of PVDF that are not properly wetted.


No signal results from direct sequence analysis. 

* Run a sample evaluation gel (e.g., SDS-PAGE; see Chapter 2) to reassess the amount of protein in the gel prior to
electroblotting, or if the amount of material allows, measure the amount of protein in an aliquot of the sample by amino
acid analysis or other methods (e.g., BCA method, Appendix 2). If an insufficient amount of sample was delivered to
the sequencer, prepare a new sample.

* Fragment the unsuccessful sample on its sequencing matrix with cyanogen bromide and attempt sequencing again. If
the sample was amino-terminally blocked, several signals will be apparent. See reference(s) to "deblocking" of proteins
mentioned in the chapter introduction.

* Prepare fragments for internal sequencing (see Chapter 7).


More than one signal results from direct sequence analysis. 

* The primary signal from the major protein can sometimes be distinguished from the secondary signals of contaminating
proteins by careful analysis of the data.

* Improve the resolution of the target protein on a gel of different acrylamide concentration, a gradient gel, or a 2D gel
(see Chapters 2 and 4).

* Protect the sample from excessive heat and acid, which can cause nonspecific cleavage of internal peptide bonds.


Adapted, with permission, from the handbook, Preparing samples for protein sequencing: A newcomer's guide (©1995 The
Perkin-Elmer Corporation).


PROTOCOL 3 


Detecting Proteins on PVDF Membranes 


A NUMBER OF DIFFERENT STAINS CAN BE USED to visualize proteins on PVDF membranes,
including Coomassie Brilliant Blue R250 (CBR-250), Ponceau S, Amido Black, colloidal metals,
and transillumination. Apart from colloidal metal stains such as gold and silver, all of
these stains are compatible with Edman chemistry sequencing. CBR-250 is versatile and
sensitive enough for detecting sequenceable amounts of protein. Ponceau S and Amido Black are
less sensitive than CBR-250 (Sanchez et al. 1992), but they can be removed from the gel with
H2O. A drawback of Amido Black is that it contributes a UV-absorbing contaminant that
appears in the hydrophobic region of reversed-phase chromatograms, thereby limiting its
usefulness for subsequent peptide mapping and PTH amino acid analysis. Transillumination
is a simple method, but it is not very sensitive for detecting proteins. (Transillumination
works as follows: dried PVDF membranes are rewetted with 20% methanol and viewed on a
light table. The membrane remains opaque except where protein is bound; the protein bands
appear translucent, wetter, and shinier than the surrounding membrane.)

Gels should not be stained before electroblotting because exposure to methanol/acetic 
acid will fix the proteins in the acrylamide gel, thereby significantly reducing the yield of pro- 
tein transferred out of the gel during electroblotting. This protocol describes the CBR-250 
method for staining blotted proteins, and was adapted from Ward et al. (1990). 


MATERIALS 


CAUTION: See Appendix 3 for appropriate handling of materials marked with <!>. 


» Reagents

Coomassie Brilliant Blue staining solution
Dissolve 1 g of CBR-250 <!> in 1 liter of aqueous 50% methanol <!> (0.1% w/v final
concentration). Stir the solution for 3-4 hours and then filter through Whatman filter paper.

Destaining solution
Mix 50% methanol, 10% acetic acid <!>, and 40% H2O. To avoid amino-terminal blockage of
proteins, use only high-quality acetic acid (free of peroxides, aldehydes, etc.) during the staining and
destaining procedures.


» Equipment 


Plastic container 


» Biological Sample 
PVDF membrane from Protocol 2 containing the desired proteins 
METHOD


1. Immediately following electrotransfer in Protocol 2, place the PVDF membrane in a plas- 
tic container and add enough Coomassie Brilliant Blue staining solution to cover the 
membrane. Allow it to stand for 5-10 minutes at room temperature to stain the mem- 
brane. 


2. Discard the staining solution. Cover the membrane with destaining solution. Change the 
destaining solution until the PVDF membrane is fully destained. 


Approximately three to four changes are usually required for a total time of 30 minutes to 1
hour.


3. Wash the PVDF membrane with H2O for 5-10 minutes.


4. Repeat the wash five more times. 


PROTOCOL 4 


Concentrating Acrylamide Gel Spots 


A NUMBER OF REPLICATE PROTEIN BANDS FROM 1D acrylamide gels (or 2D gel spots) are often
required to provide sufficient material for sequencing. In these situations, it is necessary to
excise several stained protein bands or spots from the gels and preconcentrate them prior to
electroblotting for amino-terminal sequencing or internal sequence analysis following in-gel
proteolytic (or chemical) fragmentation. Empirical observations indicate that to obtain
efficient cleavage of a protein in a gel, it is necessary to keep the ratio of protein to acrylamide
as high as possible (Eckerskorn and Lottspeich 1989; Tempst et al. 1990). This can be
accomplished by reelectrophoresis of several stained gel pieces (the number dictated by the volume
of the sample well) either in an analytical SDS-PAGE gel or in a simple SDS-PAGE gel poured
in a single conventional glass Pasteur pipette (Gevaert et al. 1996). The Pasteur pipette
concentrating procedure is described in this protocol, which is adapted, with permission, from
Gevaert et al. (1996).


MATERIALS


CAUTION: See Appendix 3 for appropriate handling of materials marked with <!>.


» Reagents


Acrylamide mix (30%) <!>
To prepare the 30% acrylamide mix, combine 30% (w/v) acrylamide and 1.0% N,N'-methylenebisacrylamide
<!> in H2O. The final mixture is 30%T and 3.3%C.

Agarose (5% w/v)
Melt the agarose and hold at 55-65°C.

Coomassie Brilliant Blue staining solution
Dissolve 1 g of CBB-R250 <!> in 1 liter of aqueous 50% methanol <!> (0.1% w/v final
concentration). Stir the solution for 3-4 hours and then filter through Whatman filter paper.

Destaining solution
Mix 50% methanol, 10% acetic acid <!>, and 40% H2O. To avoid amino-terminal blockage of
proteins, use only high-quality acetic acid (free of peroxides, aldehydes, etc.) during the staining and
destaining procedures.

Equilibration buffer
Mix 12.5 ml of 0.5 M Tris-HCl (pH 6.8), 20 ml of 10% SDS, 5 ml of 2-mercaptoethanol <!>, and 10
ml of glycerol, and then add a few grains of bromophenol blue.

10x Laemmli electrophoresis running buffer
Dissolve 30.0 g of Tris base, 144.0 g of glycine <!>, and 10.0 g of SDS in 1000 ml of H2O. The pH
of the buffer should be 8.3 and no pH adjustment is required. Store the running buffer at room
temperature and dilute to 1x before use.
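As a cross-check on the 10x Laemmli running buffer recipe, the working (1x) concentrations can be computed from the weighed masses. The Python sketch below is illustrative only. The molar masses used (Tris base 121.14 g/mole, glycine 75.07 g/mole) are standard values rather than figures from this protocol, and the function and constant names are hypothetical.

```python
TRIS_BASE_G_PER_MOL = 121.14  # molar mass of Tris base (standard value)
GLYCINE_G_PER_MOL = 75.07     # molar mass of glycine (standard value)


def running_buffer_working_strength(tris_g=30.0, glycine_g=144.0, sds_g=10.0,
                                    volume_ml=1000.0, dilution=10.0):
    """Concentrations after diluting the stock `dilution`-fold.

    Returns Tris and glycine in mM and SDS in % (w/v).
    """
    liters = volume_ml / 1000.0
    tris_mm = tris_g / TRIS_BASE_G_PER_MOL / liters * 1000.0 / dilution
    glycine_mm = glycine_g / GLYCINE_G_PER_MOL / liters * 1000.0 / dilution
    sds_percent = sds_g / volume_ml * 100.0 / dilution
    return {"tris_mM": tris_mm, "glycine_mM": glycine_mm,
            "sds_percent_wv": sds_percent}
```

With the default arguments, the 1x buffer works out to about 25 mM Tris, about 192 mM glycine, and 0.1% (w/v) SDS, the familiar Tris-glycine-SDS composition.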
» Equipment


[Figure 6.19. Panel A: Gel preparation. Panel B: Electrophoresis. Labels: Laemmli buffer; gel pieces; concentrated gel band; 5% acrylamide; 5% agarose; 6.5 cm.]


FIGURE 6.19. Design and operation of the Pasteur pipette concentration gel system. (Redrawn, with
permission, from Simpson and Reid 1997; originally adapted from Gevaert et al. 1996.)


Stacking gel (5%)

To prepare 5% stacking gel mixture, combine in the following order:
2 ml of 30% acrylamide mix
3 ml of 0.5 M Tris-HCl (pH 6.8)
0.12 ml of 10% (w/v) SDS <!>
6.76 ml of H2O
0.12 ml of 10% ammonium persulfate <!>
0.006 ml of N,N,N',N'-tetramethylethylenediamine (TEMED) <!>
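The stacking gel recipe can be checked, and scaled to a smaller batch, with a few lines of arithmetic. The Python sketch below is illustrative and not part of the original protocol. The dictionary keys and function names are hypothetical, and volumes are assumed to be additive.

```python
# Component volumes (ml) as listed in the stacking gel recipe.
STACKING_GEL_ML = {
    "acrylamide_mix_30pct": 2.0,
    "tris_hcl_0.5M_pH6.8": 3.0,
    "sds_10pct": 0.12,
    "water": 6.76,
    "aps_10pct": 0.12,
    "temed": 0.006,
}


def total_volume_ml(recipe):
    """Sum of all component volumes."""
    return sum(recipe.values())


def final_acrylamide_percent(recipe, stock_percent=30.0):
    """Final acrylamide concentration (%) after dilution of the stock."""
    return stock_percent * recipe["acrylamide_mix_30pct"] / total_volume_ml(recipe)


def final_tris_molar(recipe, stock_molar=0.5):
    """Final Tris-HCl concentration (M) after dilution of the stock."""
    return stock_molar * recipe["tris_hcl_0.5M_pH6.8"] / total_volume_ml(recipe)


def scale_recipe(recipe, target_total_ml):
    """Scale every component so the batch totals target_total_ml."""
    factor = target_total_ml / total_volume_ml(recipe)
    return {name: vol * factor for name, vol in recipe.items()}
```

The listed volumes total about 12 ml and give close to 5% acrylamide and 0.125 M Tris-HCl, consistent with the "5%" label. Because a Pasteur pipette holds only a small volume, `scale_recipe(STACKING_GEL_ML, 3.0)` might be a convenient way to prepare a smaller batch with the same proportions.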


Pasteur pipettes 

Polypropylene tubes (10 ml) 

Protean II xi 2-D cell (Bio-Rad) or equivalent 
Syringe (2 ml) 

Tubes (50 ml) 

Tube gel adaptor 


» Biological Sample


2D electrophoresis gel spots containing the protein of interest


METHOD


1. Excise the desired 2D electrophoresis gel spots and place each spot in its own 10-ml
polypropylene tube. Rehydrate the spots in 9 ml of H2O.


Take care not to mix up the spots. Multiples of identical spots from replicate gels can be
combined and treated in one tube.


2. Wash the rehydrated gel spots with 9 ml of H2O five times for ~5 minutes per wash.


3. Discard the H2O from the final wash, and replace it with enough equilibration buffer to
cover the gel pieces. Equilibrate the gel spots for at least 1 hour at room temperature.


4. Dip the tip of a Pasteur pipette into a solution of melted 5% agarose and hold the pipette
for a short while until the agarose solidifies. Pipette the stacking gel mixture into the
Pasteur pipette to cast the concentrating gel.


5. Insert the pipettes into the Protean II xi 2-D cell with a tube gel adaptor. Layer 0.5 ml of
H2O on top of each gel.


6. Allow the gels to polymerize (for gel details, see Figure 6.19A). Remove the H2O from the
top of the pipette and load the equilibrated gel spots and equilibration buffer (from Step
3) into the pipette (see Figure 6.19B). The maximum total volume of combined gel pieces
and equilibration buffer is 0.5 ml. Fill the remaining volume of the pipette with Laemmli
electrophoresis running buffer.


7. Perform electrophoresis in the Protean II xi 2-D cell at 250 V at room temperature until
the dye reaches the end of the pipette.


8. Stop the electrophoresis. Use a 2-ml syringe to remove the concentrating gel from the
pipette by applying gentle pressure from the bottom of the pipette.


9. Transfer the concentrating gel into a 50-ml tube filled with 30 ml of Coomassie Brilliant
Blue staining solution. Stain the gel for 10-20 minutes at room temperature.


10. Destain the gel with destaining solution.


11. Excise the stained protein band for further characterization, for example, internal
sequence analysis (see Chapter 7) or direct Edman degradation following electroblotting
onto PVDF (see Protocol 2).


PROTOCOL 5


Solid-phase Microsequencing of Phosphopeptides


ANALYSIS OF PHOSPHORYLATED PEPTIDES isolated from endogenously phosphorylated tissue
samples where no ³²P or other radiolabel is present may be particularly difficult if the amount
is below detection levels for mass spectrometry. The identification of phosphorylation sites
where ³²P is present may also be problematic. However, excellent recovery of radioactivity
from phosphorylated residues can be achieved with solid-phase sequencing, which allows the
attachment of proteins via side-chain ε-lysine and amino-terminal NH2 groups (to
Sequelon-DITC) or via side-chain and carboxy-terminal carboxyl groups (to Sequelon-AA).
The latter PVDF arylamine supports are particularly applicable to the analysis of
phosphorylated peptides in conventional gas- or pulsed-liquid-phase protein sequencers. This
method results in almost quantitative recovery of very low levels of radiolabeled phosphate
using 9:1 methanol:H2O (containing 2 mM phosphate as a "carrier") to replace the hydrophobic
solvent, n-butanol. The hydrophilic solvent does not wash away the peptide due to the
covalent linkage of the peptide or protein to the derivatized PVDF membrane supports. If
sufficient quantities of sample and radioactivity are available, the thiazolinone derivatives can
be converted at each cycle of Edman degradation to the PTH amino acids and the sample can
be split: part to the conventional HPLC and part to a fraction collector to measure the
³²P-labeled phosphate released at each cycle. This protocol was provided by Alastair Aitken
(University of Edinburgh, UK).


MATERIALS


CAUTION: See Appendix 3 for appropriate handling of materials marked with <!>.


» Reagents


Carbodiimide reagent (1-[3-dimethylaminopropyl]-3-ethylcarbodiimide hydrochloride,
EDC) <!>

Coupling buffer (pH 5) supplied in Sequelon kit (nonreactive with carbodiimide, e.g.,
pyridinium-HCl or MES <!>)

Methanol:H2O (9:1) containing 2 mM sodium phosphate <!>

Peptide solvent (e.g., 20% acetonitrile/0.1% TFA) <!> 

S3 (n-butyl chloride/n-butanol) <!> 


Scintillation fluid (e.g., Universol-ES, ICN) <!> 
This fluid is added to the samples recovered at each step for counting after collection, but note the 
exceptions within the protocol. 
» Equipment


ATZ-collect program

Heating block preset to 55°C

Micropipette

Mylar sheet (supplied with kit)

Protein sequencer (Applied Biosystems or equivalent)

Scintillation counter

Sequelon-AA kit (GEN 920033, Applied Biosystems)

Test tubes


» Biological Sample

Peptide, radiolabeled <!>
The quantity is not important. Approximately 200 cpm (Cerenkov counts of ³²P radioactivity) is the
minimum to observe a reasonable "burst of counts," depending on the length of the sequencer run.


METHOD


1. Dissolve the radiolabeled peptide of interest in a small tube in 20 µl of appropriate
solvent (e.g., aqueous acetonitrile up to 30% with or without 0.1% TFA, which is a typical
solution in which a peptide might elute from an RP-HPLC column). Heat the mixture at
55°C and mix to dissolve.


2. Count the tube in a scintillation counter with the Cerenkov program. Do not add any
scintillation fluid for Cerenkov counting.


3. Place the Sequelon-AA disk from the kit onto a Mylar sheet placed on a heating block at
55°C.


4. Pipette the radiolabeled peptide sample onto the Sequelon-AA disk. Allow the disk to dry
thoroughly. Count the disk and the sample tube in a scintillation counter. Again, do not
use any scintillation fluid.


5. Rinse the tube three times with 10-20 µl of peptide solvent. Pipette the washings onto the
Sequelon-AA disk and dry between each application. Repeat the wash if significant
counts are left in the tube.


6. Count the disk in the scintillation counter without using any scintillation fluid.


7. Pipette 100 µl of coupling buffer into a test tube and add 1 mg of the H2O-soluble
carbodiimide reagent EDC.


8. Pipette 5 µl of the EDC solution onto the sample disk. Allow the reaction to proceed for
20 minutes at room temperature.


9. After the sample has been coupled, rinse the disk a few times with a peptide solvent
similar to that in Step 5 to remove noncovalently linked peptide, which could give rise to
background during the sequencing run. Count the disk in a scintillation counter.
10. Replace S3 (n-butyl chloride) with methanol:H2O (9:1) containing 2 mM sodium
phosphate.


11. Place the disk in the reaction cartridge of the protein sequencer using methanol:H2O (9:1)
containing 2 mM sodium phosphate. Carry out standard Edman sequencing using an
ATZ-collect program.


This delivers the resulting ATZ products and the ³²P-labeled phosphate directly to the
fraction collector. It is advisable to check through and modify the program to optimize the
delivery times of the reagents and solvents. The aqueous methanol has a higher viscosity
than butyl chloride and will require a longer delivery time. The "Procise" model has sensors
that should ensure correct delivery, but it is strongly advisable to verify that sufficient
solvent delivery occurs.


12. Count the samples from the fraction collector in scintillation fluid in the counter.


Scintillation fluid may be used here and it will approximately double the radioactivity
counted. The background will increase similarly; therefore, a significant improvement in the
ratio of signal to noise is not always achieved.


13. To check the efficiency of Edman degradation, remove the disk from the sequencer
cartridge, allow it to dry, and count it in a scintillation counter to check the level of
radioactivity remaining.


Depending on the residues encountered and the length of the sequencer run, typically 
~10-20% of the initial quantity might be left, due to the normal repetitive yield and accu- 
mulative blockage during each cycle of degradation. 
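The link between repetitive yield and the radioactivity left on the disk can be illustrated with a deliberately simplified model. The Python sketch below is an assumption-laden illustration, not the author's calculation: it assumes a constant per-cycle repetitive yield, a single labeled residue at cycle `label_cycle`, and that any chain failing a cycle before the label is released stays on the disk with its label (lagging chains that later catch up are ignored). The function names are hypothetical.

```python
def in_phase_fraction(repetitive_yield, cycles):
    """Fraction of chains still degrading in phase after `cycles` cycles."""
    if not 0.0 <= repetitive_yield <= 1.0:
        raise ValueError("repetitive_yield must be between 0 and 1")
    if cycles < 0:
        raise ValueError("cycles must be non-negative")
    return repetitive_yield ** cycles


def label_remaining_fraction(repetitive_yield, label_cycle):
    """Fraction of label left on the disk under the simplified model.

    Chains that drop out before completing `label_cycle` keep their label.
    """
    return 1.0 - in_phase_fraction(repetitive_yield, label_cycle)
```

Under these assumptions, a 95% repetitive yield with the phosphorylated residue at cycle 3 leaves about 14% of the label on the disk, which falls within the ~10-20% range quoted above. Real runs will deviate because yields differ between residues.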


PROTOCOL 6


Carboxy-terminal Sequence Analysis 


CARBOXY-TERMINAL SEQUENCING IS A LARGELY AUTOMATED PROCESS, typically capable of
determining the identity of five residues from 0.5 nmole of protein. This carboxy-terminal
sequence analysis protocol is a modification of the protocol from Applied Biosystems and is 
capable of longer runs with increased sensitivity. It has been tested with more than 600 pro- 
teins and protein fragments in sizes ranging from 10 to 600 residues (Bergman et al. 2001). 
The technique provides quantitative characterization of carboxy-terminal sequences and 
truncation patterns and is efficient for analysis of polypeptides blocked at the amino termi- 
nus. The following are its performance characteristics: 


* Sensitivity down to the 10-pmole level.
* Extended runs: up to 10 residues using 100-500 pmoles.
* Detection of proline and PTC-Lys.
* Efficient combination of amino- and carboxy-terminal degradation.


This protocol was provided by Tomas Bergman, Ella Cederlund, and Hans Jörnvall
(Department of Medical Biochemistry and Biophysics, Karolinska Institutet).


MATERIALS 


CAUTION: See Appendix 3 for appropriate handling of materials marked with <!>. 


» Reagents 
Reagents supplied by Applied Biosystems are listed with their bottle position (C) on the se- 
quencer and part number (PN). Table 6.4 contains additional information about the reagents 
required to perform carboxy-terminal sequencing. 


Acetic anhydride/lutidine in acetonitrile (C6, PN 402142) <!> 
Dilute 1:1 with acetonitrile. 

Alkylated thiohydantoin (ATH) amino acid standard (C1, PN 403088)
Store at -20°C.

Bromomethylnaphthalene in acetonitrile (C8, PN 401703) <!>
Dilute 1:1 with acetonitrile. Store at 4°C.

Coomassie Brilliant Blue (R250) <!>

Diisopropylethylamine (DIEA) in heptane (C11, PN 401702) <!>
Dilute 1:1 with heptane.

Ethyl acetate <!>

Methanol <!>

N-methylimidazole in acetonitrile (C3, PN 402141) <!>
Dilute 1:1 with acetonitrile.

Phenyl isocyanate (PIC) (1% v/v) in acetonitrile <!>

Piperidine thiocyanate in acetonitrile (C4, PN 401700) <!>
Dilute 1:1 with acetonitrile.

Tetrabutylammonium thiocyanate in acetonitrile (C10, PN 401854) <!>
Dilute 1:1 with acetonitrile.

Trifluoroacetic acid (TFA) (C12, PN 401701) <!>


» Equipment


Amino-terminal sequencing instrument
During the development of this protocol, when carboxy-terminal degradation was combined with 
amino-terminal analysis, a Procise HT sequencer (Applied Biosystems) was used for amino-termi- 
nal degradations. 

Argon <!> 

Beaker (25-50 ml) 
See Step 1f, if adsorbing the protein sample to the PVDF membrane.

Carboxy-terminal sequencing instrument 
During the development of this protocol, carboxy-terminal sequencing was carried out using a 
Procise C instrument (Model 494, Applied Biosystems), operated essentially according to the 
manufacturer's recommendations but with the important modifications detailed in this 
protocol. ATH amino acids from the degradations were analyzed using a 2 x 220-mm 
reversed-phase C18 column (Brownlee Spheri-5 PTC, 5 µm) with detection at 257 nm. 

Heating block preset to 60°C 

Microfuge tubes (1.5 ml) 

Petri dish (glass) 

PVDF membrane (e.g., Immobilon-P, Millipore) or sample preparation cartridge (ProSorb, 
Applied Biosystems) 
The ProSorb sample preparation cartridge is essentially a microfuge tube with its rounded end 
removed and a PVDF membrane affixed to the bottom of the tube, which is held against a filter by 
a plastic cylinder that serves as a coupling device. Solutions applied to the upper surface of the 
membrane are pulled through the membrane by the wicking action of the filter below. 

Reaction cartridge (Applied Biosystems, PN 603993) 
This cartridge should be of the type with a vertical slit intended for electroblotted samples. 


Reversed-phase C18 column (2 x 220 mm) (PN 07110204, Brownlee Spheri-5 PTC, 5 µm) 


» Biological Sample 


Proteins separated on a polyacrylamide gel using either 1D- or 2D-PAGE, or proteins sup- 
plied in solution 


» Additional Reagents 


Step 1 of this protocol may require the reagents listed in Protocols 2 and 3 of this chapter. 
METHOD 


Transfer of Proteins to PVDF Membrane 


1. Apply 50-100 pmoles of the protein sample to a PVDF membrane either by electroblot- 
ting or by adsorption directly from solution (using the ProSorb sample preparation car- 
tridge). 

For electroblotting protein to a PVDF membrane: 


a. Electrotransfer the gel-separated proteins to a PVDF membrane (see Protocol 2 and 
Mozdzanowski et al. 1992). 


b. Stain the PVDF membrane containing the proteins of interest with Coomassie 
Brilliant Blue R250 (see Protocol 3). Allow the membrane to dry in the air. 


EXPERIMENTAL TIP: Use a paper clip to hang the sample in a chemical fume hood, 
or dry it on the bench, but cover the membrane with filter paper to protect it from 
contamination. 


c. Excise the desired protein band(s) or spot(s). 


If not analyzed immediately, the bands/spots can be stored for up to several months at 
-20°C. 

d. If a combination of amino- and carboxy-terminal sequence analysis is required, pro- 
ceed to Step 2 to perform the amino-terminal sequencing on the PVDF-immobilized 
sample. Alternatively, if only carboxy-terminal sequencing is required, place the excised 
membrane-bound protein spots in a 1.5-ml microfuge tube, and proceed to Step 3. 

For adsorption of protein in solution to a PVDF membrane (ProSorb): 

a. Wet the PVDF membrane with 5 µl of methanol. 


b. Add 100 µl of 0.5% TFA to the sample reservoir. 
c. Add 5-800 µl of the protein sample solution to the sample reservoir. 


d. Wash the membrane with 100 µl of 0.1% TFA if the sample solution contains salt or 
other low-molecular-weight contaminants. Repeat the wash two more times, each 
with 100 µl of 0.1% TFA. 


e. Remove the sample reservoir housing with the PVDF membrane from its holder and 
invert the reservoir onto a Petri dish. Make sure that the PVDF membrane with the 
immobilized sample faces up. 


f. Cover the inverted membrane (in its reservoir) with a glass beaker and place the Petri 
dish on a heating block set at 60°C. Dry the PVDF membrane completely (~20 minutes). 


The intent is to keep contaminants from the air away from the PVDF membrane while 
it is drying. Do not allow the beaker to touch the membrane either. 


g. Carefully free the PVDF membrane from its housing. 


h. If a combination of amino- and carboxy-terminal sequence analysis is required, proceed 
to Step 2 to perform the amino-terminal sequencing on the PVDF-immobilized 
sample. Alternatively, if only carboxy-terminal sequencing is required, place the 
membrane-bound protein in a 1.5-ml microfuge tube, and proceed to Step 3. 
Amino Acid Sequencing 


2. To sequentially perform amino- and carboxy-terminal sequencing: 

a. Perform amino-terminal sequence analysis for 5-25 cycles. 

b. Transfer the PVDF membrane to a 1.5-ml microfuge tube, and proceed to Step 3. 
3. To prepare the sample for carboxy-terminal sequencing: 


a. Add 5 µl of 5% DIEA in heptane to the PVDF membrane. Allow the heptane to 
evaporate for ~1 minute at 60°C. 


b. Add 5 µl of 1% PIC in acetonitrile and incubate for 5 minutes at 60°C. 


c. Transfer the PVDF membrane with the PIC-derivatized sample to a reaction cartridge 
for a carboxy-terminal sequencing instrument. 


MODIFICATIONS TO THE CARBOXY-TERMINAL SEQUENCER PROGRAM 


1. Start with an ethyl acetate wash in the reaction cartridge. 


N 0-second. wash and 30-second argon dry 
"^. j60-second wash and 30-second argon dry 


2. Insert additional washing steps immediately before the C10/C12 combined 
cleavage/cyclization step. Instead of a single ethyl acetate wash, include four washing 
steps with intervening drying steps. 
90-second wash and 30-second argon dry 
90-second wash and 30-second argon dry 
45-second wash and 30-second argon dry 
45-second wash and 30-second argon dry 


3. The chemistry of the degradation cycle follows the thiohydantoin method (Stark 1968). 
However, follow the procedures listed below: 


* Activate the carboxyl terminus only once with acetic anhydride/lutidine in acetonitrile. 

* Add bromomethylnaphthalene in each cycle to S-alkylate the thiohydantoin. This makes 
it a better leaving group and allows a mild combination of peptide cleavage and activation 
for the next cycle using tetrabutylammonium thiocyanate in TFA (Boyd et al. 1992, 1995). 

* Use a reaction cartridge with a vertical slit intended for amino-terminal degradation 
of electroblotted samples (e.g., the reaction cartridge for the Procise HT, Applied 
Biosystems) to provide improved sensitivity in carboxy-terminal degradation. 

* Use a gradient with linear segments, typically: 14-27% Solvent B (see Table 6.4) for 
0-4 minutes, 27-44% Solvent B for 4-28 minutes, 44-47% Solvent B for 28-34 
minutes, and 47-90% Solvent B for 34-35 minutes. Follow this with an isocratic 
wash of the column at 90% Solvent B for 5 minutes. The optimal elution program 
may vary slightly, depending on the particular reversed-phase column used. 


For additional changes to the manufacturer's recommended parameters (involving dilu- 
tions of solvents, choice of solvents used in the chromatography, and reaction tempera- 
tures), see Table 6.4. 
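
The segmented gradient described above can be written down as a table of breakpoints, which makes it easy to check the expected solvent composition at any time point. The following Python sketch is illustrative only: the breakpoints are taken from the typical program given in this protocol, whereas the names GRADIENT and percent_b are hypothetical and do not correspond to any instrument software.

```python
from bisect import bisect_right

# (time in minutes, % Solvent B) breakpoints of the typical program in this protocol.
# The final pair represents the 5-minute isocratic wash at 90% Solvent B.
GRADIENT = [(0.0, 14.0), (4.0, 27.0), (28.0, 44.0),
            (34.0, 47.0), (35.0, 90.0), (40.0, 90.0)]


def percent_b(t_min, program=GRADIENT):
    """Return % Solvent B at time t_min by linear interpolation between breakpoints."""
    times = [t for t, _ in program]
    if t_min <= times[0]:
        return program[0][1]
    if t_min >= times[-1]:
        return program[-1][1]
    # Index of the segment whose start time is <= t_min
    i = bisect_right(times, t_min) - 1
    (t0, b0), (t1, b1) = program[i], program[i + 1]
    return b0 + (b1 - b0) * (t_min - t0) / (t1 - t0)


if __name__ == "__main__":
    for t in (0, 2, 4, 16, 28, 34, 35, 37):
        print(f"{t:>4} min: {percent_b(t):5.1f}% Solvent B")
```

Because the optimal program depends on the column used, the breakpoint list is the only part that would need to be edited for a different elution profile.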
MICROGRAM/PICOMOLE CONVERSION GUIDE 


For microsequencing purposes, proteins are measured in units of micrograms (µg) or in terms of picomoles 
(pmoles). Note that half as many micrograms of a 50,000-dalton protein are required as are required of a 
100,000-dalton protein. This is because a given number of micrograms of a 50,000-dalton protein contains 
twice as many picomoles (i.e., twice as many amino termini) as the same number of micrograms of a 
100,000-dalton protein. 


General Formula 


The following general formula can be used for picomole/microgram conversions. 
1000/molecular weight of protein in kD = number of pmoles/µg 


Given the molecular weight of the protein, quick estimates can be made using this mnemonic device. 


100 kD:  100 µg = 1 nmole     10 µg = 100 pmoles 
 50 kD:   50 µg = 1 nmole      5 µg = 100 pmoles 
 25 kD:   25 µg = 1 nmole    2.5 µg = 100 pmoles 
 10 kD:   10 µg = 1 nmole      1 µg = 100 pmoles 


From Micrograms to Picomoles 
The number of picomoles in a particular sample is equal to the number of micrograms divided by the product 
of the molecular weight of the protein and $10^{-6}$. To convert micrograms of protein to picomoles: 
1. Calculate the number of micrograms per picomole of a particular protein by multiplying the protein's 
molecular weight in daltons by $10^{-6}$. 
2. Divide the measured or estimated number of micrograms by the number of micrograms per picomole. The 
following formula can be used: 
$M / (W \times 10^{-6}) = n$ 
where M is the number of micrograms of sample available, W is the molecular weight of the protein, and n 
is the number of picomoles of sample available. For example, a 45,000-dalton protein weighs 0.045 µg per 
picomole ($45{,}000 \times 10^{-6}$). Therefore, 2.5 µg is equal to 55.6 pmoles (2.5/0.045). Similarly, a band in a gel 
estimated to contain 4.0 µg of a 45,000-dalton protein provides ~90 pmoles for sequencing (4.0/0.045 = 88.9). 


From Picomoles to Micrograms 
The number of micrograms of protein needed for a sample consisting of a specified number of picomoles is 
equal to the number of picomoles multiplied by the molecular weight times $10^{-6}$. To convert picomoles of 
protein to micrograms: 

1. Multiply the molecular weight of the protein by $10^{-6}$. 

2. Multiply the product from Step 1 by the number of picomoles desired. 

In other words, to determine the amount of protein required to prepare a 50-pmole sample for sequencing, 
multiply the number of micrograms per picomole by 50. The following formula can be used: 

$n \times (W \times 10^{-6}) = M$ 

where M is the number of micrograms of sample required, W is the molecular weight of the protein, and n is 
the number of picomoles desired. For example, a 60,000-dalton protein weighs 0.06 µg per picomole 
($60{,}000 \times 10^{-6}$). Therefore, the preparation of a 50-pmole sample for sequencing requires 3 µg (0.06 x 50). 
Similarly, preparing a 50-pmole sample of a 30,000-dalton protein for sequencing requires 1.5 µg (0.03 x 50). 


(Reprinted, with permission, from the handbook, Preparing samples for protein sequencing: A newcomer's guide [©The Perkin-Elmer Corporation].) 
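
The two conversions in this guide can be expressed as a pair of small functions. The Python sketch below simply restates the formulas given above (micrograms per picomole = molecular weight in daltons x 10^-6); the function names are hypothetical and chosen only for illustration.

```python
def micrograms_to_picomoles(micrograms, mol_weight_da):
    """n = M / (W x 10^-6): picomoles contained in a given mass of protein."""
    if mol_weight_da <= 0:
        raise ValueError("molecular weight must be positive")
    micrograms_per_picomole = mol_weight_da * 1e-6
    return micrograms / micrograms_per_picomole


def picomoles_to_micrograms(picomoles, mol_weight_da):
    """M = n x (W x 10^-6): mass of protein needed for a given number of picomoles."""
    if mol_weight_da <= 0:
        raise ValueError("molecular weight must be positive")
    return picomoles * mol_weight_da * 1e-6


if __name__ == "__main__":
    # Worked examples from the guide
    print(round(micrograms_to_picomoles(2.5, 45_000), 1))   # 2.5 µg of a 45-kD protein
    print(round(micrograms_to_picomoles(4.0, 45_000), 1))   # 4.0 µg of a 45-kD protein
    print(round(picomoles_to_micrograms(50, 60_000), 2))    # 50 pmoles of a 60-kD protein
    print(round(picomoles_to_micrograms(50, 30_000), 2))    # 50 pmoles of a 30-kD protein
```

The molecular weight is passed in daltons rather than kilodaltons to match the formulas in the guide.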
Peptide mapping (or protein fingerprinting) refers to the generation of peptides from a 
protein by enzymatic or chemical means, followed by separation and analysis of the frag- 
ments. This technique provides useful information about the primary structure of a pro- 
tein (analytical peptide mapping), as well as the conformation and the relationships 
between proteins (comparative peptide mapping). Analytical peptide mapping can be used 
for the following: 


* Identifying specific internal peptide sequences within a protein such as posttranslation- 
al modification sites, antigenic determinants, active sites, and ligand binding regions. 


* Obtaining sufficient internal sequence information to allow identification of a protein, 
which is especially important for amino-terminally blocked proteins that are refractory 
to the Edman degradation procedure. 


* Determining cysteine connectivities (disulfide bonds). 


Comparative peptide mapping is useful for comparing the sequences or conformations 
of related proteins. A major advantage of this peptide mapping approach is that it can yield 
much useful information without much investment of time and equipment. 


NOT ALL PROTEINS YIELD MEANINGFUL AMINO-TERMINAL sequence information using the 
Edman degradation procedure (outlined in Chapter 6). For instance, many polypeptide 
chains have blocked amino termini (due to, e.g., N-acetyl and N-formyl derivatization of 
α-amino groups or pyroglutamate formation from an amino-terminal glutamine residue under 
acidic conditions) and are refractory to the Edman degradation procedure. A free α-amino 
group is a necessary prerequisite for the Edman degradation method. It has been estimated 
that greater than 75% of intracellular proteins isolated from eukaryotic cells have blocked 
amino termini, mainly as a result of posttranslational modification, especially acetylation 
(Brown and Roberts 1976; Brown 1979). Furthermore, for many high-molecular-mass pro- 
teins (>50 kD), amino-terminal sequence data may be uninterpretable due to a rapid build 
up of phenylthiohydantoin (PTH) amino acid background levels during the Edman degra- 
dation procedure. 

When working with subpicomole amounts of material (i.e., proteins in the low-micro- 
gram range), an amino-terminal sequence strategy based on Edman degradation of the intact 
material is particularly risky because it may provide an "all-or-none" opportunity to obtain 
useful sequence data. If a protein is amino-terminally "blocked," and no sequence information 
is obtained, the sample is lost. (However, it is possible to glean some sequence information 
by removing the sample from the sequencer, treating it in situ with cyanogen bromide 
on the sample disk, and resequencing the cleaved material [Simpson and Nice 1984].) 
Therefore, for many reasons, the preferred strategy is to obtain internal sequence information 
on low-Mr peptides following enzymatic or chemical fragmentation of the mature protein. 
This strategy offers the likelihood of obtaining multiple peptide sequences from various 
regions of the polypeptide chain. 

Although limited amino-terminal sequence information can be extremely useful for pro- 
tein identification purposes, extensive sequence coverage of a polypeptide chain is often 
required to 


* confirm the integrity of a recombinant protein, 
* identify posttranslational modification sites (e.g., phosphorylation or glycosylation), 
* identify artificial modifications (e.g., chemical cross-linking studies), 
* map antibody recognition sites (epitopes), and 
* determine cysteine connectivities (disulfide bonds) within a polypeptide chain. 


Treatment with a specific protease such as trypsin can generate a set of peptides covering 
the entire polypeptide chain. Visualization of peptide mixtures separated by chromatography 
(typically, reversed-phase chromatography, see Protocol 15 and Chapter 5) or electrophore- 
sis (see Cleveland "limited digests" procedure in Protocols 16 and 17) is referred to as peptide 
mapping or “fingerprinting.” Although mapping techniques are commonly used for purifying 
peptides from a complex digest, they are also used for comparing one protein with another 
by the distinctive pattern that the peptides form on the map, hence the name “fingerprinting.” 
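
As a rough illustration of how a specific protease defines the set of peptides in a map, the following Python sketch performs an in silico tryptic digest. It assumes the commonly cited specificity rule that trypsin cleaves on the carboxy-terminal side of lysine (K) and arginine (R) except when the next residue is proline (P). That rule, the function name, and the example sequence are illustrative assumptions rather than material from this chapter, and real digests usually also contain missed cleavages.

```python
def tryptic_peptides(sequence):
    """Split a one-letter amino acid sequence after K or R, unless followed by P.

    Simplified model of a complete tryptic digest (assumption: no missed cleavages).
    """
    seq = sequence.upper()
    peptides = []
    start = 0
    for i, residue in enumerate(seq):
        next_residue = seq[i + 1] if i + 1 < len(seq) else ""
        if residue in "KR" and next_residue != "P":
            peptides.append(seq[start:i + 1])
            start = i + 1
    if start < len(seq):
        # Carboxy-terminal peptide that does not end in K or R
        peptides.append(seq[start:])
    return peptides


if __name__ == "__main__":
    # Hypothetical sequence, used only to show the cleavage rule
    print(tryptic_peptides("AKRPGKSTRLLE"))
```

Two related proteins that differ at a single residue would give peptide lists that differ in only one or two entries, which is the basis for comparing their "fingerprints."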


MACROSCALE AND MICROSCALE METHODS ARE USED TO GENERATE PEPTIDE MAPS 


The usual approach for generating a peptide map of a protein for the purpose of obtaining 
peptide sequence information is described below: 


* Reduction and alkylation of the purified protein. This can be performed either “in 
solution” or in situ in a one- or two-dimensional acrylamide gel. 


* Fragmentation of the reduced and alkylated protein by chemical (e.g., cyanogen bromide 
or limited acid hydrolysis) or enzymatic (e.g., trypsin) means. 


* Separation of the peptides generated. 


* Sequence analysis of the peptides using classical Edman degradation (see Chapter 6) or 
mass-spectrometry-based methods (see Chapter 8). 


The strategy for generating peptides for sequence analysis very much depends on the nature 
of the source material (see Figure 7.1). There are three key methods for purifying proteins: 


* Chromatographic procedures such as reversed-phase high-pressure liquid chromatography 
(RP-HPLC; see Chapter 5) and ion exchange chromatography, which yield proteins in solu- 
tion. 


* One- and two-dimensional acrylamide gel electrophoresis, from which proteins can be 
isolated as bands or spots, respectively (see Chapter 2 or 4). 


* Gel electrophoresis in combination with electroblotting of proteins to an inert immobiliz- 
ing membrane such as polyvinylidene difluoride (PVDF; see Chapter 4). 


In this chapter, attention is given to both macroscale (i.e., milligram amounts) and 
microscale (i.e., low-microgram amounts) peptide mapping, which require different 
strategies. For instance, any desalting, reduction, alkylation, and concentration steps carry 
the risk of sample loss due to adsorption onto surfaces, especially after a drying step, such as 
lyophilization. These losses are particularly significant when working with microgram 
amounts of protein (2 µg of a 20-kD protein is equivalent to 100 pmoles), presenting the 
investigator with technical challenges in both sample handling and manipulation. 

Although extremely high purity is essential for Edman sequencing, mass-spectrometry- 
based sequencing strategies are more forgiving. In fact, mass spectrometry provides an addi- 
tional separation step based on the mass:charge ratio of the proteins or peptides. Even so, 
when attempting microscale purification and sequencing, the investigator must plan ahead to 
ensure the compatibility of buffers and detergents with downstream reduction and alkyla- 
tion, fragmentation, and separation steps. Special consideration for sample preparation for 
microscale sequencing is given in this chapter. 
FIGURE 7.1. Experimental approaches for obtaining sequence information from gel-separated proteins. 


SOME CONSIDERATIONS WHEN HANDLING SAMPLES FOR MICROSEQUENCING 


Avoid drying samples prepared by RP-HPLC. If the volume must be reduced, concentrate 
the sample by trace enrichment using a narrow-bore reversed-phase column (i.e., 
≤2.1-mm internal diameter) (for experimental procedures, see Protocol 15). Using this 
procedure, a large volume (several milliliters) can be concentrated to ≤50 µl with >90% 
yield. 


Minimize the number of handling steps. 


Store RP-HPLC samples at -20°C in tightly capped polypropylene tubes (e.g., 1.5-ml 
microfuge tubes) to avoid losses that occur by the sample sticking irreversibly to the sides 
of the tube upon evaporation of the organic phase. To avoid this potential problem, add 
a small amount of detergent to the sample (e.g., 0.01% Tween-20). 


Proteolysis of RP-HPLC-purified samples can be conducted in the same collection tube 
by simply adjusting the detergent concentration to ~0.02% Tween-20, reducing the 
volume to ~50 µl by centrifugal lyophilization, and then adding the appropriate digest 
buffer and protease (see Protocols 3 and 5). 
COVALENT CROSS-LINKS WITHIN PROTEINS MUST BE BROKEN TO 
IMPROVE PEPTIDE MAPPING 


Once a protein has been purified to homogeneity using one of the approaches outlined in 
Figure 7.1, it must be readied for fragmentation (either chemical or enzymatic), the first stage 
of peptide mapping, and subsequent sequence analysis. However, one major problem fre- 
quently encountered during fragmentation of a protein sample is the obstruction of protease 
action by covalent inter- and intrapolypeptide cross-links (e.g., disulfide bonds). For peptide 
mapping studies, such cross-links must be broken to render the polypeptide chain more 
uniformly accessible to the protease and to increase the likelihood of obtaining cleavage of all 
susceptible peptide bonds. In addition, proteolytic cleavage fragments comprising two or 
more cross-linked peptides are often difficult to interpret during peptide mapping and 
subsequent structural analysis. However, maintaining the integrity of cross-links is often useful 
in obtaining limited cleavage of a protein in its native state. Limited digests are a useful first 
step in strategies aimed at determining either domain structures of a protein (Polverino de 
Laureto et al. 1999) or conformational epitopes (Jemmerson and Paterson 1986). In this 
chapter, disulfide bonds are the only type of covalent polypeptide cross-links that are dis- 
cussed. 

Other types of less frequently occurring cross-links (those involved in maintaining the 
structural integrity of collagen, elastin, and other fibrous proteins, e.g., desmosine and 
isodesmosine) (Traub and Piez 1971) are not described here because there are no generally 
applicable methods for their selective cleavage. 


Performic Acid Is Used to Cleave Disulfide Bonds and Modify 
Sulfhydryl Groups 


Procedures for the cleavage of the covalent disulfide linkages of cystine include oxidation, 
reduction, and other nucleophilic displacement reactions (for a review, see Liu 1977; Fontana 
and Gross 1987). Oxidative cleavage of disulfide bonds with performic acid was a frequently 
used procedure in the early days of protein sequence analysis (Sanger 1949). This method 
(see Protocol 1) has many attractions, especially for amino acid analysis. Performic acid oxi- 
dation is a simple single-step chemical modification procedure that quantitatively converts 
cysteine to cysteic acid and methionine to methionine sulfone in yields of 90-95%. Amino 
acid analysis of performic-acid-oxidized proteins is often used to determine the total half- 
cystine (as cysteic acid) and methionine content of a protein. 

One of the major attractions of performic acid cleavage of disulfide bonds is the forma- 
tion of cysteic acid, a strongly acidic sulfonate moiety with a highly polar character that 
increases the solubility of the polypeptide chain. A similar functional group (S-sulfo) can be 
introduced onto the β-carbon of cysteine without the oxidation of tryptophan indole and 
methionine thioether side chains, which accompanies performic acid oxidation. This is 
accomplished by sulfitolysis (i.e., treating disulfides with sulfite [$\mathrm{SO_3^{2-}}$] at alkaline pH), 
whereby S-sulfonated derivatives and thiols are formed: 


$\mathrm{RSSR} + \mathrm{SO_3^{2-}} \rightleftharpoons \mathrm{RS^-} + \mathrm{RSSO_3^-}$ 


One half of the original disulfide is converted to S-sulfocysteine and the other half is con- 
verted to the cysteine mercaptide. With a large excess of sulfite, the reaction continues until 
disulfides are completely converted into S-sulfonate. Although sulfitolysis has been used 
infrequently as a mild procedure for the cleavage of disulfide linkages in proteins, the 
S-sulfonate derivatives are stable under neutral and mildly acidic conditions and are water-soluble. 
Because the S-sulfo group is readily removed by reduction with an excess of reducing agent, 
this group has been used for the reversible protection of the thiol function of cysteine (Liu 
1977). Treatment of protein hydrolysates with dithiothreitol (DTT) followed by an excess of 
sodium tetrathionate (Na2S4O6) converts cystine and cysteine to S-sulfocysteine, a derivative 
that is suitable for quantitative analysis (Inglis and Liu 1970; Simpson et al. 1976). 

The disadvantages of performic acid oxidation are that it can lead to the modification of 
amino acids besides cystine and methionine (e.g., the destruction of the indole side chain of 
tryptophan) and cleavage of acid-labile peptide bonds (e.g., Asp-Pro bonds). Furthermore, 
PTH-cysteic acid derivatives are difficult to identify during sequencing by the Edman degra- 
dation method. Carboxymethylation (see Protocols 2 and 3) is not ideal for quantitative 
amino acid analysis involving cystine or cysteine residues because it is difficult to assess the 
completeness of the reaction. Pyridylethylation is the preferred method of cystine/cysteine 
modification both for sequence analysis and for those amino acid analyzers that use 
phenylisothiocyanate as an amino-terminal derivatizing agent (see Protocol 12). 


Performic acid oxidation cannot be applied prior to cyanogen bromide cleavage of the 
polypeptide chain (see Protocol 6) because methionine residues are converted to their 
sulfone derivatives, which, unlike the methionine thioether, are not alkylated by cyanogen 
bromide. 


Many Reagents Are Available for Reducing and Alkylating Proteins 


By far the most selective and useful method for splitting disulfide bonds in proteins is reduc- 
tive cleavage (Cecil 1963; Liu 1977). Reduction is customarily performed in the absence of 
oxygen, at a slightly alkaline pH (~pH 8) and under denaturing conditions where the protein 
is unfolded, for example, in the presence of 8 M urea or 6 M guanidine hydrochloride. The 
classical approach for reducing a protein involves treating the protein with a high concentra- 
tion of a low-molecular-weight thiol reagent. It is clear from the reaction represented below 
that a large molar excess (typically, 100-1000-fold) of thiol reagent must be used to drive the 
reaction to completion. 


$\text{protein}(\mathrm{S{-}S}) + 2\,\mathrm{RSH} \rightleftharpoons \text{protein}(\mathrm{SH})_2 + \mathrm{RSSR}$ 


Reducing agents used for this purpose include the thiols cysteine, glutathione, 
2-mercaptoethylamine, thioglycollic acid, 2-mercaptoethanol, and dithiothreitol (and its isomer 
dithioerythritol), as well as the phosphines tributylphosphine and 
tris(2-carboxyethyl)phosphine (see Figure 7.2) (Liu 1977). 

Although 2-mercaptoethanol was the traditional reducing agent for protein structural 
analysis, DTT, also referred to as Cleland's reagent after its discoverer (Cleland 1964), is now 
the reagent of choice (Konigsberg 1972). DTT (and its isomer, dithioerythritol) is a superior 
alternative to the mercaptans, requiring much lower concentrations (20-fold molar excess) 
to maintain monothiols completely in the reduced state and to reduce disulfides 
quantitatively (Cleland 1964). DTT drives the reaction to the right because its oxidized 
product forms a thermodynamically stable six-membered ring (containing a disulfide bridge) that 
is energetically favored over the mixed disulfide (Figure 7.2). Owing to its low redox potential 
(-0.33 V at pH 7), DTT can fully reduce cystine at pH 8 in a few minutes and has therefore 
gained wide acceptance. Additionally, it is a highly water-soluble solid that is resistant to air 
oxidation, and it has a relatively mild odor compared to 2-mercaptoethanol. 


FIGURE 7.2. Reducing agents commonly used in protein chemistry. (Portion reprinted, with permission, 
from Kellner et al. 1994.) 
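
To give a feel for the quantities implied by these molar excesses, the following Python sketch estimates how much reducing agent is needed for a given sample. It assumes that the excess is calculated relative to the number of disulfide bonds in the sample; the function name and the example protein (100 pmoles, three disulfides per chain) are hypothetical and serve only to illustrate the arithmetic.

```python
def reductant_nanomoles(protein_pmol, disulfides_per_chain, fold_excess):
    """Estimate nanomoles of reducing agent for a chosen molar excess.

    Assumption: the fold excess is taken over the total disulfide bonds in the sample.
    """
    if protein_pmol < 0 or disulfides_per_chain < 0 or fold_excess < 0:
        raise ValueError("inputs must be non-negative")
    disulfide_pmol = protein_pmol * disulfides_per_chain
    return disulfide_pmol * fold_excess / 1000.0  # pmoles -> nmoles


if __name__ == "__main__":
    # Hypothetical sample: 100 pmoles of a protein with three disulfide bonds
    print(reductant_nanomoles(100, 3, 20))     # DTT at a 20-fold excess
    print(reductant_nanomoles(100, 3, 1000))   # monothiol at a 1000-fold excess
```

In practice, reducing agents are usually added at a fixed working concentration that far exceeds these minimum amounts for microscale samples, so the calculation is mainly useful as a lower bound.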

Trialkylphosphines such as tributylphosphine have been used extensively for the reduc- 
tive cleavage of protein disulfides, especially wool cystine (see Maclaren and Sweetman 1966; 
Sweetman and Maclaren 1966; for review, see Rüegg and Rudinger 1977). Trialkylphosphines 
reduce disulfides rapidly and stoichiometrically in aqueous solution (see Figure 7.2); they are 
highly specific for disulfides and are essentially unreactive toward other amino acid 
functional groups found in proteins (Rüegg and Rudinger 1977). Tributylphosphine is a 
powerful reducing agent, and it can readily reduce very stable disulfides such as oxidized DTT. 
Tributylphosphine has been used to reduce proteins prior to two-dimensional gel 
electrophoresis (Herbert et al. 1998; Herbert 1999), but it has not gained universal acceptance by 
protein chemists because of its toxicity, unpleasant odor, and poor solubility in water. The 
newly introduced sulfhydryl reducing agent tris(2-carboxyethyl)phosphine (TCEP) (Burns et 
al. 1991), which is now commercially available, is a more attractive alternative because it is 
water soluble and does not have an unpleasant odor, and it is a more effective reducing agent 
than DTT or 2-mercaptoethanol (Getz et al. 1999). For a comparison of reducing agents 
available for protein chemistry, see Table 7.1. 

Procedures designed for the complete reduction of a protein usually require strong 
denaturants (see below) and elevated temperatures (e.g., 45-50°C) (Crestfield et al. 1963; 
Hirs 1967) (see Protocols 2 and 3). Even if a protein does not contain disulfide linkages, a 
reduction step should be performed, because any free thiols in the protein may have been 
oxidized during the purification procedure and thus would not be available for alkylation. By 
these means, S-sulfocysteine (and methionine sulfone) is regenerated to form cysteine and 
methionine. The extent of reduction of disulfide bonds in a protein may be estimated with 
Ellman's reagent, 5,5'-dithiobis(2-nitrobenzoic acid) (DTNB) (Habeeb 1972; Riddles et al. 
1983) (see Protocol 4). 
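
The calculation behind an Ellman's assay can be sketched in a few lines of Python. The sketch assumes the Beer-Lambert law and a molar extinction coefficient of 14,150 M^-1 cm^-1 at 412 nm for the colored product released by DTNB, a value commonly quoted in the literature; this coefficient, the 1-cm path length, and the function names are assumptions not stated in this chapter, and the values given in Protocol 4 should take precedence.

```python
# Assumed molar extinction coefficient of the DTNB reaction product at 412 nm
# (M^-1 cm^-1); a commonly quoted literature value, not taken from this chapter.
TNB_EXTINCTION = 14150.0


def thiol_molar(a412, path_cm=1.0, epsilon=TNB_EXTINCTION):
    """Free thiol concentration (M) from blank-corrected absorbance at 412 nm."""
    if path_cm <= 0 or epsilon <= 0:
        raise ValueError("path length and extinction coefficient must be positive")
    return a412 / (epsilon * path_cm)


def thiols_per_protein(a412, protein_molar, path_cm=1.0, epsilon=TNB_EXTINCTION):
    """Average number of free thiol groups per protein molecule."""
    if protein_molar <= 0:
        raise ValueError("protein concentration must be positive")
    return thiol_molar(a412, path_cm, epsilon) / protein_molar


if __name__ == "__main__":
    # Hypothetical reading: A412 = 0.1415 for a 2.5 µM protein solution
    print(f"{thiol_molar(0.1415) * 1e6:.1f} µM free thiol")
    print(f"{thiols_per_protein(0.1415, 2.5e-6):.1f} thiols per protein")
```

Comparing the thiols-per-protein value before and after reduction gives an estimate of how many disulfide bonds were cleaved (two thiols per disulfide).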

Once the reducing reagent is removed, protein thiol groups readily reform disulfide 
bonds due to air oxidation. This can be prevented by modification of the thiol groups to form 
stable derivatives (e.g., by alkylation). Several sulfhydryl alkylating groups may be used for 


TABLE 7.1. Reagents for the reductive cleavage of disulfide bonds in proteins (a) 

| Reagent | Comments | References |
|---|---|---|
| 2-Mercaptoethanol | Weak reducing agent; original reagent of choice for reducing disulfides; large molar excess required; unpleasant odor (must be used in a chemical fume hood); water soluble; inexpensive. | Liu (1977) |
| Dithiothreitol (DTT) | Stronger reducing agent than 2-mercaptoethanol; water soluble; little odor; also known as Cleland's reagent after its discoverer. | Cleland (1964); Konigsberg (1972) |
| Tributylphosphine (TBP) (b) | Powerful reducing agent (even reduces oxidized DTT); reduces disulfides stoichiometrically (1 mole of TBP reduces 1 mole of disulfide); rapid; low water solubility; unpleasant odor; toxic; works at low pH; does not react with some alkylating reagents. | Rüegg and Rudinger (1977) |
| Tris(2-carboxyethyl)phosphine (TCEP) (b, c) | Powerful reducing agent (even reduces oxidized DTT); reduces disulfides rapidly; highly water soluble; odor-free; low toxicity; works at low pH; does not react with some alkylating reagents. | Burns et al. (1991) |

(a) The extent of reduction of disulfide bonds in a protein may be estimated with Ellman's reagent, 
5,5'-dithiobis(2-nitrobenzoic acid) (DTNB) (Habeeb 1972; Riddles et al. 1983; see Protocol 4). 

(b) TBP and TCEP are compatible with many alkylating agents (e.g., 4-vinylpyridine), but not iodoacetic acid or its amide. 
Therefore, TBP (or TCEP) and 4-vinylpyridine can be added together. When using iodoacetic acid or iodoacetamide in 
conjunction with TCEP or tributylphosphine, it is advisable to carry out reduction and alkylation in two steps (Rüegg and 
Rudinger 1977). 

(c) For a comparison of the use of TCEP and DTT in protein chemistry, see Getz et al. (1999). 
this purpose following disulfide reduction. For a list of the most commonly used alkylating 
reagents, see Table 7.2. The reduced protein is usually not isolated, and alkylation is general- 
ly accomplished in the same reaction mixture containing reduced protein, denaturant, and 
excess reducing agent. The two most frequently used alkylating agents are 4-vinylpyridine 
(Raftery and Cole 1966), which yields S-pyridylethyl cysteine, and iodoacetic acid (or 
iodoacetamide), yielding S-carboxymethyl cysteine (Crestfield et al. 1963). The frequently observed 
modification of protein thiols by acrylamide monomers during gel electrophoresis has also 
been adapted for use in S-alkylation (Brune 1992). 

To cleave disulfide bonds efficiently in a protein, it is often necessary to use a denaturing 
agent that renders the protein in a conformational state that exposes these linkages 
sufficiently to the reducing agent. Denaturing agents commonly used during the 
reduction/alkylation step include urea and guanidine-HCl at concentrations in the range of 6-8 M, 
as well as detergents. Care should be exercised in selecting the appropriate grade of 
denaturant, especially when working with microscale amounts of sample. For example, the grade of 


TABLE 7.2. Reagents for the alkylation of cysteine residues in proteins 

| Reagent | Comment | References |
|---|---|---|
| Iodoacetic acid | Classical alkylating reagent for cysteines. Modified cysteines (S-carboxymethylcysteine) are negatively charged. S-carboxymethylcysteine may be quantitated in the intact protein by amino acid analysis; stable in acid. PTH derivative is subject to β-elimination during sequencing; can react with other amino acids (mainly methionine, histidine, and lysine). | Crestfield et al. (1963); Hirs (1967) |
| Iodoacetamide | Classical alkylating reagent for cysteines. S-carboxamidomethylcysteine has a neutral charge; easily quantitated in the intact protein by amino acid analysis; stable in acid. PTH derivative is subject to β-elimination during sequencing; can react with other amino acids (mainly methionine, histidine, and lysine). | |
| 4-Vinylpyridine | S-(2-pyridylethyl) cysteine is stable in acid. Well resolved in amino acid analysis; amenable to spectral quantitation in the intact protein (absorbance at 254 nm) (Renlund et al. 1990). PTH derivative is stable during sequencing and readily detected during mass spectrometry by the release of the labile 106-dalton S-pyridylethyl moiety (Moritz et al. 1996). S-pyridylethyl modification of cysteine introduces a polar nature to the side chain that increases the solubility of the protein. S-pyridylethylation must be followed immediately by separation of the reaction mixture because prolonged incubation may cause side reactions with histidine, tryptophan, and methionine. | Raftery and Cole (1966); Cavins and Friedman (1970) |
| Acrylamide | PTH derivative is stable during Edman sequencing. | Brune (1992) |
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urea is important because poor-quality material (especially aged urea) is usually contami- 
nated by cyanate ions (Stark et al. 1960; Cole 1961). Ammonium cyanate formation in fresh- 
ly prepared urea is an inherent property of the reagent that occurs upon long-term storage. 
Cyanate present in 8 M urea is sufficient to react with amino groups of proteins, rendering 
them refractory to the Edman chemistry (Stark et al. 1960} and altering the electrophoretic 
behavicr of proteins in gels containing urea {Cole and Mecham 1966). This phenomenon is 
particularly troublesome for two-dimensional gel analysis in which proteins are separated in 
the first dimension on the basis of their charge (see Chapter 4). Cyanate-free urea, which is 
commercially available, can also be produced from aged material by recrystallization from 
hexane or by simple treatment of aqueous urea solutions with a mixed bed resin (see 


Appendix 2). 


For chromatographic purposes, it is important to use ultrapure guanidine-HCl free of
metal ions and materials that absorb in the 225-300-nm range. Both high-grade urea (e.g.,
Sequanal grade from Pierce) and guanidine-HCl suitable for microscale protein manipulation
and sequencing are commercially available either as the free salt or, in the case of
guanidine-HCl, as an aqueous 8.0 M solution that can be diluted with an appropriate buffer
to the desired molarity.
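As a worked illustration of diluting the 8.0 M guanidine-HCl stock, the sketch below applies the
usual relation C1 x V1 = C2 x V2. It is a simple aid under stated assumptions (volumes are treated
as additive, and the function name dilution_volumes is hypothetical); it does not account for the
final concentration of the buffer components themselves.

```python
def dilution_volumes(stock_molar, target_molar, final_volume_ml):
    """Return (stock_ml, diluent_ml) needed to reach target_molar in final_volume_ml."""
    if not 0 < target_molar <= stock_molar:
        raise ValueError("target must be positive and no higher than the stock")
    if final_volume_ml <= 0:
        raise ValueError("final volume must be positive")
    # C1 * V1 = C2 * V2, solved for the stock volume V1.
    stock_ml = target_molar * final_volume_ml / stock_molar
    return stock_ml, final_volume_ml - stock_ml


# Example: 10 ml of 6 M guanidine-HCl from the 8.0 M stock.
stock_ml, diluent_ml = dilution_volumes(8.0, 6.0, 10.0)
print(f"{stock_ml:.2f} ml stock + {diluent_ml:.2f} ml buffer")
```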


Procedures for the reduction and alkylation of proteins at large-scale levels (≥1 mg) and
microscale levels (1-20 µg) are given in Protocols 2 and 3, respectively.


THE POLYPEPTIDE CHAIN MUST BE FRAGMENTED BY ENZYMATIC OR 
CHEMICAL MEANS 


A fundamental step in peptide mapping is the fragmentation of the polypeptide chain to
yield low-molecular-weight peptides that are conducive to sequence analysis by Edman
chemistry and mass spectrometric methods. Either enzymatic or chemical means can
accomplish this. Different proteases of high and reproducible specificity, like trypsin, are
commonly used to fragment proteins. In general, proteases like trypsin can be used to produce
small peptides, which are an advantage for proteomic analysis methods such as mass spectrometry
(see Chapter 8). Chemical cleavage is usually a compromise between efficient cleavage and
specificity. The most common chemical cleavage method employs cyanogen bromide, which
is specific for Met-X bonds. Due to the low abundance of Met residues in a protein, cyanogen
bromide cleavage usually yields large peptide fragments. Although large peptides are
preferred for Edman degradation, such peptides are often hydrophobic and difficult to purify.


Fragmenting Proteins by Enzymatic Means 


Ideally, for peptide mapping studies, proteolytic cleavage should be selective as well as
specific. The selectivity of a protease reflects the number of sites in the substrate at which
peptide bond hydrolysis may occur. The complexity of the peptide map can thus be anticipated
from the selectivity of the protease and the amino acid composition of the protein. Proteases
can be selected to cleave at either basic, acidic, or hydrophobic side chains. Commonly used
proteases that are commercially available are described in Tables 7.3 and 7.4. The nomenclature
used for protease selectivity was introduced in 1967 by Schechter and Berger and is now
widely used in the literature (Schechter and Berger 1967) (see Figure 7.3). For an estimate of
the number and length of peptide fragments for a hypothetical 300-residue protein, see Table
7.5.
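The Schechter and Berger numbering can be made concrete with a short sketch. The function below
is an illustration only (the name subsite_labels and its parameters are assumptions, not part of
the cited nomenclature paper): given a sequence and the position of the scissile bond, it labels
the flanking residues P3 to P1 on the amino-terminal side and P1' to P3' on the carboxy-terminal
side.

```python
def subsite_labels(sequence, bond_index, span=3):
    """Label residues around the scissile bond.

    The bond lies between sequence[bond_index - 1] (P1) and
    sequence[bond_index] (P1'). Returns a list of (label, residue) pairs.
    """
    if not 0 < bond_index < len(sequence):
        raise ValueError("bond_index must fall between two residues")
    labels = []
    # Amino-terminal side, written from the farthest residue down to P1.
    for offset in range(span, 0, -1):
        position = bond_index - offset
        if position >= 0:
            labels.append((f"P{offset}", sequence[position]))
    # Carboxy-terminal side, written from P1' outward.
    for offset in range(1, span + 1):
        position = bond_index + offset - 1
        if position < len(sequence):
            labels.append((f"P{offset}'", sequence[position]))
    return labels


# Cleavage between Arg and Leu in the peptide AGKRLSV.
for label, residue in subsite_labels("AGKRLSV", 4):
    print(label, residue)
```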

The specificity of a protease reflects the efficiency of cleavage at susceptible sites (see
Tables 7.3 and 7.4). Highly specific proteases effect complete scission of certain peptide bonds
in the polypeptide substrate without partially hydrolyzing any other bonds. Thus, specific
cleavage results in each region of the polypeptide chain being represented by a single peptide
with no overlapping peptides (i.e., two or more peptides representing the same region of the
amino acid sequence) or redundant peptides.


TABLE 7.3. Selected chemical and enzymic cleavage reagents for protein chains

Reagent                     Site                      Comments

Enzymatic cleavage
Trypsin                     Arg, Lys                  Specific, often used first; bonds to proline are not cleaved; aminoethylcysteine provides additional site.
Endoproteinase Lys-C        Lys                       Very high specific activity; very effective in gel digests.
Clostripain                 Arg                       Lys bonds remain intact; yields may be lower than with trypsin; cleaves Arg-Pro.
A. mellea protease          Lys                       Splits on amino side; sometimes after Arg.
S. aureus protease          Glu                       Sometimes cleaves after Asp; also cleaves cysteic acid.
Endoproteinase Asp-N        Asp                       Cleaves amino-terminal to Asp; also cleaves cysteic acid.
Asparaginylendopeptidase    Asn-X                     pH 5.5-6.5; stable to denaturants.
Chymotrypsin                Trp, Tyr, Phe, Leu, Met   Largely specific for hydrophobic residues; also useful for limited digestion applications.
Thermolysin                 hydrophobic               Splits on amino side; may be used at 60°C.
Pepsin                      Phe-Phe                   Relatively nonspecific; prefers aromatics or Leu; low pH optimum; no disulfide interchange occurs.
Subtilisin, Elastase        various                   Nonspecific; for providing small peptides.
Thrombin, Bromelain         Arg-Gly, Gly-Val          Limited proteolytic activity; give large fragments.

Chemical-enzymic cleavage
Succinylation-trypsin       Arg                       Lysine blocked; changes its polarity; citraconylation provides reversible modification.

Chemical cleavage
Cyanogen bromide            Met                       Converted to homoserine lactone; yields >70%; Ser, Thr bonds difficult; Trp-X can cleave.
Dilute acid (pH 2)          Asp                       Use formic acid for insoluble proteins and dilute; some amide cleavage; yields >60%.
Formic acid (80%)           Asp-Pro                   Yields of 40% with high specificity.
HCl (6 M)                   multiple                  For random generation of small peptides.
Hydroxylamine               Asn-Gly                   Also breaks imide link to Gly; yields ~50%.
DMSO/HCl/HBr                Trp                       Several alternative reagents available; oxidation and modification of amino acids likely.
NTCB                        Cys                       Splits on the amino side; Cys amino blocked.

Cleavage occurs at the carboxyl end of the amino acid unless indicated otherwise.


TABLE 7.5. Calculated number and length of peptide fragments for a hypothetical 300-residue protein

Amino acid            Specific cleavage            No. fragments    Average length
Phe, Tyr, Trp, Leu    Chymotrypsin                 54               6
Lys, Arg              Trypsin                      35               9
Glu                   Glu-C                        20               15
Lys                   Lys-C                        19               16
Arg                   Arg-C                        17               18
Asp                   Asp-N                        17               18
Met                   Cyanogen bromide (CNBr)      8                38
Trp                   BNPS-skatole                 5                60

The hypothetical 300-residue protein was calculated from the NBRF-PIR protein database by the total number of each amino
acid versus the number of database entries. (Reprinted, with permission, from Kellner 1994.)
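The kind of estimate given in Table 7.5 can be reproduced for any sequence with a simple in silico
digest. The sketch below is a simplified illustration, not a validated tool: the rule table RULES
and the functions digest and summarize are assumptions introduced here, the cleavage rules are
reduced to the primary sites listed in Table 7.3, and complete cleavage at every site is assumed.

```python
# reagent: (target residues, side of the residue that is cut, blocked by a following Pro)
RULES = {
    "trypsin": ("KR", "C", True),
    "Lys-C": ("K", "C", False),
    "Arg-C": ("R", "C", False),
    "Glu-C": ("E", "C", True),
    "chymotrypsin": ("FYWL", "C", False),
    "Asp-N": ("D", "N", False),
    "CNBr": ("M", "C", False),
}


def digest(sequence, reagent):
    """Return the fragments expected from complete cleavage of sequence."""
    residues, side, proline_block = RULES[reagent]
    cut_points = []
    for index, residue in enumerate(sequence):
        if residue not in residues:
            continue
        # Index at which the next fragment starts.
        cut = index + 1 if side == "C" else index
        if cut <= 0 or cut >= len(sequence):
            continue  # nothing to cut at the chain termini
        if proline_block and sequence[cut] == "P":
            continue  # e.g., trypsin does not cleave bonds to proline
        cut_points.append(cut)
    starts = [0] + cut_points
    ends = cut_points + [len(sequence)]
    return [sequence[start:end] for start, end in zip(starts, ends)]


def summarize(sequence, reagent):
    """Return (number of fragments, average fragment length)."""
    fragments = digest(sequence, reagent)
    return len(fragments), len(sequence) / len(fragments)


example = "AKRPGEDLMKFR"  # hypothetical test peptide
for reagent in ("trypsin", "Asp-N", "CNBr"):
    print(reagent, digest(example, reagent), summarize(example, reagent))
```

Real digests deviate from such predictions because of missed cleavages, secondary specificities,
and the accessibility of sites, so the output is best read as an upper-level planning estimate.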

Proteolytic cleavage is facilitated by first denaturing the polypeptide chain of interest
and, if a protein contains disulfide bonds, reducing the -S-S- bridges and alkylating the
cysteines (Crestfield et al. 1963). Susceptible protease cleavage sites are not equally accessible
in native proteins, and denaturation is required to disrupt the secondary and tertiary structures
of the protein so that its conformation assumes that of a random coil. However, peptide
mapping strategies that involve proteolytic cleavage of a protein under nonreducing conditions
are extremely useful for determining the cysteine connectivities (disulfide bonds) in a
protein. (Knowledge of disulfide linkages is a useful prerequisite for determining the
three-dimensional structure of proteins [see below, Peptide Mapping Can Be Used to Determine
Disulfide Linkages in Proteins; also Simpson et al. 1988; Cole et al. 1999; Moritz et al. 2001].)


[Schematic: the enzyme is drawn above the substrate backbone
-NH-CH-CO-NH-CH-CO-NH-CH-CO-NH-CH-CO-NH-CH-CO-NH-CH-CO-, with an arrow marking the scissile bond.]


FIGURE 7.3. The Schechter and Berger nomenclature for the description of protease subsites. In this
system, it is considered that the amino acid residues of the polypeptide substrate bind in enzyme subsites of
the active site. By convention, these subsites on the protease are called S (for subsites) and the substrate
amino acid residues are called P (for peptide). The amino acid residues of the amino-terminal side of the
scissile bond are numbered P3, P2, and P1, and those residues of the carboxy-terminal side are numbered
P1', P2', P3', ... . P1 or P1' are those residues located adjacent to the scissile bond. The substrate residues
around the cleavage site can then be numbered up to P8. The subsites on the protease that complement the
substrate-binding residues are numbered S3, S2, S1, S1', S2', S3', ...


TABLE 7.6. Optimized proteolysis conditions in the presence of chaotropes, detergents, and organic solvents

Enzyme
Conditions  Trypsin  Chymotrypsin  Lys-C (Achromobacter protease)  Glu-C (V8)  Subtilisin
Native
  buffer  0.1 M AB(a)  0.1 M AB  0.1 M AB  0.05 M AB  0.1 M AB
  time  2 hours  2 hours  2 hours  5 hours  1 hour
Urea  4 M  2 M  8 M  2 M (R)(b)  8 M (R)
  buffer  +5 mM Ca2+  0.2 M TC  0.1 M TC
  time  15 hours  5 hours  5 hours  5 hours  1 hour
Guanidine-HC] (1-2 M) IM 2M i Įm CR) 2M(R) 
  buffer  0.2 M TC  0.2 M AB  0.2 M TC
  time  24 hours  24 hours  5 hours  18 hours  2 hours
SDS  <0.1%  <0.1% (R)  1%  0.1% (R)  1%
CHAPS  2%  1%  2%  2% (R)  2% (R)
Acetonitrile  40%  30%  40%  20% (R)  40%


Adapted from Riviere et al. (1991).

Buffers and incubation times are the same as those under native conditions, except where indicated. The highest
concentrations of excipients permitting adequate proteolysis are listed; concentrations higher than 2% CHAPS and 40%
acetonitrile are not listed.

(a) AB = Ammonium bicarbonate; TC = Tris-Cl.

(b) R = Restricted digest.

(c) For a comprehensive study on the effects of organic cosolvents, see Welinder (1988).


Optimized proteolysis conditions for a number of commonly used proteases, including 
their activity in the presence of widely used denaturants, are given in Table 7.6. Additional 
details about proteolytic enzymes are contained in the information panel on NOMENCLATURE 
ON PROTEASES, PROTEINASES, AND PEPTIDASES at the end of this chapter. 


USEFUL WEB SITES FOR OBTAINING INFORMATION ABOUT PROTEASES

Enzyme: http://www.expasy.ch/enzyme/ or http://www.chem.qmw.ac.uk/iubmb/enzyme/
Enzyme nomenclature. Primarily based on the recommendations of the nomenclature
committee of IUBMB, and it describes each type of characterized enzyme for which an
EC (Enzyme Commission) number has been provided.

Prolysis: http://delphi.phys.univ-tours.fr/Prolysis/
Protease/peptidase nomenclature, classification, and chemistry (T. Moreau).

Prowl: http://prowl.rockefeller.edu/recipes/contents.htm
Protocols for using proteases for mass spectrometric analysis (R.C. Beavis).

MEROPS: http://www.merops.ac.uk
Catalog and structure-based classifications of peptidases (i.e., all proteolytic enzymes).


For general reviews of proteolytic enzymes and their uses, see Lorand (1981) and Bond and
Beynon (1989).

How to choose a proteolytic enzyme


One of the most commonly used and best-characterized proteinases in protein structural
analysis is trypsin (EC 3.4.21.4); the use of this enzyme is described in Protocol 5. Other
useful proteinases for fragmenting proteins are listed in Tables 7.3 and 7.4, and optimized
conditions for their use are given in Table 7.6. For very basic proteins that are rich in lysine
and/or arginine residues, trypsin can generate a large number of peptides of small average
size that may not be amenable to sequence analysis using the Edman procedure (or mass
spectrometry sequencing procedures). The action of trypsin can be modified to restrict
cleavage to arginine residues alone by selectively modifying all lysine residues in the protein
substrate. Two common methods for modifying lysyl side chains, succinylation and citraconylation
(the latter has the added advantage of being fully reversible under mildly acidic conditions),
are described in the additional protocols in Protocol 5. The need for lysyl side-chain
modification strategies has lessened in recent times due to the advent of commercially
available proteinases that selectively cleave only lysyl bonds (Lys-C, EC 3.4.21.50; also
called lysyl endopeptidase) or arginyl bonds (Arg-C from mouse submaxillary gland, EC 3.4.22.8,
and Clostripain, EC 3.4.22.8). Note that Clostripain, unlike trypsin, cleaves Arg-Pro bonds.
For properties and optimal cleavage conditions, see Table 7.4.

A large variety of proteinases can be used in the same manner as trypsin; attention must 
be paid to the buffer (optimal pH) conditions (see Tables 7.3, 7.4, and 7.6). Described below 
are other commonly used proteinases. 


* Glu-C (EC 3.4.21.19) (or "V8" protease) isolated from Staphylococcus aureus strain V8 is a
very useful proteinase because, like trypsin (and Lys-C and Arg-C), it shows good specificity
(Houmard and Drapeau 1972; Drapeau 1977). It cleaves on the carboxy-terminal side
of glutamyl residues (using 50 mM ammonium bicarbonate, pH 7.8, or 50 mM ammonium
acetate, pH 4.0); however, this proteinase will cleave Asp-P1' bonds (albeit at low frequency)
as well as Glu-P1' bonds in phosphate buffer (50 mM) at pH 7.8. Glu-C will not cleave
Glu-Pro bonds, and potentially labile bonds in a multiple sequence (e.g., Glu-Glu-X) may
not each be cleaved in a good yield. Endoproteinase Glu-C is active in 0.1-0.2% SDS, 2 M
urea, 1 M guanidine-HCl, and 10% acetonitrile. Its amino acid sequence is known (Drapeau
1978; SwissProt accession number P04188).


* Asp-N endoproteinase (EC 3.4.24.33) from a Pseudomonas fragi mutant is a metalloproteinase
with excellent specificity. In phosphate, acetate, or Tris buffers at pH 6.0-8.5, it
cleaves peptide bonds amino-terminally at aspartic acid or cysteic acid (Drapeau 1980).
Asp-N will cleave P1-Glu bonds (at very low frequency) as well as P1-Asp bonds if high
enzyme:substrate ratios and long incubation times are employed. Additional cleavage at
glutamyl residues can be prevented by reducing the enzyme concentration (enzyme:substrate
ratio of 1:1000) at an incubation time of 2-6 hours. Endoproteinase Asp-N is active
in 0.1% (w/v) SDS, 1 M urea, 1 M guanidine-HCl, and 10% acetonitrile. Partial amino acid
sequences of Asp-N have been described previously (Hagmann et al. 1995; Hagmann
1998).


In addition to proteinases with high selectivity, it is worth noting two enzymes of
broad specificity, namely, pepsin and thermolysin.


* Pepsin (EC 3.4.23.1) works at very low pH (pH 2-3), which makes it suitable for disulfide
bond determination studies because, at low pH, disulfide bonds rearrange less frequently
than under alkaline conditions. Moreover, low-pH conditions may favor the solubility of
some proteins.


* Thermolysin (EC 3.4.24.4) is extremely useful because it is active at high temperatures (up
to 80°C) and in 8 M urea. Thus, thermolysin is active under conditions highly favorable to
protein unfolding. Thermolysin has broad specificity for the amino-terminal side of
hydrophobic residues, particularly leucine, isoleucine, phenylalanine, and valine, in
descending order of preference (Heinrikson 1977). Many reports in the literature describe
thermolysin as successful in fragmenting proteins that were refractory
to commonly used endoproteinases (see, e.g., Du et al. 1996).


Other broad-specificity endoproteinases include papain (useful for the preparation of
F(ab')2 fragments), proteinase K, and subtilisin (for properties, see Tables 7.3 and 7.4). For
example, papain activated prior to use with cysteine, at pH 5.5 (0.1 M acetate, 3 mM EDTA),
is added to immunoglobulin G (IgG) (concentration of ~10 mg/ml) at 37°C at time zero, and
again later (after 9 hours) at an enzyme:substrate ratio of 1:20 (w/w) (Parham et al. 1982).
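Enzyme:substrate ratios such as the 1:20 (w/w) quoted for papain or the 1:1000 quoted for Asp-N
translate directly into the mass of enzyme to add. The helper below is a small arithmetic sketch;
the function name enzyme_amount is an assumption, and whether the stated ratio applies to each
addition or to the total is left to the cited protocol.

```python
def enzyme_amount(substrate_mass, enzyme_parts, substrate_parts):
    """Mass of enzyme (same unit as substrate_mass) for a w/w enzyme:substrate ratio."""
    if substrate_mass < 0 or enzyme_parts <= 0 or substrate_parts <= 0:
        raise ValueError("masses and ratio parts must be positive")
    return substrate_mass * enzyme_parts / substrate_parts


# 10 mg of IgG (10,000 µg) at 1:20 (w/w), and 50 µg of protein at 1:1000 (w/w).
print(enzyme_amount(10_000, 1, 20), "µg papain")
print(enzyme_amount(50, 1, 1000), "µg Asp-N")
```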


In-gel proteolytic digestion 


Most "sequencing-grade" endoproteinases used in conventional solution digests (e.g.,
trypsin, Staphylococcus protease V8, endoproteinase Lys-C, and endoproteinase Asp-N) give
comparable peptide maps for in-gel digests. Of these, endoproteinase Lys-C (Achromobacter
protease I) gives considerably higher in-gel cleavage efficiencies. Lys-C also retains full
activity in the presence of 0.1% SDS, whereas other proteases are inhibited to varying extents by
this detergent, trypsin being the most sensitive. A serious disadvantage of Lys-C is that
several of the generated peptides will contain internal arginine residues that complicate the
interpretation of mass spectrometric collision-induced dissociation fragmentation patterns
(see Chapter 8). This, in turn, limits the usefulness of some mass-spectrometric-based
peptide identification procedures. For these reasons, trypsin is recommended for methods
relying on mass-spectrometric-based identification procedures. Prior to in-gel digestion, it is
customary to first visualize the protein and, if necessary, concentrate it to optimize yields of
peptide fragments.


Visualization of Gel-separated Proteins 


The most widely used method for the detection of proteins in primary gels is the
conventional Coomassie Blue staining procedure (Wilson 1983), which can detect proteins in the
low-microgram range (~0.5-1.0 µg) (see Chapters 2 and 4). Care must be exercised with this
staining method (see Protocol 9), because overstaining can interfere with subsequent in-gel
proteolysis and peptide mapping steps. To minimize these potential problems, the gel should
be stained for only 20 minutes; the gel background should be destained overnight to visualize
the proteins and remove excess Coomassie Blue. After the protein spot has been excised,
additional extended washing steps must be performed with deionized H2O to remove
Coomassie Blue associated with the protein and excess SDS.

Another category of protein visualization methods gaining wide acceptance is negative
staining, which relies on selective precipitation of metal ions in gels, leaving protein bands
unstained and readily amenable for structural analysis. Common negative staining methods
include potassium acetate/chloride (Nelles and Bamburg 1976; Bergmann and Jornvall
1987), sodium acetate (Higgins and Dahmus 1979), copper chloride (Lee et al. 1987), aurodye
(Casero et al. 1985), zinc chloride (Lonnie and Weaver 1990), and one of the most widely
used methods, zinc-imidazole (Fernandez-Patron et al. 1995a,b) (see Protocol 10), which is
approximately ten times more sensitive than Coomassie Blue. Like Coomassie Blue, this
negative staining procedure does not result in detectable protein modification. A sensitive (~0.5
ng) procedure for the negative staining of proteins, based on the precipitation of
methyltrichloroacetate in gels, has been reported by Candiano et al. (1996). In this procedure,
after separation, gels are incubated with 8% methyltrichloroacetate ester in 38% isopropanol and
then washed in H2O to produce a negative image of colorless proteins against an opaque
background (Candiano et al. 1996). Because this process is reversible, gels can be restained
(e.g., silver staining, see Protocol 11) after rapid visualization.

Silver staining is the most widely used high-sensitivity staining method for the detection
of proteins in polyacrylamide gels (Switzer et al. 1979), with a sensitivity reported to be ~100
times better than Coomassie Blue (for detailed reviews, see Rabilloud 1990; Rabilloud et al.
1994). Silver staining has not been used for protein microanalysis until recently because the
detection limits (~1-10 ng) were much too low for conventional sequence analysis by
automated Edman degradation. However, with the improved sensitivity of protein structural
analysis by mass spectrometric methods (low femtomole levels) (McCormack et al. 1997),
silver staining is gaining broader acceptance (Muzio et al. 1996; Shevchenko et al. 1996).


The silver staining procedure designed for visualizing proteins for mass spectrometric
purposes is significantly less sensitive than the classical staining method, due to the
elimination of the glutaraldehyde fixing step (see Rabilloud 1990, 1999, 2000, 2001;
Rabilloud et al. 1994; Shevchenko et al. 1996).


When using any silver staining protocol for primary structural analysis, a critical
parameter is chemical modification of the protein. Such modifications are likely to occur during
the presensitizing step, which involves pretreatment of the gel with sensitizers (e.g.,
sulfosalicylic acid, sodium thiosulfate, DTT, glutaraldehyde, and chelators) (for a review on
various silver staining protocols, see Rabilloud 1990; Rabilloud et al. 1994) to improve the
contrast (and hence sensitivity) between the stained protein and the gel background. The "acidic"
silver staining method (Rabilloud 1990, 2000), adapted for primary structural analysis by
omission of the fixation/sensitization treatment with glutaraldehyde, is described in Protocol 11
(for additional silver staining methods compatible with mass spectrometry, see Chapter 2,
Protocol 7 and Chapter 4, Protocol 12). This silver staining method has been used recently
for a number of biological applications (Muzio et al. 1996; McCormack et al. 1997) and
avoids protein modification.

Empirical observations from a number of proteomics laboratories indicate that
preconcentration of stained excised gel spots (or bands) prior to electroblotting onto PVDF is
necessary to achieve high initial sequencing yields. Similarly, preconcentration prior to in-gel
proteolytic (or chemical) fragmentation is necessary to obtain high peptide yields. For a
discussion of methods suitable for concentrating gel-resolved proteins, see Chapter 6, and for a
description of a method, see Protocol 4 in Chapter 6.


Internal Sequencing of Gel-separated Proteins


Several methods have been reported for obtaining peptides from proteins separated by
polyacrylamide gel electrophoresis. Earlier approaches involved the electrophoretic or passive
extraction of proteins from the gel, followed by proteolytic digestion in solution (Hunkapiller
et al. 1983; Simpson et al. 1987; Ward et al. 1990). These approaches were often technically
difficult due to the problem of removing detergents that interfere with both proteolytic
digestion and chromatographic fractionation of peptide mixtures (Simpson et al. 1987), and they
have now been superseded by two general techniques:

* In-gel digestion and extraction of the resultant peptides (Eckerskorn and Lottspeich 1989;
Kawasaki et al. 1990; Ward et al. 1990; Rosenfeld et al. 1992; Hellman et al. 1995; Moritz et
al. 1995).


* In situ cleavage of proteins electroblotted onto inert membranes such as nitrocellulose
(Aebersold et al. 1987) or PVDF (Fernandez et al. 1992).


Methods for chemical cleavage of proteins (e.g., cyanogen bromide) have been reported
for the following approaches: in situ on the sample disk of a gas-phase sequencer (Simpson
and Nice 1984), in-gel (Jahnen et al. 1990), and on-membrane (Stone et al. 1992). Although
each of these procedures has some disadvantages, the recent impetus has been toward the
in-gel approach, due mainly to protein losses incurred by overblotting as well as inefficient
transfer to the membrane (particularly of high-Mr proteins), i.e., huge losses incurred by
electroblotting onto inert membranes. A comparison of peptide maps of proteins digested
in-gel and on-membrane usually reveals fewer peptides for the on-membrane-digested
proteins. Anecdotal observations suggest that this is due to the protease having limited access to
the substrate embedded in the nitrocellulose or PVDF membrane. Several attempts have been
made to improve digests of membrane-bound proteins, including the use of a cationic PVDF
membrane, which allows high recovery of peptides (Patterson et al. 1992), and the use of the
detergent Zwittergent 3-16 (Lui et al. 1996). For reviews of in-gel and on-membrane
digestion procedures, see Patterson and Aebersold (1995) and Williams et al. (1993).


Reduction/Alkylation and Proteolytic Digestion of Protein Gel Spots 


Crucial to many of the peptide-based methods for identifying proteins separated by
two-dimensional electrophoresis is the ability to achieve efficient digestion of proteins
immobilized in the acrylamide gel matrix or upon a blotting membrane. It is well recognized that
attempts to digest mature proteins composed of disulfide-bonded structures (in contrast to
nondisulfide-bonded proteins) are often unsuccessful, regardless of whether the digest was
performed on-membrane or in-gel. Moreover, for disulfide-bonded proteins that do digest,
the peptide maps are often extremely complex. For these reasons, it is useful to reduce and
alkylate gel-resolved proteins prior to proteolytic digestion. This extra manipulation
step prior to performing in-gel digestion has the additional advantage of lowering the level
of background artifacts (Moritz et al. 1996), an important consideration for any subsequent
mass spectrometric analysis. A number of procedures for in situ reduction and alkylation of
proteins on PVDF blots (Iwamatsu 1992; Henzel et al. 1993; Jenö et al. 1995) and in-gel
digestion protocols (Moritz et al. 1996) have been described. A procedure for the
S-pyridylethylation of proteins in intact polyacrylamide gels that is compatible with subsequent
mass spectrometric analysis techniques is given in Protocol 12 (see Figure 7.4). Procedures for
performing in-gel and on-membrane digestion of gel-separated proteins are described in
Protocols 13 and 14, respectively.


Fragmenting Proteins by Chemical Means 


There are a variety of chemical reactions known to cleave polypeptide chains. Some of these
methods are nonspecific (e.g., 6 M HCl for 24 hours at 110°C hydrolyzes a protein into its
constituent amino acids), whereas others cleave at specific amino acid residues (see Tables 7.3
and 7.7). A variety of chemical methods for cleavage have been reviewed (Fontana and Gross
1986). Unfortunately, the high specificity required for a fragmentation technique to be
useful for peptide mapping (i.e., specificity in peptide bond cleavage) is rarely achievable, with
the exception of cyanogen bromide cleavage of methionyl bonds. Most chemical fragmentation
methods suffer from one or more of the following disadvantages:


* low cleavage yield

* lack of specificity

* undesirable side reactions

* wide variability in the reactivity of the sensitive bond (Kasper 1975)


[Flow diagram: whole-gel reduction and alkylation of electrophoretically separated proteins in
CBR-250-stained 1-DE or 2-DE gels; excision of individual gel pieces containing CBR-250-stained
protein; in-gel digestion; extraction of peptides from gel piece; rapid RP-HPLC separation of
peptides (chromatogram axis: retention time, min); identification of proteins by a combination of
Edman degradation, MS, and tandem MS/MS.]


FIGURE 7.4. Flow diagram for whole-gel reduction, S-pyridylethylation, in situ digestion, and
identification of acrylamide gel-resolved proteins. (Reprinted, with permission, from Moritz et al. 1996.)


TABLE 7.7. Chemical methods for selective peptide bond cleavage in proteins

Reagent: Cyanogen bromide (CNBr)
Peptide bond cleaved: -Met-X-
Abbreviated method: Incubate in CNBr (20-100-fold molar excess over Met) in acid (e.g., 70%
formic acid or 85% TFA) for 18-24 hours at room temperature in the dark.
Comments: Excellent specificity and yield. Met-Ser and Met-Thr may give <100% yield. Cys may be
oxidized to cysteic acid, and acid-labile bonds (e.g., Asp-Pro) cleaved. Excess CNBr can cause
cleavage (and degradation) of Trp and Tyr side chains. (Note: CNBr is toxic and must be used in
a chemical fume hood.)

Reagent: Hydroxylamine
Peptide bond cleaved: -Asn-Gly-
Abbreviated method: Incubate in 2 M hydroxylamine, 6 M guanidine-HCl (adjusted to pH 9 with
4 M LiOH) for 4 hours at 45°C.
Comments: Good specificity. Some Asn-Gly bonds are resistant to cleavage, and some Asn-X bonds
may cleave.

Reagent: BNPS-skatole
Peptide bond cleaved: -Trp-X-
Abbreviated method: Incubate in BNPS-skatole (100-fold molar excess over Trp) in 50% (v/v)
acetic acid for 48 hours at 25°C in the dark.
Comments: BNPS-skatole reagent is unstable under acidic conditions and must be freshly prepared
prior to use to minimize side reactions (e.g., with Tyr, Met, and Cys oxidation).


Cyanogen bromide cleavage


The most specific and generally applicable chemical fragmentation method available is
cyanogen bromide cleavage of the polypeptide chain at methionyl residues (Gross and
Witkop 1961) (see Protocol 6). Cyanogen bromide reacts with the sulfur of the thioether side
chain of methionine to yield a mixture of homoserine and homoserine lactone plus
methylthiocyanate (Gross and Witkop 1961). When the reaction occurs with methionine in
peptide linkage, the bond involving the carboxyl group of methionine is cleaved (see the
reaction mechanism in Figure 7.5). All peptide fragments produced by cyanogen bromide
treatment of a protein contain a carboxy-terminal homoserine or its lactone, except the
carboxy-terminal peptide of the polypeptide chain (unless methionine is the carboxy-terminal amino
acid). It is difficult to find chromatographic conditions for the separation of homoserine
lactone and homoserine during amino acid analysis using ion-exchange chromatography,
because homoserine lactone elutes with the basic amino acids between NH3 and arginine,
and homoserine coelutes with glutamic acid. Thus, it is useful to convert homoserine lactone
to the free acid, by treating homoserine lactone with 0.1 M pyridine acetate (pH 6.5) for 1
hour at 105°C (Ambler 1965) or by dissolving it in 0.1 M NaOH for 1 hour at room
temperature (Schechter et al. 1976).

Because methionine sulfoxide does not react with cyanogen bromide, the sample may
need to be reduced (e.g., with β-mercaptoethanol) prior to treatment with cyanogen bromide
to improve cleavage yields (methionine sulfoxide is readily converted to methionine by
reduction). The cleavage yields at Met-Ser and Met-Thr bonds may be low because of the
involvement of the β-hydroxyl groups of seryl and threonyl residues in alternate reactions
that do not lead to cleavage (see the reaction mechanism in Figure 7.5).
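
The cleavage rule described above (scission on the carboxyl side of each methionine, with potentially low yields at Met-Ser and Met-Thr) can be sketched as a simple in silico digest. This is only an illustrative sketch: the function names and the example sequence are hypothetical, and the code predicts fragment boundaries only, not yields or the homoserine/homoserine lactone modification of the new carboxyl termini.

```python
def cnbr_fragments(sequence: str) -> list[str]:
    """Split a one-letter protein sequence after every internal Met (M).

    A carboxy-terminal Met produces no additional fragment.
    """
    fragments, start = [], 0
    for i, residue in enumerate(sequence):
        if residue == "M" and i < len(sequence) - 1:
            fragments.append(sequence[start:i + 1])
            start = i + 1
    fragments.append(sequence[start:])
    return fragments


def low_yield_sites(sequence: str) -> list[int]:
    """Return 1-based positions of Met residues followed by Ser or Thr."""
    return [i + 1 for i in range(len(sequence) - 1)
            if sequence[i] == "M" and sequence[i + 1] in "ST"]


if __name__ == "__main__":
    seq = "GAMSLKMTPEMAR"  # hypothetical sequence, for illustration only
    print(cnbr_fragments(seq))
    print(low_yield_sites(seq))
```

Flagging the Met-Ser and Met-Thr positions separately may help when a predicted fragment is missing from the map and a larger, uncleaved peptide appears instead.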


Hydroxylamine cleavage of Asn-Gly bonds 


At alkaline pH, hydroxylamine can be used to specifically cleave Asn-Gly bonds in proteins 
to generate peptide fragments (Blumenfeld et al. 1965; for a review, see Bornstein and Balian


[Figure 7.5 reaction scheme, structures (1) to (6): sulfonium ion formation, release of methyl thiocyanate with iminolactone ring formation, homoseryl residue, and homoseryl lactone residue.]


FIGURE 7.5. Reaction mechanism for cyanogen bromide cleavage. The nucleophilic thioether sulfur from the methionine side
chain reacts with cyanogen bromide (1) to form a sulfonium ion (2). Methyl thiocyanate is released while an intermediate imino
ring (iminolactone) is formed involving the carbonyl group from methionine (3). The iminolactone is hydrolyzed and the Met-X
bond is cleaved (4), resulting in a homoserine at the carboxyl terminus and the release of a new peptide fragment (with a free
α-amino terminus) (5). Homoserine and homoserine lactone are interconvertible and form a mixture (6). (Adapted from Inglis and
Edman 1970; modified, with permission, from Kellner et al. 1994.)
1977) (see Protocol 7). The Asn-Gly sequence in proteins occurs with low frequency (about
every 350 amino acids) and thus usually results in the generation of very large peptide
fragments. The reaction mechanism (Figure 7.6) involves the attack of the side-chain carbonyl
group of asparagine by the carboxy-terminally adjacent backbone amine group to form a 
cyclic imide (succinimide) intermediate. Under the nucleophilic attack of hydroxylamine at 


[Figure 7.6 reaction scheme: asparaginyl-glycyl sequence (1); cyclic imide (succinimide) formation with loss of NH3 (2); hydroxylamine cleavage to α-aspartyl hydroxamate (3), β-aspartyl hydroxamate (4), and amino-terminal glycine (5); deamidation or isomerization by hydroxide ion to aspartyl-glycyl (6) and isoaspartate (7).]


FIGURE 7.6. Cleavage of Asn-Gly bonds using hydroxylamine. In this reaction, the side-chain carbonyl group of asparagine is
subjected to nucleophilic attack by the α-amino group of the carboxy-terminally neighboring residue (glycine) (1) to form the cyclic
imide, anhydroaspartylglycine (2). Cleavage with hydroxylamine yields (through the intermediary of the hydroxamate) a mixture
of α-aspartyl (3) and β-aspartyl (4) hydroxamates and a new amino-terminal glycine (5). Alternatively, nucleophilic attack of the
cyclic imide (succinimide) intermediate (2) by hydroxide ion leads to a mixture of aspartate (6) and isoaspartate (7) residues.
Isoaspartate residues are refractory to the Edman degradation procedure and will block ongoing amino-terminal sequence
analysis. (Modified, with permission, from Kellner et al. 1994.)
alkaline pH, the Asn-Gly peptide bond can be hydrolyzed to form a mixture of α-aspartyl
and β-aspartyl hydroxamate, as well as a new amino-terminal glycine. The cyclic imide ring
may also open and form either an aspartate residue (i.e., a deamidation reaction) or an
isoaspartate residue (i.e., an isomerization reaction). These latter phenomena are known to
occur spontaneously in protein aging (Stephenson and Clarke 1989), especially in eye lens
proteins.
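
Because Asn-Gly sites are rare, it can be useful to check how many fragments to expect before committing material to a hydroxylamine digest. The following is a minimal sketch under stated assumptions: the function names and the example sequence are hypothetical, cleavage is treated as complete, and the default spacing of 350 residues is simply the approximate frequency quoted in the text.

```python
def asn_gly_fragments(sequence: str) -> list[str]:
    """Split a one-letter sequence between Asn (N) and Gly (G).

    Every fragment after the first begins with the new amino-terminal Gly.
    """
    fragments, start = [], 0
    for i in range(len(sequence) - 1):
        if sequence[i] == "N" and sequence[i + 1] == "G":
            fragments.append(sequence[start:i + 1])
            start = i + 1
    fragments.append(sequence[start:])
    return fragments


def expected_asn_gly_sites(length: int, spacing: float = 350.0) -> float:
    """Rough expected number of Asn-Gly sites for a protein of given length."""
    return length / spacing


if __name__ == "__main__":
    seq = "AKNGLLNGSW"  # hypothetical sequence, for illustration only
    print(asn_gly_fragments(seq))
    print(expected_asn_gly_sites(700))
```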


Cleavage at tryptophan 


The high reactivity of the indole side chain of tryptophan, and the relative rarity of this
residue in most proteins, have evoked much interest in finding methods for its selective
cleavage. A large number of cleavage reactions for tryptophan have been described. These include
BNPS-skatole (2-[2-nitrophenylsulfenyl]-3-methyl-3-bromoindolenine) (Omenn et al.
1970), dimethylsulfoxide and halogen acid (Savige and Fontana 1977), treatment with a high
concentration of cyanogen bromide in heptafluorobutyric acid (HFBA) (Ozols and Gerard
1977), and exposure to tribromocresol (Burstein and Patchornik 1972) and o-iodosobenzoic
acid (Mahoney et al. 1981). For a comprehensive review of tryptophan cleavage procedures,
see Fontana and Gross (1986). Of these methods, the o-iodosobenzoic acid procedure
has become accepted as the most reliable (see Protocol 8).


PEPTIDE MAPPING BY SDS-PAGE (CLEVELAND METHOD) IS A RELATIVELY 
EASY APPROACH 


Peptide separation by one-dimensional SDS-PAGE is a relatively easy approach for comparing
the peptide fragments of proteins generated by proteolytic or chemical cleavage (i.e., peptide
mapping). The similarity/dissimilarity of the resultant peptide pattern reflects the
similarity/dissimilarity of the parent proteins. This peptide mapping approach, first described by
Cleveland (1977), requires no special equipment, and it can also be combined with western
blotting to locate epitopes (i.e., epitope mapping). Protein fragmentation can be performed
prior to loading the gel (the preferred approach) (see Protocol 16) or, alternatively, in the
stacking gel with the resultant peptides separated directly in the running gel (Protocol 17).
The conventional Laemmli SDS-PAGE gel system is used to resolve peptide fragments >3000
Mr and the tricine gel system (Schägger and von Jagow 1987) is used for smaller peptides (for
basic PAGE protocols, see Chapters 2 and 4). Typically, several lanes with increasing incubation
times or enzyme concentrations are employed to optimize the proteolytic conditions. In
contrast to traditional peptide mapping procedures (e.g., RP-HPLC methods), which require
low-microgram amounts of protein, the Cleveland method requires very small amounts of
material (e.g., sufficient for visualization by silver staining; sensitivity can be further
enhanced by radiolabeling). For commonly used protein fragmentation procedures, see
Tables 7.6 and 7.7.


PEPTIDE MAPPING CAN BE USED TO DETERMINE DISULFIDE LINKAGES IN PROTEINS 


Analyzing the cysteine connectivities in proteins is an important facet of the structural char- 
acterization of a protein. Disulfide linkages between cysteine residues are an important struc- 
tural element of the extracellular domains of membrane-associated receptors, as well as 
secreted proteins and peptides such as plasma proteins, hormones, protease inhibitors, and 
venom proteins. Disulfide linkages can have an important role in establishing and maintaining
the structural fold of a protein.

Before embarking on such a quest, it is important first to establish the number of free
thiols and disulfide bonds in the protein of interest. Traditionally, Ellman’s reagent (DTNB) has
been used to assay the number of free thiols in a protein (Ellman 1959). The method is based
on the reaction of a free thiol in a protein with DTNB to give the mixed disulfide and
2-nitro-5-thiobenzoate (TNB), which is quantified by the absorbance of the anion (TNB²⁻) at 412
nm. The reagent has also been used to quantitate the number of disulfides by first blocking
any free thiols with an alkylating agent (e.g., iodoacetic acid, iodoacetamide, or
4-vinylpyridine) and then reducing the protein (e.g., with DTT) prior to reaction with DTNB (Anderson
and Wetlaufer 1975). For the analysis of free thiols/disulfide thiols, see Protocol 4.
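
The arithmetic behind the Ellman assay is a direct application of the Beer-Lambert law and can be sketched as follows. The molar extinction coefficient used here (14,150 M^-1 cm^-1 for the TNB anion at 412 nm) is an assumed, commonly quoted literature value and is not taken from this manual; the function names are hypothetical. Use the coefficient appropriate to the buffer and wavelength actually employed.

```python
# Assumed literature value for the TNB anion at 412 nm (M^-1 cm^-1);
# replace with the value validated for your own buffer conditions.
TNB_EXTINCTION = 14150.0


def free_thiols_per_protein(a412: float, protein_molar: float,
                            path_cm: float = 1.0,
                            epsilon: float = TNB_EXTINCTION) -> float:
    """Moles of thiol per mole of protein from a blank-corrected absorbance."""
    thiol_molar = a412 / (epsilon * path_cm)  # Beer-Lambert: c = A / (epsilon * l)
    return thiol_molar / protein_molar


def disulfides_per_protein(thiols_after_block_and_reduce: float) -> float:
    """Disulfides per protein when free thiols were alkylated before reduction.

    Each disulfide yields two thiols on reduction.
    """
    return thiols_after_block_and_reduce / 2.0


if __name__ == "__main__":
    # Hypothetical reading: A412 = 0.283 for a 10 uM protein solution.
    print(free_thiols_per_protein(0.283, 10e-6))
    print(disulfides_per_protein(4.0))
```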

Traditional methodology for determining disulfide linkages in proteins involves cleavage 
of the polypeptide chain with proteinases between half-cysteinyl residues to obtain peptides 
that are tethered by one disulfide bond. The resulting mixture of peptides is separated by RP- 
HPLC. Cysteine-linked peptides are identified in peptide maps (e.g., by comparison of nonre- 
duced and reduced maps) by the difference in their RP-HPLC retention times. The retention 
times of nondisulfide-containing peptides will be the same in both nonreducing and reduc- 
ing peptide maps, whereas under reducing conditions, disulfide bridges will be broken with 
the resultant loss of bridged peptides in the reduced peptide map and concomitant appearance
of their reduced peptide components. Typically, fragmentation is performed under low
pH conditions (e.g., pepsin at pH 2-3) to minimize disulfide exchange. However, if the native
protein is first alkylated (e.g., with 4-vinylpyridine or iodoacetate) (for a list of commonly
used alkylating agents, see Table 7.2) to prevent possible disulfide exchange, fragmentation
can then be performed using a specific proteinase(s) that requires basic conditions (e.g.,
trypsin and Glu-C). Disulfide-containing peptides can be identified by their sequences using
classical Edman chemistry or tandem mass spectrometry, or by their molecular masses. For an
example of the classical approach, see Figure 7.7, which outlines the strategy involved in the
determination of the disulfide connectivities of interleukin-6 (IL-6), a 20-kD glycoprotein with
two disulfides (Simpson et al. 1988). A more recent strategy, employing both Edman chem- 
istry and mass spectrometry, for the characterization (free thiol, disulfide linkages, and N-gly- 
cosylation sites) of the membrane-associated IL-6 receptor components, IL-6 receptor, and 
gp130 is outlined in Cole et al. (1999) and Moritz et al. (2001).
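
The comparison of nonreduced and reduced peptide maps described above amounts to matching peaks by retention time and flagging those present in only one map. A minimal sketch of that bookkeeping follows; the function name, the retention times, and the 0.2-minute matching tolerance are all hypothetical and would need to be set from the run-to-run reproducibility of the actual chromatographic system.

```python
def compare_peptide_maps(nonreduced: list[float], reduced: list[float],
                         tolerance: float = 0.2) -> tuple[list[float], list[float]]:
    """Compare peak retention times (minutes) from two peptide maps.

    Returns (lost_on_reduction, new_on_reduction): peaks seen only in the
    nonreduced map are candidate disulfide-bridged peptides; peaks seen only
    in the reduced map are candidate reduced component peptides.
    """
    def has_match(rt: float, others: list[float]) -> bool:
        return any(abs(rt - other) <= tolerance for other in others)

    lost = [rt for rt in nonreduced if not has_match(rt, reduced)]
    new = [rt for rt in reduced if not has_match(rt, nonreduced)]
    return lost, new


if __name__ == "__main__":
    # Hypothetical retention times, for illustration only.
    lost, new = compare_peptide_maps([12.1, 25.4, 33.0, 41.7],
                                     [12.0, 18.6, 29.3, 41.8])
    print("lost on reduction:", lost)
    print("new on reduction:", new)
```

Candidate peaks identified this way still need to be confirmed by sequencing or mass analysis, as described in the text.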

When cysteines are well dispersed in the polypeptide chain (as was the case with the 
example of IL-6 given in Figure 7.7), a single proteinase or multistep enzymatic or chemical 
digestions may be used to obtain a full set of diagnostic peptide fragments that will allow the 
determination of their connectivities. However, when cysteines are adjacent or tightly clus- 
tered, it is almost impossible to obtain peptides containing a single disulfide bond. In 1993, 
Gray devised a strategy for analyzing disulfide patterns in highly bridged small peptides 
(Gray 1993). In this strategy, peptides are partially reduced with the water-soluble reducing 
agent TCEP at pH 3.0, the nascent free thiols are alkylated with iodoacetamide, and their 
positions are identified from sequence analysis. A strategy to locate free thiols in proteins 
using 2-nitro-5-thiocyanobenzoic acid (NTCB) (Jacobson et al. 1973) has been described 
recently (Wu and Watson 1997). NTCB selectively cyanylates cysteine thiols. The amino-ter- 
minal peptide bond of cysteines modified with NTCB can be readily cleaved under alkaline 
conditions to form an amino-terminal peptide and a series of 2-iminothiazolidine-4-carboxyl
peptides (Jacobson et al. 1973) that can be mass mapped using matrix-assisted laser
desorption/ionization time-of-flight mass spectrometry (MALDI-TOF) to the sequence of
the original molecule (Wu and Watson 1997). A variation on this approach using the
cyanylating agent 1-cyano-4-dimethylamino-pyridinium tetrafluoroborate (CDAP) has been
described for assigning disulfide pairings in proteins (Wu and Watson 1997).
PEPTIDE MAPPING USING RP-HPLC


The fractionation of peptide mixtures to yield homogeneous peptides suitable for sequence 
analysis (especially for the Edman degradation procedure) usually requires a number of
chromatographic steps employing various columns as well as solvent-mediated selective steps. For
a review of reversed-phase peptide separation procedures, see Simpson et al. (1989), and
references therein. A recommended initial separation is


* performed on a microbore Brownlee RP-300 (2.1 mm I.D. x 30 mm, Perkin-Elmer) or
Vydac C4 (Vydac, Hesperia, California) reversed-phase column, or
Sequences indicate the results of Edman degradation of unreduced peptide
fractions. X denotes a Cys residue expected from the known sequence but not
observed; C denotes the di-Pth derivative of cystine.


FIGURE 7.7. Disulfide assignment of IL-6 using a combination of peptide mapping and Edman chemistry. (Top left panel)
Recombinant mIL-6 protein was digested with Staphylococcus aureus V8 protease, and 10 µg of peptides was loaded onto a Brownlee
RP-300 column (100 x 4.6 mm I.D., 7 µm dp). (A) Unreduced S. aureus digest of rmIL-6 (10 µg of peptide) loaded directly. (B) Prior
to loading, S. aureus digest of rmIL-6 (10 µg of peptide) was treated with 100 mM DTT. Peaks (400-600 µl) were collected manually
in microfuge tubes and stored at -20°C. For sequence analysis, peptides were concentrated to 40-60 µl by microbore RP-HPLC.
(Top right panel) Sequence analysis of peptide fractions I, II, and III from the peptide map shown in the top left panel. (Bottom panel)
Summary of sequence data and disulfide bridge identifications. (Reprinted, with permission, from Simpson et al. 1988b.)
* using a trifluoroacetic acid (TFA)/acetonitrile gradient system composed of Solvent A
(0.1% v/v aqueous TFA) and Solvent B (60% acetonitrile containing 0.085% v/v aqueous
TFA), which is usually developed with a linear 60-minute gradient from 0% to 100%
Solvent B at a flow rate of 100 µl/min.
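
For a linear gradient of this kind, the mobile-phase composition at any time follows from simple interpolation. The sketch below uses the values quoted above as defaults (0% to 100% Solvent B over 60 minutes, 60% acetonitrile in Solvent B, 100 µl/min) and assumes that Solvent A contains no acetonitrile and that the system delay volume can be ignored; the function names are hypothetical.

```python
def percent_b(t_min: float, start_b: float = 0.0, end_b: float = 100.0,
              duration_min: float = 60.0) -> float:
    """Programmed % Solvent B at time t for a linear gradient (clamped at the ends)."""
    fraction = min(max(t_min / duration_min, 0.0), 1.0)
    return start_b + fraction * (end_b - start_b)


def acetonitrile_percent(t_min: float, acn_in_b: float = 60.0,
                         duration_min: float = 60.0) -> float:
    """Approximate % acetonitrile in the mobile phase at time t.

    Assumes Solvent A contains no acetonitrile.
    """
    return percent_b(t_min, duration_min=duration_min) * acn_in_b / 100.0


def volume_delivered_ul(t_min: float, flow_ul_per_min: float = 100.0) -> float:
    """Total mobile-phase volume delivered after t minutes."""
    return t_min * flow_ul_per_min


if __name__ == "__main__":
    for t in (0, 15, 30, 60):
        print(t, percent_b(t), acetonitrile_percent(t), volume_delivered_ul(t))
```

A peptide eluting at 30 minutes in this system would therefore leave the column at roughly 30% acetonitrile, which is a useful figure when deciding how far to dilute a fraction before rechromatography.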


For additional information on choosing the appropriate column and flow rate, see
Chapter 5, Figures 5.4 and 5.5 (these two figures cover the principles of varying column internal
diameters/flow rates and a practical example). Peptides are usually detected in the column
eluate by their UV absorbance at 214 nm. Aromatic amino-acid-containing peptides (particularly
peptides containing tryptophan and tyrosine), as well as S-pyridylethylated-cysteine-containing
peptides, are readily identified by their characteristic UV spectra using photodiode-array
detection (Simpson et al. 1989; Moritz et al. 1996). Peptide fractions should be collected in
1.5-ml tubes according to UV absorbance at 214 nm and immediately capped to prevent loss of
organic solvent. To prevent sample loss during long-term storage (e.g., irreversible sticking of
peptide to the tube walls), the addition of Tween-20 (final concentration, 0.01-0.02% w/v) to
peptide fractions is recommended. This is especially important when working with low-picomole
amounts of sample. To reduce contaminant levels that interfere with RP-HPLC and mass
spectrometry, it is recommended that extremely clean tubes be used (e.g., microfuge tubes).
For high-sensitivity work, the tubes should be rinsed with Solvent B prior to usage. A standard
protocol for the RP-HPLC of peptides is given in Protocol 15.

In situations where peptide fractions are clearly impure (as judged from peak shape 
and/or UV spectrum across the peak), samples are rechromatographed using a separation 
system with a different selectivity (Simpson et al. 1989). Solvent systems that can be used for 
the second chromatographic system include 


* aqueous 1% (w/v) NaCl/acetonitrile,
* aqueous 0.1% (v/v) TFA/methanol, and
* 50 mM sodium phosphate/acetonitrile.


These solvent systems are compatible with most Edman chemistry sequencers. To pre- 
vent sample loss, peptide fractions should not be dried between chromatographic steps. 
Because peptide fractions usually contain sufficient quantities of secondary solvent to pre- 
vent their retention on a similar interactive support, it is necessary to dilute samples (one- to 
twofold) with the primary solvent to facilitate their retention (trace enrichment) onto the 
reversed-phase column used in the subsequent chromatographic step. Dilution can be read- 
ily accomplished in a large-volume (1.5 ml) sample-loading syringe immediately prior to 
injection. An illustration of the selective effects achieved with various mobile-phase condi- 
tions, using IL-6 peptide mapping as an example, is given in Figure 7.8. 

In the example shown in Figure 7.8, IL-6 (15 µg) was S-carboxymethylated before enzymatic
digestion. S-carboxymethyl (Cm) IL-6 was recovered from the reaction mixture by
RP-HPLC using a microbore column (50 x 1-mm I.D.). Before committing the bulk of the sample
to this desalting procedure, the chromatographic recovery of Cm-IL-6 was evaluated using
an analytical microbore-column step (Figure 7.8A,B). This pilot desalting step is extremely
important because some proteins, upon S-carboxymethylation, exhibit different chromatographic
behaviors (e.g., are more hydrophobic and hence retained more strongly) than their
native form. Most of the IL-6 sequence was established by analysis of peptides generated by
treatment of Cm-IL-6 with trypsin, chymotrypsin, V8 protease, and cyanogen bromide. A peptide map,
using the conventional TFA/acetonitrile solvent system, of a tryptic digest of Cm-IL-6 is 
shown in Figure 7.8C. Peptides were fractionated in a second chromatographic dimension, 
using an unbuffered saline/acetonitrile solvent system, to achieve homogeneity (Figure 7.8D). 
If the peptides were still not homogeneous (as judged from peak shape or spectral analysis 
FIGURE 7.8. Microbore HPLC purification of peptides from murine IL-6. (Reprinted, with permission, from Simpson et al. 1988a.)
(A) Chromatographic conditions: column, Brownlee RP-300 (50 x 1-mm I.D.); 5 µl of the reduction/alkylation reaction mixture
(containing 300 ng of Cm-IL-6, see Protocol 3) was loaded directly at a flow rate of 0.5 ml/min onto the column that had been
equilibrated in aqueous 0.1% (by volume) TFA. The flow rate was decreased to 40 µl/min, and after the baseline had stabilized (10 minutes)
the column was developed with a linear 75-minute gradient of 0-100% B, where Solvent A was aqueous 0.1% TFA and Solvent B was
75% (by volume) acetonitrile/25% H2O containing 0.085% TFA. The column temperature was 45°C. Peaks were collected manually.


(B) The remainder of the reaction mixture (245 µl containing 14 µg of Cm-IL-6) was loaded in 60-µl aliquots onto the same
column using identical chromatographic conditions to those described in A. Real-time spectral data of eluting peaks were obtained
using a diode array detector (Hewlett-Packard Model 1040). (Inset) Absorption spectrum of Cm-IL-6: vertical axis, absorbance
normalized to relative absorbance on a scale 0-100%; (- - - -) zero-order derivative spectrum; (-----) second-order derivative
spectrum. The extremum at 290 nm is characteristic of a tryptophan residue.

(C) Separation of tryptic peptides of Cm-IL-6 by microbore RP-HPLC. Chromatographic conditions: column, Brownlee RP-300
(30-nm pore size, 7-µm dimethyloctylsilica packed into a column [30 x 2.1-mm I.D.]); linear 60-minute gradient of 0-100% B, where Solvent
A was 0.1% (by volume) aqueous TFA and Solvent B was 60% acetonitrile/40% H2O containing 0.09% (by volume) TFA. Flow rate was
100 µl/minute; column temperature was 45°C. Sample, 8 µg (370 pmoles) in 1050 µl, was loaded at 1 ml/min. All peaks (~60-100 µl)
were collected manually and stored at -20°C. Pooling of peptide fractions is indicated by horizontal bars (-). Peptides found in each pool
(T4, T5, T6, etc.) have been labeled in accordance with their position in the sequence as proposed by Simpson et al. (1988a).


(Continued on facing page.) 
using diode array detection; Grego et al. 1985), then a third chromatographic dimension using
a different packing (e.g., ODS-Hypersil) and organic mobile-phase component (e.g.,
methanol) was used (Figure 7.8E,F). To isolate short, highly charged peptides that are not usually
retained on conventional reversed-phase chromatographic systems and elute with the column
breakthrough, the ion-pairing agent sodium hexylsulfonate was used (Figure 7.8G).


Ion pair reagents are strong hydrophobic ions that form neutral ion pairs with oppositely 
charged sample molecules (see Chapter 5). By these means, the simultaneous separation of 
charged and noncharged molecules is possible. For further information, see the Merck Web 
Site at http://www.merck.de/english/services/chromatographie/reagents/lichropu.htm.


In the example shown, unretained peptides from a first chromatographic separation of a
chymotryptic digest of Cm-IL-6 (Brownlee RP-300 column, 30-nm pore size/TFA-acetonitrile
system; for details, see Figure 7.8B) were chromatographed on an ODS-Hypersil column
employing 0.02 M sodium phosphate (pH 2.5) and 0.03 M sodium hexylsulfonate in the
mobile phase. Peptides recovered by these means (e.g., chymotryptic peptide C8
Arg-Ser-Thr-Arg-Gln-Thr) were directly sequenced without the need for desalting.

For peptide mapping of trace-abundant proteins, we recommend the use of microbore
columns (≤2.1-mm I.D.) or capillary columns (<0.5-mm I.D.) (see Chapter 8). Microbore
columns can be operated on conventional HPLC systems that can deliver accurate low flow
rates (~50 µl/min) (Simpson et al. 1989). In addition, commercial HPLC systems designed
for their usage (e.g., the Ultimate LC pump, LC Packings Inc., and the Agilent Cap 1100 LC
system) are now available. However, conventional HPLC systems can be readily (and
economically) modified to provide the accurate low flow rates (0.4-4 µl/min) and gradients
necessary to operate capillary columns (Moritz and Simpson 1992a,b, 1993; Moritz et al. 1994;
Tong et al. 1997). Such columns are essential for the preparation of samples for a number of
mass spectrometric methods for peptide identification. Detailed protocols for the facile
fabrication of <0.32-mm I.D. polyimide-coated fused-silica columns, with a detection limit of
~500 pg amounts of protein, are described by Moritz et al. (1994) and Tong et al. (1997). A
method for packing capillary columns using a conventional column-packing device is given
in Chapter 5, Protocol 2.


FIGURE 7.8. (Continued from facing page.) 

(D) Rechromatography of peak fraction T4/T8 from A by RP-HPLC. Peptides T4 and T8 were resolved by chromatography using
the same chromatographic conditions described in A, but using unbuffered sodium chloride as the mobile phase. Peak fraction
T4/T8 (~100 µl) was diluted twofold with H2O and then applied to the column that had been previously equilibrated with 1%
(mass/volume) aqueous sodium chloride. The column was developed with a linear 60-minute gradient of 0-50% B, where Solvent
A was 1% aqueous sodium chloride and Solvent B was acetonitrile. Flow rate was 100 µl/min; column temperature was 45°C.
Pooling of fractions is indicated by horizontal bars (-).

(E) Rechromatography of peak fraction T5/T6 from A by RP-HPLC. Chromatographic conditions were the same as those described
in B. Pooling of fractions is indicated by a horizontal bar (-).

(F) Rechromatography of peak fraction T5/T6 from C by RP-HPLC. Chromatographic conditions: column, ODS-Hypersil 12-nm
pore size, 5-µm particle diameter octadecylsilica packed into a stainless steel column (50 x 1-mm I.D.); the peptide fraction T5/T6
from C was diluted twofold with H2O and applied at a flow rate of 200 µl/min to the column that had been previously
equilibrated with 0.1% (by volume) TFA. The column was developed at a flow rate of 40 µl/min, with a linear 60-minute gradient of 0-100%
B, where Solvent A was aqueous 0.1% TFA and Solvent B was 60% methanol/40% H2O containing 0.09% TFA. Pooling of fractions
is indicated by horizontal bars (-).

(G) Ion-pair chromatography of nonretained chymotryptic peptides of Cm-IL-6. Chromatographic conditions: column,
ODS-Hypersil 12-nm pore size, 5-µm particle size octadecylsilica packed into a stainless steel column (50 x 1-mm I.D.); linear 50-minute
gradient from 0% to 100% B, where Solvent A was aqueous 0.02 M sodium phosphate, 0.1% (by volume) phosphoric acid (pH 2.5),
and 0.03 M sodium hexylsulfonate and Solvent B was 50% acetonitrile in Solvent A. Flow rate was 40 µl/min; column temperature
was 45°C. Sample load: eluent breakthrough (~2 ml) was diluted twofold with primary Solvent A and applied to the column in
400-µl aliquots at a flow rate of 1 ml/min.


PROTOCOL 1


Performic Acid Oxidation of Proteins 


MATERIALS 


IT IS CUSTOMARY TO CLEAVE DISULFIDE BONDS as a prerequisite to many peptide-mapping
strategies. This will facilitate protein unfolding and thus optimize proteolytic digestion. In
addition, cleavage of disulfide bonds simplifies the interpretation of peptide maps by removing
possible peptide fragment(s) in the mixture that are held together by disulfide bonds.
Described in this protocol is the performic acid oxidation method for cleaving disulfide 
bonds, originally described by Sanger (1949) in his pioneering work on the structure of 
insulin. 

Performic acid is an extremely powerful oxidizing reagent that oxidizes tryptophan, the 
sulfur-containing amino acids (cysteine, cystine, and methionine), the phenolic group of 
tyrosine, and the hydroxyl groups of serine and threonine. Of these reactions, the most rapid 
is the conversion of cysteine to cysteic acid and of methionine to the sulfoxides, followed by 
the oxidation of cystine to cysteic acid and methionine sulfoxide to methionine sulfone. 
Somewhat slower is the oxidation of tryptophan to several derivatives including N- 
formylkynurenine. Conversion of cystine (disulfide bonds) and cysteines to cysteic acid is 
~92%, and oxidation of methionine to the sulfone is essentially quantitative. Because cysteic 
acid and methionine sulfone can be analyzed quantitatively using the ninhydrin detection 
method of amino acid analysis (CHP quantitation), performic acid oxidation provides a
convenient determination of the combined cysteine and cystine content and the total
methionine content of a protein.

Performic acid oxidation, however, is not a recommended procedure for cleaving disul- 
fide bonds for general peptide-mapping strategies due to its significant limitations, especial- 
ly the loss of tryptophan and modifications to tyrosine, serine, and threonine. The preferred 
method for cleaving disulfide bonds for general peptide mapping is by reduction (see 
Protocols 2 and 3). For further details of the performic acid oxidation procedure, see Hirs 
(1967), from which this protocol was adapted. 


» Reagents 
CAUTION: See Appendix 3 for appropriate handling of materials marked with <!>.


Formic acid (88% w/v) <!>
Hydrobromic acid (48% w/v) <!>
WARNING: Because it gives off a caustic, irritating vapor, use this acid in a chemical fume hood.
Hydrogen peroxide (30% w/v) <!>
NaOH pellets (see Step 6) <!>
» Equipment
Evaporator, rotatory (SpeedVac or equivalent) or Lyophilizer, centrifugal 
Tubes (glass or polypropylene, capped) 


» Biological Sample 
Purified protein of interest, lyophilized 


METHOD 


1. Add 100 µl of hydrogen peroxide to 900 µl of formic acid. Allow the mixture to stand for
1 hour at room temperature. This reaction produces performic acid (HCOOOH).


2. Chill the performic acid on ice to ~0°C.


During performic acid oxidation, the temperature of all the reactants should be at 0°C to
minimize side reactions such as oxidation of phenolic groups (tyrosine and phenylalanine)
and the hydroxyl groups of serine and threonine.


3. Dissolve the protein in 50 µl of performic acid (~500 µg/ml) in a precooled tube.


4. Place the cap on the tube and incubate the mixture for ~4 hours at 0°C. If the protein is
insoluble, allow the reaction to proceed for up to 20 hours.


5. Add 30 µl of cold hydrobromic acid to destroy the excess performic acid.


Care must be taken since a small quantity of bromine will be liberated.


6. Dry the sample thoroughly using a rotatory evaporator or centrifugal lyophilizer.
Removal of residual bromine and formic acid is facilitated by the addition of a few
pellets of sodium hydroxide in the condenser. Evaporation to dryness takes ~10 minutes.


Because many amino acids are modified, it is recommended that performic-acid-oxidized 
proteins be used for the estimation of S-S/S-H (as cysteic acid) and methionine (as methio- 
nine sulfone) groups by amino acid analysis (but not sequence analysis). 
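
The volumes in Steps 3 and 5 correspond to ~25 µg of protein (50 µl at ~500 µg/ml) quenched with 30 µl of hydrobromic acid. For other amounts of protein, the volumes might be scaled as sketched below. Linear scaling is an assumption made here for illustration and is not stated in the protocol; the function name is hypothetical.

```python
def performic_oxidation_volumes(protein_ug: float,
                                protein_ug_per_ml: float = 500.0) -> dict[str, float]:
    """Scale Step 3 and Step 5 volumes linearly with the amount of protein.

    Assumes the protocol ratio of 30 ul of hydrobromic acid per 50 ul of
    performic acid holds at other scales (an assumption, not a protocol claim).
    """
    performic_ul = protein_ug * 1000.0 / protein_ug_per_ml
    hydrobromic_ul = performic_ul * 30.0 / 50.0
    return {"performic_acid_ul": performic_ul,
            "hydrobromic_acid_ul": hydrobromic_ul}


if __name__ == "__main__":
    print(performic_oxidation_volumes(25.0))   # the scale given in the protocol
    print(performic_oxidation_volumes(100.0))  # a hypothetical larger sample
```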


PROTOCOL 2 


Reduction and S-carboxymethylation of Proteins: 
Large-scale Method 


AN INTERCHAIN AND/OR INTRACHAIN DISULFIDE LINKAGE in a protein can present problems
during proteolytic or chemical fragmentation procedures.

* Disulfide-linked peptides elute as single peaks during RP-HPLC-based peptide mapping,
making the interpretation of such maps difficult.

* It is desirable to separate peptide chains linked through -S-S- bridges prior to sequence
work, otherwise the task of interpreting the sequence data may be too complicated.

* Proteins whose disulfide bonds are not split are less susceptible to proteolytic
fragmentation.


The most commonly used method for cleaving disulfide bonds involves the reduction of 
cystine to yield cysteine residues. However, cysteine residues are highly reactive, which can 
cause complications (e.g., random disulfide bond formation and susceptibility to oxidation 
upon removal of the reducing agent) during sequence work. For this reason, it is customary 
to stabilize them as suitable derivatives (e.g., by alkylation). 

In this protocol, the intact protein (≥1 mg) is reduced and then S-alkylated with
iodoacetic acid (or iodoacetamide). The resulting cysteine derivative S-carboxymethylcysteine
(or S-carboxamidomethylcysteine) is easily detectable during chemical sequencing. Another
frequently used alkylating agent is 4-vinylpyridine, which yields 4-pyridylethylcysteine 
(Raftery and Cole 1966). 4-Pyridylethylcysteine is readily detected in the Edman degradation 
process as its PTH derivative, as well as by mass-spectrometry-based sequencing methods. For 
microscale (<100 ug of protein) reduction and S-alkylation procedures, see Protocol 3. 


MATERIALS 


CAUTION: See Appendix 3 for appropriate handling of materials marked with <!>. 


» Reagents
Acetic acid (1 M) (ultrapure grade) <!>


Alkylation buffer
Alkylation buffer is 1.5 M Tris-HCl <!> (pH 8.5) containing 2.5 mM EDTA and 6 M
guanidine-HCl <!>. Urea <!> (8 M) can be substituted for guanidine-HCl. Stock solutions of reduction
buffer made with urea may be stored frozen for up to 2 weeks prior to use.

NH4HCO3 (0.1 M or 1% w/v, Analar grade) <!>
Ammonium bicarbonate (1% w/v) gives a pH of 7.8, and pH adjustment is not necessary.

Dithiothreitol (DTT) <!> 

Guanidine hydrochloride, solid (Mallinckrodt) or aqueous 8 M solution (Pierce) 
HCl, concentrated <!>


Iodoacetic acid <!> or iodoacetamide <!>
The iodoacetic acid used must be colorless. Any free iodine (revealed by yellow coloration of the
material) causes oxidation of cysteine residues (thereby preventing alkylation) and possibly
tyrosine residues. High-quality iodoacetic acid can be obtained from Fluka (Buchs, Switzerland). If the
iodoacetic acid is not available or if it is not colorless, recrystallize it from hexane before use.
4-Vinylpyridine may be substituted for the iodoacetic acid or iodoacetamide (see the panel at the end
of the protocol).

2-Mercaptoethanol <!> 

N-Ethylmorpholine acetate buffer (0.2 M, pH 8.0) <!>
Add 230 µl of N-ethylmorpholine to ~8.0 ml of H2O. Add acetic acid to give a pH of 8.0. Adjust the
volume to a total of 10 ml.

Reduction buffer
Reduction buffer is 0.2 M Tris-HCl (pH 8.5) containing 2.5 mM EDTA and 6 M guanidine-HCl. Urea
(8 M) can be substituted for guanidine-HCl. Stock solutions of reduction buffer made with urea
may be stored frozen for up to 2 weeks prior to use.


P Equipment 


Chromatography system and size-exclusion column 
For example, Sephadex G-10, G-25 (medium), G-50 SF, or Bio-Gel P-10 size-exclusion resin, using
1 M acetic acid to develop the column. 


Nitrogen <!> (purified grade) or argon <!>
pH indicator paper 

Polypropylene tube (tightly capped) 

Tygon tubing (see Step 3) 


▶ Biological Sample
Protein sample, purified (lyophilized) 


METHOD 


Sample Reduction 
1. Dissolve the protein in reduction buffer (~10 mg protein/ml buffer) in a polypropylene
tube.


2. Add an accurately weighed amount of solid DTT to give a 60-fold molar excess (i.e.,
approximately half the weight of the protein) or to give a final concentration of 0.1 M.


3. Gently blow a stream of nitrogen (or argon) over the top of the solution for 30 seconds. 


This can be accomplished by attaching a Pasteur pipette to a length of Tygon tubing con- 
nected to the nitrogen cylinder regulator, and adjusting the nitrogen flow such that it can be 
barely detected on the back of the hand. 


4. Incubate the protein solution for 3 hours at 37°C. 


5. If desired, the number of -SH groups can be quantitated using the Ellman procedure (see 
Protocol 4). 
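

The amount of DTT called for in Step 2 can be checked with a short calculation. The sketch below (Python) is illustrative only: the function names are hypothetical, and it assumes that the 60-fold molar excess is reckoned per mole of protein, which the protocol does not state explicitly. On that assumption, the rule of thumb of approximately half the weight of the protein corresponds to a protein of Mr ~18,500.

```python
DTT_MW = 154.25  # g/mol, dithiothreitol


def dtt_mass_for_molar_excess(protein_mg, protein_mw_da, fold_excess=60.0):
    """Mass of DTT (mg) giving `fold_excess` moles of DTT per mole of protein.

    Assumption: the molar excess is expressed relative to moles of protein.
    """
    protein_mmol = protein_mg / protein_mw_da      # mg / (g/mol) = mmol
    return protein_mmol * fold_excess * DTT_MW     # mmol * g/mol = mg


def dtt_mass_for_final_concentration(volume_ml, molar=0.1):
    """Mass of DTT (mg) giving a final concentration of `molar` in `volume_ml`."""
    return volume_ml * molar * DTT_MW              # ml * mol/l * g/mol = mg


if __name__ == "__main__":
    # Hypothetical example: 10 mg of an 18.5-kD protein in 1 ml of buffer.
    print(round(dtt_mass_for_molar_excess(10.0, 18500.0), 2), "mg DTT (60-fold excess)")
    print(round(dtt_mass_for_final_concentration(1.0), 2), "mg DTT (0.1 M in 1 ml)")
```

For 10 mg of an 18.5-kD protein, the first function gives about 5 mg of DTT; the second gives about 15.4 mg of DTT per milliliter of solution for a final concentration of 0.1 M.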
Sample Alkylation


In some cases, reduced and alkylated proteins can be rapidly desalted and recovered in a small
volume of volatile solvent by RP-HPLC. This alternative procedure is particularly attractive
for low-Mr proteins (15-25K) such as growth factors and cytokines. Desalting (and sample
concentration) by RP-HPLC (TFA/acetonitrile solvent system) has been successfully employed
for low-microgram amounts of material (see Protocol 3).


6. Cool the reduced protein substrate solution on ice. 


7. Add iodoacetic acid (approximately the same weight as the protein substrate) freshly dissolved
in a small volume (~0.2 ml) of alkylation buffer. Alternatively, use a small aliquot
from a stock solution of sodium iodoacetate (15 mg of iodoacetic acid per 100 µl of
0.1 M NaOH) to give a slight molar excess over the total SH groups in the mixture (i.e.,
protein SH groups plus DTT).


8. Mix the contents of the tube by gently tapping the side of the tube. 
9. Flush the reaction tube with nitrogen for 30 seconds.


10. Incubate the reaction for 15 minutes at room temperature in the absence of light. 


The reaction is carried out in the dark to prevent the formation of free iodine from the 
breakdown of iodoacetic acid. 


11. Add a molar excess of 2-mercaptoethanol (~50 µl per 5 mg of DTT) to "mop up" the
excess alkylating reagent and thus stop the reaction.


12. Adjust the pH of the solution to 2-3 by the careful addition of concentrated HCl. Check the
pH by spotting ~0.5 µl of the reaction mixture onto a strip of pH indicator paper.


A pilot study is recommended to check the solubility of the alkylated protein (which may 
differ significantly from that of the native protein). Alternative solutions and buffer salts that 
are volatile include 1 M acetic acid (pH 2.1), 1% (w/v) ammonium bicarbonate (pH 7.8),
and 0.2 M N-ethylmorpholine acetate buffer (pH 8.0). Protein can be readily recovered from 
these solvents and buffer salt solutions by lyophilization. 


Recovery of S-carboxymethyl Protein (Desalting)


Alternatively, reduced and alkylated protein can be desalted by extensive dialysis against a
volatile solution (e.g., 1 M acetic acid) or buffer salt solution (e.g., 1% ammonium bicarbonate).
However, this approach is time-consuming (typically, 24 hours with several changes of
dialysis solution are essential) and high losses can occur. Following dialysis, protein can be
recovered by lyophilization.

13. To remove the excess reagents, rapidly desalt the protein mixture by size-exclusion chro- 
matography. 


14. Recover the reduced and alkylated protein from the 1 M acetic acid eluent as a dried pow- 
der using lyophilization. 


Alternatively, reduced protein can be alkylated with 4-vinylpyridine. This is accomplished
by treating the reduced protein substrate for 90 minutes with 3.0 moles of 4-vinylpyridine
per 1 mole of DTT at room temperature. Desalting of the S-pyridylethyl protein is performed
as described above for S-carboxymethylated protein. For details, see Step 12.


PROTOCOL 3 


Reduction and S-carboxymethylation of Proteins:
Microscale Method 


REDUCTION AND ALKYLATION OF MICROSCALE AMOUNTS OF PROTEIN (10-50 µg) require special
attention to avoid losses (especially those due to the sample sticking irreversibly to the sides
of the sample vessel). Adding a small quantity of detergent (e.g., 0.02% Tween-20) to the protein
solution during all manipulations can prevent such losses. Classical methods for desalting
(e.g., size-exclusion chromatography) and dialysis should be avoided due to large sample
losses.

In this protocol, a procedure for reduction and S-carboxymethylation of low-microgram
amounts of protein is described. Samples are reduced and alkylated in small volumes (<200
µl) in the presence of 0.02% Tween-20 and desalted (and concentrated) by RP-HPLC in
yields of >90%. Avoid using Tween-20 if the subsequent step involves mass spectrometry
(MS) because Tween-20 tends to stick to the RP-HPLC supports and leach off the column,
resulting in artifact ion peaks in the MS spectra. This procedure has been successfully applied
to the complete amino acid sequence determination of the glycoproteins IL-6 and IL-9
(Simpson et al. 1988, 1989).


MATERIALS


CAUTION: See Appendix 3 for appropriate handling of materials marked with <!>.


▶ Reagents


Acetonitrile <!> 

Iodoacetic acid <!>
Prepare a 0.5 M solution of iodoacetic acid (Puriss grade from Fluka [Buchs, Switzerland], recrystallized
prior to use) in 1 M NaOH <!> immediately before use.
Store iodoacetic acid dry by placing the bottle in a plastic container containing silica gel in the dark
at -20°C. Allow the bottle to come to room temperature before opening. With time, solid iodoacetic
acid will develop a yellow coloration (free iodine) and should be recrystallized.

n-Propanol <!> 

Reduction buffer 
Stock buffer adjusted to 0.15 M DTT <!> (prepared fresh immediately before use). 

Stock buffer 
0.2 M Tris-HCl (pH 8.3) containing 2 mM EDTA, 6 M guanidine-HCl <!>, and 0.02% (w/v) Tween-20.
Mix 1 volume of 1 M Tris-HCl (pH 8.5) and 5 volumes of aqueous 8 M guanidine-HCl (solution
provided by Pierce); adjust to 2 mM EDTA.


Trifluoroacetic acid (TFA) <!> 
▶ Equipment


Centrifuge, bench-top 


HPLC system and microbore reversed-phase column


For example, Brownlee RP-300 (30-nm pore size, 7-µm particle diameter, octyl silica) packed into a
stainless steel column, either 50 x 1-mm I.D. or 30 x 2.1-mm I.D. (PE Brownlee). For basic RP-HPLC
protocols, see Chapter 5.


Lyophilizer, centrifugal (Thermo Savant) 
Polypropylene tubes (1.5-ml, capped) 
Water bath or incubator preset to 40°C


▶ Biological Sample
Pure protein samples (~10-15 µg)


METHOD 


Two cytokines are mentioned in this protocol (IL-6 and IL-9) because different strategies were used
for the last stage of their purification, prior to reduction/alkylation. The last stage of purification
for IL-6 (32 µg) involved immunoaffinity chromatography, and the protein was recovered from the
column in ~130 µl using 1 M acetic acid <!> containing 0.01% (w/v) Tween-20 (Simpson et al.
1988). For IL-9, a reversed-phase column was used and pure IL-9 (15 µg) was recovered in 120 µl
of 35% aqueous acetonitrile <!> containing 0.1% (by volume) TFA <!> (the active fraction was
immediately adjusted to a final concentration of 0.02% Tween-20) (Simpson et al. 1989).


Sample Reduction 


1. Evaporate solutions of purified proteins (containing 0.01-0.02% Tween-20) to near dryness
(~10 µl) by centrifugal lyophilization.


2. Add 150 µl of stock buffer to the microfuge tube containing the "near" dried protein sample.
Mix thoroughly by gently tapping the microfuge tube or by vortexing it and then
briefly centrifuging the sample in a microfuge.


3. Add 16 µl of reduction buffer to give a final concentration of 15 mM DTT.


4. Incubate the sample for 4.5-5 hours at 40°C.


Sample Alkylation 


5. Allow the reduced protein solution to come to room temperature. 


6. Add 20 µl of stock 0.5 M iodoacetic acid solution to give a final concentration of ~0.05 M
iodoacetic acid.


7. Incubate the reaction for 30 minutes at 25°C in the dark. 
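

The final reagent concentrations quoted in Steps 3 and 6 follow from simple dilution arithmetic. The sketch below (Python; all names are illustrative) assumes that ~10 µl of sample remains after Step 1 and that volumes are additive. It also compares the iodoacetic acid with the thiols contributed by DTT, counting two thiols per DTT molecule.

```python
def diluted(stock_molar, added_ul, total_ul):
    """Concentration (M) once `added_ul` of stock is part of `total_ul`."""
    return stock_molar * added_ul / total_ul


SAMPLE_UL = 10.0                    # assumed residual volume after Step 1
STOCK_BUFFER_UL = 150.0             # Step 2
DTT_UL, DTT_STOCK_M = 16.0, 0.15    # Step 3 (reduction buffer is 0.15 M DTT)
IAA_UL, IAA_STOCK_M = 20.0, 0.5     # Step 6 (0.5 M iodoacetic acid)

reduction_volume = SAMPLE_UL + STOCK_BUFFER_UL + DTT_UL   # 176 µl
alkylation_volume = reduction_volume + IAA_UL             # 196 µl

dtt_during_reduction = diluted(DTT_STOCK_M, DTT_UL, reduction_volume)
iaa_during_alkylation = diluted(IAA_STOCK_M, IAA_UL, alkylation_volume)

# Each DTT molecule carries two thiols that can consume alkylating reagent.
dtt_thiols_during_alkylation = 2 * diluted(DTT_STOCK_M, DTT_UL, alkylation_volume)
iaa_excess_over_thiols = iaa_during_alkylation / dtt_thiols_during_alkylation

if __name__ == "__main__":
    print(f"DTT during reduction: {dtt_during_reduction * 1000:.1f} mM")
    print(f"Iodoacetic acid during alkylation: {iaa_during_alkylation * 1000:.1f} mM")
    print(f"Iodoacetic acid per DTT thiol: {iaa_excess_over_thiols:.2f}")
```

Under these assumptions the DTT is ~14 mM during reduction, the iodoacetic acid is ~51 mM during alkylation, and the alkylating reagent is in roughly twofold molar excess over the DTT thiols.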
S-carboxymethyl Protein Recovery (Desalting/Concentration)


8. Inject 2-3% (~300 ng of protein) of the total mixture onto a reversed-phase microbore
column (50 x 1-mm I.D.) HPLC system (see Protocols 1 and 2 in Chapter 5).


Before committing the bulk of the S-carboxymethylated proteins to RP-HPLC, conduct a
pilot experiment using 2-3% of the total sample (~300 ng) to ascertain whether the material
can be recovered from the reversed-phase column. This pilot desalting step is extremely
important because some proteins display different chromatographic behaviors upon
S-carboxymethylation (e.g., they are more hydrophobic than the native protein). If the relative
hydrophobicity of an S-carboxymethylated protein is significantly higher than that of the
native protein, it may be difficult to recover from the reversed-phase column using a
TFA/acetonitrile solvent system. In this case, try substituting the acetonitrile with
n-propanol (see Chapter 5, Protocol 5). If this does not work, try desalting the
S-carboxymethylated protein using micro-size-exclusion chromatography or fast micro-desalting.


9. Once it is established that the S-carboxymethylated protein can be recovered from the 
reversed-phase column, inject the remainder of the material. 


10. Adjust the S-carboxymethyl protein-containing fraction (typically, in 300-500 µl of
aqueous 0.1% TFA/acetonitrile) to 0.02% (w/v) with respect to Tween-20. For prote- 
olytic digestion of S-carboxymethylated proteins collected in this manner, proceed to 
Protocol 5. 


PROTOCOL 4 


Estimation of Free Thiols and Disulfide Bonds Using 
Ellman’s Reagent 


IN THIS PROTOCOL, A METHOD FOR QUANTITATING THE AMOUNT OF FREE THIOLS and disulfide
bond linkages in proteins is described. The method uses the sulfhydryl reagent 5,5'-dithiobis
(2-nitrobenzoic acid), which is also referred to as the Ellman reagent after its originator
(Ellman 1959). For a review outlining the use of Ellman's reagent for determination of total 
protein sulfhydryl, and the amount of sulfhydryl groups in tissues, see Habeeb (1972). 


MATERIALS 


CAUTION: See Appendix 3 for appropriate handling of materials marked with <!>. 


▶ Reagents
Denaturing buffer
6 M guanidine hydrochloride <!>
0.1 M Na2HPO4 (pH 8.0) <!>
Dithiothreitol (DTT) (200 mM) in deionized H2O (Calbiochem or Roche) <!>


Ellman's reagent (DTNB) 
Prepare 10 mM DTNB (4 mg/ml) in 0.1 M sodium phosphate buffer (pH 8.0). <!>


Iodoacetic acid <!> 

Nitrogen <!>

Reaction buffer (0.1 M sodium phosphate buffer, pH 8.0) 
4-Vinylpyridine <!> 


▶ Equipment
Dialysis equipment
For further information concerning dialysis membranes and equipment, see
http://www.spectrapor.com.
Size-exclusion chromatography equipment 
Spectrophotometer and cuvettes 


▶ Biological Sample


Protein sample
The sample should contain an accurate amount of protein (for protein estimation protocols, see
Appendix 2): at least 2 nmoles in 100 µl of reaction buffer or denaturing buffer (e.g., buried
thiols may need to be exposed by denaturing the protein).
METHODS


Method 1: Free Thiols 


1. Add 3 ml of reaction buffer to the sample and reference cuvettes.


2. Measure the absorbance at 412 nm (the absorbance should be adjusted to zero, $A_{\mathrm{buffer}}$).


3. Add 100 µl of reaction buffer to the reference cuvette.


4. Add 100 µl of Ellman's reagent to the sample cuvette. Measure the absorbance at 412 nm
($A_{\mathrm{DTNB}}$).


5. Add 100 µl of protein solution to the reference cuvette.


6. Add 100 µl of protein solution to the sample cuvette. Mix thoroughly. Measure the
absorbance when there is no further increase (after 3-5 minutes). This is the $A_{\mathrm{final}}$.


7. Calculate the concentration of thiols as follows:
$\Delta A_{412} = \varepsilon_{412}^{\mathrm{TNB}^{2-}}\,[\mathrm{RSH}]$, where
$\Delta A_{412} = A_{\mathrm{final}} - (3.1/3.2)\,(A_{\mathrm{DTNB}} - A_{\mathrm{buffer}})$ and
$\varepsilon_{412}^{\mathrm{TNB}^{2-}} = 1.415 \times 10^{4}\ \mathrm{cm}^{-1}\,\mathrm{M}^{-1}$ (for denaturing
buffer, $\varepsilon_{412}^{\mathrm{TNB}^{2-}} = 1.37 \times 10^{4}\ \mathrm{cm}^{-1}\,\mathrm{M}^{-1}$).


The $\varepsilon_{412}^{\mathrm{TNB}^{2-}}$ values given are from Riddles et al. (1983).
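

The calculation in Step 7 can be sketched as a short script. This is an illustration rather than part of the published method: the function names and the example absorbance readings are hypothetical, a 1-cm path length is assumed, and the result is the thiol concentration in the 3.2-ml cuvette volume (multiply by 32 to refer it back to the 100-µl protein sample).

```python
EPSILON_REACTION_BUFFER = 1.415e4   # M^-1 cm^-1, TNB2- in reaction buffer
EPSILON_DENATURING_BUFFER = 1.37e4  # M^-1 cm^-1, TNB2- in denaturing buffer


def delta_a412(a_final, a_dtnb, a_buffer=0.0):
    """Absorbance change due to thiols, corrected for the 3.1 -> 3.2 ml dilution."""
    return a_final - (3.1 / 3.2) * (a_dtnb - a_buffer)


def thiol_concentration(a_final, a_dtnb, a_buffer=0.0,
                        epsilon=EPSILON_REACTION_BUFFER, path_cm=1.0):
    """Thiol concentration (M) in the cuvette; path length is an assumption."""
    return delta_a412(a_final, a_dtnb, a_buffer) / (epsilon * path_cm)


def thiol_nmoles_in_cuvette(concentration_molar, cuvette_ml=3.2):
    """Total thiol (nmol) in the cuvette after all additions."""
    return concentration_molar * cuvette_ml * 1e-3 * 1e9   # M * l -> mol -> nmol


if __name__ == "__main__":
    # Hypothetical readings: A_buffer = 0, A_DTNB = 0.020, A_final = 0.150
    conc = thiol_concentration(a_final=0.150, a_dtnb=0.020)
    print(f"[RSH] in cuvette: {conc * 1e6:.2f} µM")
    print(f"Thiol in cuvette: {thiol_nmoles_in_cuvette(conc):.1f} nmol")
```

Dividing the nanomoles of thiol by the nanomoles of protein added gives the number of free thiols per protein molecule.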


Method 2: Disulfide Thiols 


This method is essentially the same as that described above in Method 1, except that the pro- 
tein sample is first alkylated (without prior reduction) to derivatize any free thiols, but leav- 
ing the disulfide links intact. Alkylation can be performed using iodoacetic acid or 4- 
vinylpyridine (see Protocol 2). 


1. Dissolve the sample in reaction buffer or denaturing buffer (at least 2 nmoles in 100 µl).


2. Add freshly prepared DTT (10-100 mM final concentration), and displace the oxygen
with a stream of nitrogen. Incubate the reactions for 1-2 hours at 25°C.


3. Desalt the reduced sample by dialysis against the reaction buffer or by size-exclusion
chromatography.


4. Measure the protein concentration (for basic protocols, see Appendix 2), and analyze the
newly exposed thiols using the procedure described in Method 1 above.


PROTOCOL 5 


Fragmentation of Protein Using Trypsin 


MATERIALS 


IN THIS PROTOCOL, THE HIGHLY SPECIFIC PROTEASE TRYPSIN is used to hydrolyze a protein completely.
Proteolysis is carried out with high levels of trypsin to ensure total proteolysis.
(Trypsin is a robust enzyme; hence, proteolysis can also be carried out under denaturing conditions
such as 2 M guanidine-HCl, 0.1% SDS, and >10% acetonitrile to ensure complete
digestion.) Trypsin cleaves the peptide bond between the carboxyl group of arginine or
lysine and the amino group of the adjacent amino acid. Cleavage occurs more slowly
when the lysine and arginine residues are adjacent to acidic amino
acids or cystine in the sequence. Cleavage does not occur when lysine or arginine is followed
by proline. 
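

The cleavage rule just described (cleavage on the carboxyl side of lysine or arginine, but not when the next residue is proline) can be sketched in a few lines of Python to predict the peptides expected from a complete digest. The function name and the example sequence are hypothetical, and the sketch ignores the slower cleavage next to acidic residues or cystine as well as any missed cleavages.

```python
def tryptic_peptides(sequence):
    """Split a one-letter amino acid sequence after K or R, except before P."""
    if not sequence:
        return []
    peptides = []
    start = 0
    for i, residue in enumerate(sequence):
        is_last = i == len(sequence) - 1
        if residue in "KR" and not is_last and sequence[i + 1] != "P":
            peptides.append(sequence[start:i + 1])
            start = i + 1
    peptides.append(sequence[start:])   # carboxy-terminal peptide
    return peptides


if __name__ == "__main__":
    # Hypothetical sequence: the Arg-Pro bond is left intact.
    print(tryptic_peptides("AKRPGKDER"))
```

For the example sequence the sketch yields the peptides AK, RPGK, and DER.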


CAUTION: See Appendix 3 for appropriate handling of materials marked with <!>.


▶ Reagents


CaCl2 <!>

NH4HCO3 (Analar grade) <!> (1% w/v) in deionized H2O
The pH of this solution is 7.8 and does not require further adjustment. The solution may be stored 
refrigerated or frozen for long periods of time. 

Phenylmethylsulfonyl fluoride (PMSF) <!>
Prepare this reagent just prior to use by dissolving it in 1-propanol <!> to a final concentration of
1 M PMSF. Dilute PMSF 1000-fold to 1 mM in the reaction mixture. PMSF is an effective inhibitor
of serine proteinases due to its high reactivity with serine residues at the active site of this class of
enzymes.

Trypsin stock solution (1 mg/ml trypsin in 1 mM HCl, 20 mM CaCl2)
Trypsin is available from various commercial suppliers. It is stable for years as a dry solid at -20°C.
Most grades of commercially available trypsin (EC 3.4.21.4) are from bovine pancreas. It is
recommended that TPCK-treated trypsin be used for peptide-mapping studies because this treatment
specifically inhibits chymotrypsin, a common contaminant of trypsin preparations. Trypsin activity
can vary from batch to batch. For reproducible peptide maps, the same batch of trypsin should
be used throughout the studies.
Use fresh or divide the solution into small aliquots (~50 µl) and store frozen at -20°C. A stock
solution may be thawed and refrozen a few times, but for consistent results, thaw only once.


Tween-20 
▶ Equipment
Polypropylene tube (tightly capped)
Water bath or incubator capable of maintaining temperature to ±1°C


▶ Biological Sample
Lyophilized protein of interest 


METHOD 


1. Dissolve the lyophilized protein substrate in 1% ammonium bicarbonate using the min- 
imal volume necessary to achieve a high substrate concentration. 


When working with micro amounts of protein substrate (i.e., <1 mg), add Tween-20 to the
ammonium bicarbonate to a final concentration of 0.1% (w/v); otherwise serious losses will
occur due to the substrate (and enzyme) sticking irreversibly to the walls of the tube.
Ammonium bicarbonate is a simple volatile buffer that can be readily removed by
lyophilization. Nonvolatile buffers at pH 8 such as 0.05 M Tris-HCl (pH 8.0-8.3) may be
readily substituted for ammonium bicarbonate.


2. Add trypsin to the substrate at a substrate:enzyme ratio of 200:1 to 50:1 for denatured
substrates (e.g., reduced and alkylated proteins), or as high as 1:1 for native protein substrates.
For in-gel proteolysis, use 0.5-1.0 µg of trypsin, regardless of the amount of substrate (typically,
<0.5 µg for Coomassie-Blue-stained proteins).


3. Set up a parallel reaction mix lacking only the protein sample (and if there is sufficient 
material, a second reaction lacking only trypsin) as a control for RP-HPLC peptide map- 
ping. 


4. Incubate the reactions for 16 hours (overnight is convenient) at 37°C. 


5. Stop the reaction by placing the sample onto an RP-HPLC column to begin peptide map- 
ping, or chill the incubation mixture on ice to slow the reaction. 


Alternatively, stop the reaction by adding a specific inhibitor (e.g., Nα-tosyl-L-lysyl
chloromethyl ketone, N-tosyl-L-phenylalanine chloromethyl ketone [TPCK], or PMSF)
in molar excess to the trypsin used. Because trypsin is inactive at low pH values, the reac- 
tion can also be stopped by adjusting the pH of the reaction mixture to pH 2-3 (e.g., by the 
addition of volatile acids such as acetic acid or trifluoroacetic acid). 
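

The amount of trypsin required in Step 2 follows directly from the chosen substrate:enzyme ratio. The sketch below (Python; the function name is illustrative) assumes that the ratio is expressed by weight and that the 1 mg/ml trypsin stock solution listed under Reagents is used.

```python
def trypsin_for_digest(substrate_ug, substrate_to_enzyme, stock_mg_per_ml=1.0):
    """Return (µg of trypsin, µl of stock) for a weight-based substrate:enzyme ratio."""
    if substrate_to_enzyme <= 0:
        raise ValueError("substrate:enzyme ratio must be positive")
    trypsin_ug = substrate_ug / substrate_to_enzyme
    stock_ul = trypsin_ug / stock_mg_per_ml     # 1 mg/ml is the same as 1 µg/µl
    return trypsin_ug, stock_ul


if __name__ == "__main__":
    # Hypothetical example: 100 µg of reduced and alkylated protein
    for ratio in (200, 50):
        ug, ul = trypsin_for_digest(100.0, ratio)
        print(f"{ratio}:1 -> {ug:.2f} µg trypsin = {ul:.2f} µl of stock")
```

For 100 µg of substrate, ratios of 200:1 and 50:1 correspond to 0.5 µg and 2 µg of trypsin, that is, 0.5 µl and 2 µl of the 1 mg/ml stock.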
TROUBLESHOOTING TRYPSIN DIGESTS


* Autodigestion of trypsin during incubation at neutral pH produces ψ-trypsin, among
other products. The intrinsic chymotrypsin-like activity of ψ-trypsin (Keil-Dlouhá et al.
1971) explains the occasional "nontryptic"-type cleavage (i.e., at some tyrosyl, phenylalanyl,
or tryptophanyl bonds) sometimes observed with tryptic digests. Thus, the rare
nonspecific cleavages that can occur with trypsin, adjacent to aromatic or hydrophobic
residues, are most likely due to the presence of small amounts of ψ-trypsin, and not to
chymotrypsin contamination. The addition of low levels of Ca2+ (0.1 mM CaCl2) to the
incubation buffer is recommended to reduce trypsin autodigestion (and ψ-trypsin
production).


* For optimal digestion, it is important that the protein substrate be as soluble as possible,
or as evenly suspended as possible, in the digestion buffer. This can be accomplished by
adjusting the protein substrate solution to 8 M urea and then diluting to 2 M urea for
digestion. Trypsin is a robust proteinase, being active in the presence of 2 M
guanidine-HCl, 0.1% (w/v) SDS, or in the organic solvent acetonitrile (up to ~40% v/v)
(see Table 7.6 and Welinder 1988). In the case of acetonitrile, low concentrations (e.g.,
<10% v/v) may facilitate more rapid proteolysis compared to that of buffer alone,
whereas high concentrations (e.g., 50%) usually slow the reaction.


* The activity of trypsin in acetonitrile is an important attribute, because fractions from
RP-HPLC (aqueous acetonitrile/0.1% v/v TFA system) may be readily digested after
adding Tween-20 (to a final concentration of 0.02% w/v), reducing the volume to ~50
µl in a centrifugal vacuum concentrator (to lower the acetonitrile concentration), and
simply diluting to 1 ml with 0.1% ammonium bicarbonate containing 1 mM CaCl2 and
0.02% Tween-20 (Simpson et al. 1988, 1989; see also Protocols 2 and 3).


* Native protein substrates may be tightly folded, and thus they may need to be unfolded
to facilitate proteolytic attack. This can be accomplished by reduction/alkylation, which
unravels the protein, making it readily accessible to trypsin. Alternatively, denaturation
of the protein substrate can be accomplished by boiling or with the use of urea,
guanidine-HCl, SDS, or an organic solvent (see text above about optimal digestion).
PROTOCOL 6


Cleavage at Met-X Bonds by Cyanogen Bromide 


SELECTIVE CLEAVAGE OF A PROTEIN BY CYANOGEN BROMIDE, first introduced by Gross and
Witkop in 1961, generates a distinctive set of peptide fragments. Cyanogen bromide cuts peptide
bonds on the carboxy-terminal side of methionine residues (see Figure 7.5 for the mechanism
of cyanogen bromide cleavage), and because this amino acid is relatively infrequent in
proteins, this cleavage tends to produce relatively large and relatively few peptides. Met-X
amino acid bonds are cleaved specifically and almost quantitatively, with the exception of
Met-Ser and Met-Thr bonds, where the hydroxyl group of the neighboring side chain may
interfere with ring opening of the iminolactone. (The use of 70% trifluoroacetic acid instead of
formic acid has been reported to improve cleavage yields [Titani et al. 1972].)

Typical procedures use ~100-fold molar excess of cyanogen bromide over the methionine,
and the reaction is performed under mild acidic conditions (e.g., 70% formic acid),
under nitrogen, and in the dark for 4-24 hours. All reagents are volatile and can be readily
removed by lyophilization.


MATERIALS


CAUTION: See Appendix 3 for appropriate handling of materials marked with <!>.


▶ Reagents

Cyanogen bromide <!> (Aldrich)
Store in a dry refrigerated container (e.g., place reagent bottle in a wide-necked plastic bottle
containing silica gel) in the dark. Use only white crystals.
WARNING: Cyanogen bromide is very toxic. Wear protective gloves and safety glasses and work in
a chemical fume hood.

Formic acid, minimum assay 98% (Aristar grade) <!>

β-Mercaptoethanol <!>
Store refrigerated.

NH4HCO3 (0.2 M) <!>

Sodium hypochlorite solution (domestic bleach) <!>


▶ Equipment

Chemical fume hood
Concentrator (SpeedVac)
Microfuge tubes (with caps)
Nitrogen (pressurized) <!>
▶ Biological Sample
Lyophilized protein 


METHOD 


Reduction 


1. Dissolve the protein in 0.2 M ammonium bicarbonate to a final concentration of 10-20
mg/ml.


2. Add β-mercaptoethanol to between 1% and 5% (v/v).


3. Blow nitrogen over the solution to displace oxygen, cap the tube, and incubate overnight 
at room temperature. 


4. Dry the sample using a SpeedVac concentrator. 


Warming the sample will help evaporate all of the ammonium bicarbonate. 


Cyanogen Bromide Cleavage 


5. Redissolve the dried sample (10-20 mg/ml) in 70% formic acid.


6. Add solid cyanogen bromide (2 mg of reagent per milligram of protein) directly to the 
protein solution. For very small amounts of protein, add one small crystal of cyanogen 
bromide. 

Cyanogen bromide is usually present at 20-100-fold excess with respect to methionyl 
residues, which is equivalent to approximately equal amounts of protein and cyanogen bro- 
mide. 


7. Cap the tube and incubate the reaction for 18-20 hours at room temperature in the 
dark. 


8. Terminate the reaction by diluting the reaction mixture with ~5 volumes of H,O and 
removing the excess reagents by lyophilization with a SpeedVac concentrator. 


The products can then be separated by liquid chromatography or gel electrophoresis (or 
stored at -20°C).


9. To remove all traces of cyanogen bromide, decontaminate equipment (e.g., spatulas,
tubes) by immersion in sodium hypochlorite (bleach) solution for a few minutes until
effervescence stops. 
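

The note to Step 6 relates the weight of cyanogen bromide added to its molar excess over methionyl residues. That relationship can be checked with the sketch below (Python). The function name and the example protein (20 kD, four methionines) are hypothetical; the only constant used is the molecular weight of cyanogen bromide (105.92 g/mol).

```python
CNBR_MW = 105.92  # g/mol, cyanogen bromide (CNBr)


def cnbr_fold_excess(protein_mg, protein_mw_da, met_per_protein, cnbr_mg):
    """Moles of CNBr per mole of methionyl residues in the reaction."""
    if met_per_protein <= 0:
        raise ValueError("protein must contain at least one methionine")
    met_umol = protein_mg / protein_mw_da * 1000.0 * met_per_protein  # mmol -> µmol
    cnbr_umol = cnbr_mg / CNBR_MW * 1000.0
    return cnbr_umol / met_umol


if __name__ == "__main__":
    # Hypothetical protein: 1 mg, 20 kD, 4 Met residues
    print(round(cnbr_fold_excess(1.0, 20000.0, 4, cnbr_mg=2.0)))  # 2 mg CNBr per mg protein
    print(round(cnbr_fold_excess(1.0, 20000.0, 4, cnbr_mg=1.0)))  # equal weights
```

For this example, 2 mg of reagent per milligram of protein corresponds to a ~94-fold molar excess over methionine, and equal weights of protein and reagent to a ~47-fold excess, both within the 20-100-fold range cited above.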
TROUBLESHOOTING


* Cyanogen bromide: Some side reactions may occur if impure cyanogen bromide (colored
yellow or orange) is used. This can be avoided by using the white crystals that have sublimed
on the inner surface of the reagent bottle. Excess cyanogen bromide (e.g., 1000-fold
or higher) will oxidize methionine to methionine sulfoxide, especially in strong acid such
as heptafluorobutyric acid (HFBA). This phenomenon has been used to effect tryptophan
cleavage (albeit with poor yields), but not Met-X bonds. Under these conditions,
tyrosyl residues may become brominated.


* Formic acid: Alternative acids to formic acid include 0.1 M HCl, 75-85% TFA, or HFBA.
The stronger acids may cause Asp-Pro cleavage and increased acid hydrolysis.


* Met-X cleavage and sequencing: It is possible to cleave Met-X bonds in samples that
have been applied to the glass fiber disks of the gas phase/pulsed liquid sequencers
(Simpson and Nice 1984). This cleavage has proven to be particularly useful for obtaining
sequence information from amino-terminally blocked proteins. It is also possible to
chemically fragment proteins in situ in acrylamide gels (Jahnen et al. 1990) for obtaining
sequence information.


* Lactone derivative coupling and sequencing: Because the lactone derivative can be cou- 
pled selectively, and in good yield, to solid supports of the amine type (e.g., 3-amino 
propyl glass) (Horn and Laursen 1973), this is a useful method for the isolation of 
methionine-containing peptides for sequence analysis. 


PROTOCOL 7 


Cleavage of Asn-Gly Bonds by Hydroxylamine 


SELECTIVE CLEAVAGE OF ASN-GLY BONDS IN PROTEINS by hydroxylamine generates a distinctive
set of peptide fragments (for the mechanism of hydroxylamine cleavage of Asn-Gly bonds,
see Figure 7.6). Because the Asn-Gly sequence occurs statistically about every 350 amino
acids in a protein, cleavage of this bond will yield relatively large and relatively few peptide
fragments (Kellner 1994). In this protocol, a procedure for performing Asn-Gly bond cleavage
in solution is described. The hydroxylamine cleavage method can be adapted to fragment
proteins within acrylamide gels (Saris et al. 1983). A cleavage buffer comprising 2 M hydrox- 
ylamine and 6 M guanidine-HCl in 15 mM Tris titrated to pH 9.3 with 4.5 M lithium hydrox- 
ide will yield ~25% cleavage. 
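

Whether a given protein contains Asn-Gly bonds, and what fragments complete cleavage would produce, can be predicted from its sequence. The sketch below (Python) is illustrative: the function names and the example sequence are hypothetical, and the spacing estimate simply treats residues as independent, so it gives 400 residues for equal frequencies of 5% and a smaller figure, closer to the ~350 quoted above, when glycine is more abundant than average.

```python
def asn_gly_fragments(sequence):
    """Split a one-letter sequence at every Asn-Gly (N-G) peptide bond."""
    fragments = []
    start = 0
    pos = sequence.find("NG")
    while pos != -1:
        fragments.append(sequence[start:pos + 1])   # fragment ends with Asn
        start = pos + 1                             # next fragment starts with Gly
        pos = sequence.find("NG", pos + 1)
    fragments.append(sequence[start:])
    return fragments


def expected_spacing(freq_asn, freq_gly):
    """Average number of residues between Asn-Gly pairs for independent residues."""
    return 1.0 / (freq_asn * freq_gly)


if __name__ == "__main__":
    print(asn_gly_fragments("MKNGAANGLL"))       # hypothetical sequence
    print(round(expected_spacing(0.05, 0.05)))   # equal 5% frequencies
```

For the example sequence the sketch returns the fragments MKN, GAAN, and GLL.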


MATERIALS 


CAUTION: See Appendix 3 for appropriate handling of materials marked with <!>. 


▶ Reagents
Cleavage buffer
2 M hydroxylamine-HCl <!>
4.3 M guanidine-HCl <!>
0.2 M potassium carbonate (pH 9.0) <!>


Dissolve 3.5 g of hydroxylamine (Analar grade, BDH) in 12 ml of prechilled 6 M aqueous
guanidine-HCl (use an ice bath), and then add 2 ml of 50% (w/v) NaOH <!> slowly with vigorous
stirring (magnetic stir bar) followed by 5 ml of 1 M potassium carbonate. Adjust the pH to 9.0 with
50% NaOH, and then adjust the volume to 25 ml with 6 M guanidine-HCl.


The inclusion of guanidine-HCl is reported to improve the efficiency of Asn-Gly cleavage (Kwong 
and Harris 1994). 
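

The stated composition of the cleavage buffer can be cross-checked against the recipe. The sketch below (Python; names are illustrative) assumes that the 3.5 g of hydroxylamine is weighed as the hydrochloride salt (69.49 g/mol), which is consistent with the 2 M hydroxylamine-HCl given in the composition.

```python
HYDROXYLAMINE_HCL_MW = 69.49   # g/mol, NH2OH.HCl (assumed form of the reagent)
FINAL_VOLUME_ML = 25.0


def molarity_from_mass(mass_g, mw, volume_ml):
    """Molar concentration of `mass_g` of solute dissolved to `volume_ml`."""
    return mass_g / mw / (volume_ml / 1000.0)


def molarity_after_dilution(stock_molar, stock_ml, final_ml):
    """Molar concentration after `stock_ml` of stock is made up to `final_ml`."""
    return stock_molar * stock_ml / final_ml


hydroxylamine_m = molarity_from_mass(3.5, HYDROXYLAMINE_HCL_MW, FINAL_VOLUME_ML)
carbonate_m = molarity_after_dilution(1.0, 5.0, FINAL_VOLUME_ML)
# Total volume of 6 M guanidine-HCl implied by a final concentration of 4.3 M
guanidine_solution_ml = 4.3 * FINAL_VOLUME_ML / 6.0

if __name__ == "__main__":
    print(f"hydroxylamine-HCl: {hydroxylamine_m:.2f} M")
    print(f"potassium carbonate: {carbonate_m:.2f} M")
    print(f"6 M guanidine-HCl required: {guanidine_solution_ml:.1f} ml in total")
```

The recipe therefore gives ~2.0 M hydroxylamine-HCl and 0.2 M potassium carbonate, and the stated 4.3 M guanidine-HCl implies that ~18 ml of the 25 ml is contributed by the 6 M guanidine-HCl solution.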
Formic acid (Puriss grade, Merck) <!> 


Trifluoroacetic acid (TFA) <!>
Optional, see Step 5. 


▶ Equipment
Column (0.8 cm I.D. x 30 cm; Sephadex G-25 medium) (Amersham Biosciences)
Alternatively, a disposable desalting column from Bio-Rad can be used.


Liquid chromatography equipment (RP-HPLC or size-exclusion) 
Magnetic stir plate 

Polypropylene tubes (tightly capped) 

Water bath or incubator preset to 45°C 
▶ Biological Sample
Lyophilized protein 


METHOD 


1. Dissolve the protein directly in the cleavage buffer to give a final concentration of ~5 
mg/ml. 


2. Incubate the mixture for 4 hours at 45°C. 


3. Stop the reaction by cooling the sample and adjusting the pH to 2.5 by the addition of 
concentrated formic acid (or 3 volumes of 2% v/v TFA in H2O).


4. Desalt the sample by chromatography on a column. Analyze the peptide mixture immediately
by liquid chromatography, or, alternatively, store the samples at -20°C for later
analysis.


5. If analyzing by gel electrophoresis, the reaction itself may be stopped (Step 3), not by 
acidification, but by the addition of SDS-PAGE sample buffer. 


PROTOCOL 8 


Cleavage at Tryptophan by o-Iodosobenzoic Acid


CHEMICAL REAGENTS THAT SELECTIVELY CLEAVE AT TRYPTOPHAN RESIDUES have contributed
greatly to the elucidation of protein structure and function (Fontana and Gross 1986).
Selective cleavage of a protein at tryptophan, a relatively rare amino acid, generates a distinctive
set of large peptide fragments. The use of o-iodosobenzoic acid for specific cleavage
at tryptophan residues was first proposed by Mahoney and Hermodson in 1979. This protocol
has been adapted from the method of Mahoney et al. (1981). The yield of cleavage is
moderate to high (up to 80%) and, besides some cleavage at tyrosine residues, there are very
few side reactions (Fontana et al. 1983). For a detailed review of other, albeit less specific, pro- 
cedures for cleaving proteins at tryptophan, see Fontana and Gross (1986). 


MATERIALS 

CAUTION: See Appendix 3 for appropriate handling of materials marked with <!>.

▶ Reagents

p-Cresol <!> 

Glacial acetic acid (Puriss grade, Merck) <!>

Guanidine-HCl <!> 

o-Iodosobenzoic acid <!>

▶ Equipment


Column, size-exclusion (Sephadex G-25, Amersham Biosciences or equivalent) 
Concentrator (SpeedVac, Thermo Savant) 
Nitrogen supply <!> 


▶ Biological Sample


Lyophilized protein 
METHOD


1. Dissolve the o-iodosobenzoic acid (10 mg) in 1.0 ml of 80% (v/v) acetic acid containing
4 M guanidine-HCl and 20 µl of p-cresol.


2. Incubate the mixture for 2 hours at room temperature.


3. Add the protein to a final concentration of 5-10 mg/ml. Flush the tube with a stream of
nitrogen and incubate for 24 hours at room temperature in the dark.


4. Terminate the reaction by adding H2O (~10 volumes) and dry using a SpeedVac concentrator
(or equivalent). Alternatively, obtain the peptides by applying the digest directly to
a size-exclusion column.


PROTOCOL 9 


Staining Proteins in Gels with Coomassie Blue 


THE MOST COMMONLY USED DYE FOR VISUALIZING PROTEINS in SDS-PAGE is Coomassie
Brilliant Blue R250 (CBR-250) because of its relatively high sensitivity (Meyer and Lambert 
1965; Syrovy and Hodny 1991). This protocol describes the standard CBR-250 staining 
method, along with a simple method for preparing stained gels for long-term storage. The 
limit of detection of standard CBR-250 is generally quoted as 50-100 ng, but if sufficient 
destaining can be achieved, then as little as 10 ng of protein can be detected. By increasing the 
temperature of both the staining and destaining steps, the length of the procedure can be 
shortened to ~25 minutes with a concomitant increase in sensitivity to 2.5-5 ng of protein 
(for other Coomassie Blue staining procedures, see Chapter 2, Protocols 2, 3, and 4, and 
Chapter 4, Protocol 10). This protocol was adapted from Simpson and Reid (1998). 


MATERIALS 


CAUTION: See Appendix 3 for appropriate handling of materials marked with <!>.


▶ Reagents


Coomassie Brilliant Blue staining solution <!> 
Mix 1 g of Coomassie Brilliant Blue (Bio-Rad) in 1 liter of 50% (v/v) methanol <!>, 10% (v/v)
glacial acetic acid <!>, and 40% H2O. Stir the solution for 3-4 hours and then filter through
Whatman filter paper.


Destaining solution 
12% (v/v) methanol 
7% (v/v) glacial acetic acid
81% H2O
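

Both solutions are defined by volume percentages, so the recipes scale directly to any batch size. The sketch below (Python; names are illustrative) computes the component volumes and the weight of dye for a chosen total volume, assuming that volumes are additive.

```python
STAIN = {"methanol": 0.50, "glacial acetic acid": 0.10, "H2O": 0.40}
DESTAIN = {"methanol": 0.12, "glacial acetic acid": 0.07, "H2O": 0.81}


def component_volumes(recipe, total_ml):
    """Volume (ml) of each component for `total_ml` of solution."""
    if abs(sum(recipe.values()) - 1.0) > 1e-9:
        raise ValueError("volume fractions must sum to 1")
    return {name: fraction * total_ml for name, fraction in recipe.items()}


def dye_mass_g(total_ml, grams_per_liter=1.0):
    """Weight (g) of Coomassie Brilliant Blue at 1 g per liter of staining solution."""
    return grams_per_liter * total_ml / 1000.0


if __name__ == "__main__":
    print(component_volumes(STAIN, 500.0), dye_mass_g(500.0), "g dye")
    print(component_volumes(DESTAIN, 250.0))
```

For example, 250 ml of destaining solution requires 30 ml of methanol, 17.5 ml of glacial acetic acid, and 202.5 ml of H2O, and 500 ml of staining solution requires 0.5 g of dye.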


▶ Equipment
Cellophane 
Gel dryer frame, plastic (Amersham Biosciences) 
Mechanical shaker 
Plastic containers 
Plastic wrap 
Tissue paper, fine grade (Kimwipes) 


▶ Biological Sample
Proteins within a gel that have been separated by electrophoresis 
METHOD


Staining of Gels 


Perform the staining steps at room temperature unless otherwise indicated. 


1. Place the gel containing the proteins of interest in a plastic container that has sufficient 
Coomassie Brilliant Blue staining solution to cover the gel. Place the container on a 
mechanical shaker and allow the gel to stain for 20 minutes at room temperature. 


2. Remove the staining solution and add destaining solution and three sheets of fine-grade
tissue paper. Destain with shaking. Replenish the destaining solution several times until the
gel is fully destained.


The tissue paper binds released stain. 


Storage of Coomassie-Blue-stained Gels 


3. If the gel will be processed in less than 1 month, store it in 200-250 ml of destaining solu-
tion at room temperature in a sealed plastic container. 


4. For long-term storage, place the gel between two sheets of cellophane in a plastic gel 
frame. Air dry the gel in a chemical fume hood for ~16 hours. 


5. Wrap the dried gel in plastic wrap and store it at room temperature. 


PROTOCOL 10 


Zinc/Imidazole Procedure for Visualization of Proteins 
in Gels by Negative Staining 


THE ZINC/IMIDAZOLE STAINING PROCEDURE FOR VISUALIZING PROTEINS in acrylamide gels is
based on differential salt binding. In this protocol, the negative stain uses the heavy divalent
cation zinc for making a precipitate with dodecyl sulfate. Because protein-bound salts (e.g.,
dodecyl sulfate or the heavy cation zinc) are chemically less active than the free zinc ions in
the gel, precipitation of an insoluble salt is much slower in those regions of the gel occupied
by proteins than in the gel background where zinc dodecyl sulfate precipitates. The result is
a "negative stain," with translucent proteins and an opaque gel background, due to zinc
dodecyl sulfate precipitation. The sensitivity of the method was markedly improved by altering the
composition of the precipitated salt, from zinc dodecyl sulfate to a complex of zinc and
imidazole (Ortiz et al. 1992; Fernandez-Patron et al. 1998). Typical protein detection
sensitivities with the zinc/imidazole staining procedure approach the low-nanogram range. The stain
can be reversed with chelators of divalent cations, such as EDTA. The differential salt-binding
method for protein visualization has also been adapted for proteins blotted onto membranes
(Patton et al. 1994) and for general proteome analysis of gel-resolved proteins
(Castellanos-Serra et al. 1999).

This protocol, which was adapted from a procedure described by Fernandez-Patron et al.
(1995a,b) and from Simpson and Reid (1998), provides two methods: reverse staining using
imidazole, SDS, and zinc, and a double-staining procedure using Coomassie Blue stain followed
by zinc/imidazole.


MATERIALS


» Reagents


Fixing solution
50% (v/v) methanol <!>
5% (v/v) glacial acetic acid <!>
45% H₂O


Imidazole (0.2 M) containing 0.1% SDS
Dissolve 6.8 g of imidazole <!> and 0.5 g of SDS <!> in 500 ml of H₂O.


Zinc sulfate (0.2 M) 
Dissolve 28.7 g of ZnSO₄·7H₂O in 500 ml of H₂O.
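

The weights given for the imidazole and zinc sulfate solutions follow from molarity x volume x molar mass. The sketch below reproduces that calculation as a cross-check; it is an illustration, not part of the original protocol. The function name `grams_needed` is hypothetical, and the molar masses (about 68.08 g/mol for imidazole and about 287.56 g/mol for zinc sulfate heptahydrate) are standard reference values that do not appear in the text.

```python
def grams_needed(molarity, volume_ml, molar_mass):
    """Mass (g) of solute for a solution of given molarity (mol/l) and volume (ml)."""
    return molarity * (volume_ml / 1000.0) * molar_mass


# Approximate molar masses in g/mol (assumed reference values).
IMIDAZOLE = 68.08
ZINC_SULFATE_HEPTAHYDRATE = 287.56

imidazole_g = grams_needed(0.2, 500, IMIDAZOLE)
zinc_sulfate_g = grams_needed(0.2, 500, ZINC_SULFATE_HEPTAHYDRATE)

# 0.1% (w/v) SDS means 0.1 g per 100 ml of solution.
sds_g = 0.1 / 100.0 * 500

print(f"imidazole: {imidazole_g:.1f} g")
print(f"ZnSO4.7H2O: {zinc_sulfate_g:.1f} g")
print(f"SDS: {sds_g:.1f} g")
```

Both results round to the quantities listed in the reagent recipes (6.8 g and roughly 28.7 g).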


» Equipment
Shaker 


» Biological Sample


Proteins within a gel that have been separated by electrophoresis 


These proteins should be either unstained (Method 1) or stained with Coomassie Brilliant Blue 
(Method 2; see Protocol 9 for staining procedure). 


METHODS 


Method 1: Direct Reverse Staining with Imidazole, SDS, and Zinc 


1. Submerge the gel in fixing solution for 20 minutes with gentle shaking.


2. Discard the fixing solution. Wash the gel twice with deionized H₂O with gentle shaking
for 15 minutes. 


3. Incubate the gel in 0.2 M imidazole containing 0.1% SDS for 15 minutes with gentle shak- 
ing. 


4. Remove the imidazole-SDS solution. Add 0.2 M zinc sulfate to the gel and agitate it for
30-60 seconds.


5. When the gel has stained satisfactorily, remove the zinc sulfate solution and submerge the
gel in deionized H₂O.


Method 2: Double Staining of Coomassie-Blue-stained Polyacrylamide Gels with 
Imidazole, SDS, and Zinc 


1. After Coomassie Blue staining (see Protocol 9), wash the destained gel twice in deionized
H₂O with gentle shaking for 15 minutes.


2. Incubate the gel in 0.2 M imidazole, 0.1% SDS solution for 15 minutes with gentle shak- 
ing. 


3. Remove the imidazole-SDS solution. Add 0.2 M zinc sulfate to the gel, and agitate it for
30-60 seconds.


4. When the gel has stained satisfactorily, remove the zinc sulfate solution and submerge the
gel in deionized H₂O.


PROTOCOL 11


Staining Proteins in Gels with Silver Nitrate 


SILVER STAINING IS ONE OF THE COMMONLY USED PROCEDURES for visualizing proteins in
acrylamide gels (for two excellent reviews, see Merril and Washart 1998; Rabilloud 2000). All
silver staining methods rely on the reduction of ionic to metallic silver to provide metallic
silver images. The selective reduction at gel sites occupied by proteins, compared to nonprotein
sites, depends on differences in the oxidation-reduction potentials at these sites
(Merril 1987). There are two broad methodologies for silver staining. One approach
(nondiamine silver nitrate stains) employs silver nitrate as the silvering agent and formaldehyde in
alkaline carbonate solution as the developing agent, whereas the other approach (diamine or
ammoniacal stains) uses ammoniacal silver as the silvering agent and formaldehyde in dilute
citric acid as the developing agent. Although protocols using ammoniacal silver are arguably
more sensitive and give darker hues than those based on silver nitrate, they are more prone
to negative staining, resulting in hollow or "doughnut" spots, give unacceptable backgrounds
with tricine-based gel systems, and are not very robust due to their reliance on the
ammonia-silver ratio (Rabilloud 2000). Additionally, ammoniacal silver staining is more sensitive for
basic proteins, but less so for very acidic proteins. This protocol describes an adaptation of
the silver nitrate procedure developed by Rabilloud (1990, 2001), Shevchenko et al. (1996),
and Simpson and Reid (1998). The sensitivity of the method is in the low-nanogram range,
which is 50-100 times more sensitive than classical Coomassie Blue staining (see Protocol 9),
~10 times better than colloidal Coomassie Blue staining (Neuhoff et al. 1988; Chapter 4,
Protocol 10), and at least twice as sensitive as the negative staining zinc/imidazole method
(see Protocol 10).


MATERIALS


» Reagents


Acetic acid (1%) in H₂O <!>
Developing solution
2% (w/v) sodium carbonate <!>
0.04% (v/v) formaldehyde <!> in H₂O
Fixing solution (50% v/v methanol <!> and 5% v/v glacial acetic acid <!> in H₂O)
Methanol (50%)
Silver nitrate (0.1% w/v) (chilled on ice) in H₂O <!>
Sodium thiosulfate (0.02% w/v) in H₂O
Stopping solution (5% v/v glacial acetic acid in H₂O)


» Equipment


Plastic containers 
Shaker 


» Biological Sample 


Proteins within a gel that have been separated by electrophoresis 


METHOD 


1. Place the gel in a plastic container (use a disposable plastic container or a thoroughly
washed container) and add sufficient fixing solution to cover the gel. Fix the gel for 20
minutes with gentle shaking.


2. Remove the fixing solution and add enough 50% methanol to cover the gel. Shake the gel
gently for 10 minutes.


3. Remove the 50% methanol and add deionized H₂O. Shake the gel gently for a further 10
minutes.


4. Remove the deionized H₂O and soak the gel in 0.02% sodium thiosulfate for 1 minute.


5. Rinse the gel twice with deionized H₂O for 1 minute.


6. Submerge the gel in chilled 0.1% silver nitrate solution and incubate it for 20 minutes at
4°C.


7. Rinse the gel twice with deionized H₂O for 1 minute.


8. Submerge the gel in developing solution and shake the container vigorously until the
desired intensity of staining is reached (see Step 9). Discard the developing solution if it
turns yellow and replenish it with fresh developing solution.


EXPERIMENTAL TIP: It is critical that the developing solution remains transparent during
the development step.


9. When the desired intensity of staining is reached, stop the development by replacing the
developing solution with stopping solution.


10. Store the silver-stained gel in 1% acetic acid until further use.


PROTOCOL 12


In-gel S-Pyridylethylation of Gel-resolved Proteins


IN-GEL PROTEOLYTIC DIGESTION OF ONE-DIMENSIONAL or two-dimensional gel-separated proteins
(see Protocol 13) is a commonly used method for generating peptide fragments for identifying
and characterizing proteins by classical Edman chemistry (see Chapter 6) or
mass-spectrometry-based methods (see Chapter 8). Although most gel methods for separating proteins
(e.g., SDS-PAGE and conventional two-dimensional gel electrophoresis) are conducted
under reducing conditions, there is a strong propensity for disulfide bond formation to occur
during protein digestion and/or peptide isolation. This can result in cysteine-containing peptides
being cross-linked by disulfide bridges and increased complexity of peptide maps. In this
protocol, adapted from Simpson and Reid (1998), a method for performing in-gel reduction
and S-alkylation of Coomassie-Blue-stained proteins is described. The method is applicable
to either the whole gel or individually excised, stained protein spots/bands. Reduction is
performed with dithiothreitol, and alkylation with 4-vinylpyridine. (Treatment of free cysteines
with 4-vinylpyridine yields the S-β-(4-pyridylethyl) cysteinyl derivative.) S-β-(4-pyridylethyl)
cysteine-containing peptides can be readily identified during RP-HPLC by their characteristic
absorbance at 254 nm and during electrospray ionization tandem mass spectrometry by the
appearance of a characteristic pyridylethyl fragment ion of 106 daltons (Moritz et al. 1996).
The position of cysteine residues in a polypeptide sequence can be determined either by mass
spectrometry (see Chapter 8) or as phenylthiohydantoin S-β-(4-pyridylethyl) cysteine during
Edman degradation (see Chapter 6).


MATERIALS


» Reagents


2-Mercaptoethanol <!>
Reduction buffer
0.2 M Trizma (base) (pH 8.4)
10 mM dithiothreitol (DTT) <!>
2 mM EDTA
Add DTT just before using this buffer.
4-Vinylpyridine (Aldrich) <!>


» Equipment


Centrifugal lyophilizer (e.g., SpeedVac, Thermo Savant) (optional)
H₂O (deionized) (Milli-Q)
Incubator or oven preset to 40°C
Lyophilizer (Thermo Savant)
Microfuge tube (1.5-ml)
Pipette (micropipettor or Pasteur)
Plastic container


» Biological Sample
Gel containing the proteins of interest separated by electrophoresis 


METHODS 


Method 1: Whole Gel Reduction and S-pyridylethylation


1. Ensure that the gel has been appropriately stained and destained (see Protocol 9). 


2. Wash the intact gel extensively in three changes of 400-500 ml of H₂O for ~1.5 hours.


This step removes any acetic acid resulting from the destain protocol that would otherwise 
adversely affect the pH of the reduction. 


3. Transfer the intact gel to a clean container. 


4. Immerse the gel in 50 ml of reduction buffer. (The volume depends on the size of both
the gel and the container.) Incubate the gel for 2 hours at 40°C.


5. Add 4-vinylpyridine to the reduction buffer to a final concentration of 2% (v/v). Incubate 
the gel in the dark for 1 hour at room temperature. 


6. Add excess 2-mercaptoethanol (2% v/v final concentration) to stop the alkylation.


7. Wash the gel extensively in three changes of 400-500 ml of H₂O for ~1.5 hours.


This step is included to remove 2-mercaptoethanol, which would otherwise interfere with
proteolysis.


8. If the protein spots are no longer visible, repeat the staining and destaining procedures 
(see Protocol 9). 


Method 2: Individual Gel Spot Reduction and S-pyridylethylation 


1. Inspect the gel to make sure it has been appropriately stained and destained (see Protocol 9). 


2. Excise the spot of interest. Place the excised gel piece in a clean 1.5-ml polypropylene 
microfuge tube. 


3. Wash the excised gel piece extensively using three changes of 1 ml each of H₂O for ~45
minutes. Alternatively, completely dehydrate the gel by centrifugal lyophilization.


4. Remove the deionized H₂O from the microfuge tube.


5. Add enough reduction buffer to cover the gel piece(s) completely (typically 100-200 µl).


Dehydrated gel pieces will swell as they absorb the reduction buffer. Make sure that there is 
enough buffer to accommodate this. 


6. Incubate the gel for 1 hour at 40°C. 


7. Add 4-vinylpyridine to the reduction buffer to a final concentration of 2% (v/v). Incubate
the gel in the dark for 1 hour at room temperature.


8. Add excess 2-mercaptoethanol (2% v/v final concentration) to stop the alkylation.
9. Carefully remove the reduction buffer using a micropipettor or a Pasteur pipette and discard. 


10. Wash the gel pieces extensively in three changes of 1 ml each of H₂O for ~45 minutes.
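

Both methods call for adding 4-vinylpyridine (and later 2-mercaptoethanol) to an existing volume of buffer to reach a final concentration of 2% (v/v). Because the addition itself increases the total volume, the amount required is slightly more than 2% of the starting volume. The sketch below shows one way to work this out; it is an illustration under the assumption that volumes are additive, and the function name `volume_to_add` is hypothetical.

```python
def volume_to_add(buffer_volume, final_fraction):
    """Volume of neat reagent to add to buffer_volume so that the reagent
    makes up final_fraction (v/v) of the combined volume.

    Solves v / (buffer_volume + v) = final_fraction for v.
    Units of the result match the units of buffer_volume.
    """
    if not 0.0 <= final_fraction < 1.0:
        raise ValueError("final_fraction must be in [0, 1)")
    return buffer_volume * final_fraction / (1.0 - final_fraction)


# Whole-gel method: 50 ml of reduction buffer, 2% (v/v) final.
whole_gel_ml = volume_to_add(50.0, 0.02)

# Excised-spot method: assume 100 µl of reduction buffer, 2% (v/v) final.
gel_spot_ul = volume_to_add(100.0, 0.02)

print(f"whole gel: {whole_gel_ml:.2f} ml of 4-vinylpyridine")
print(f"gel spot: {gel_spot_ul:.2f} µl of 4-vinylpyridine")
```

In practice the difference from a simple 2% of the starting volume is small (about 1.02 ml versus 1.00 ml for 50 ml of buffer), and either is likely to be adequate given that the reagent is used in excess.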


PROTOCOL 13 


In-gel Proteolytic Digestion and Extraction of Peptides 


POLYACRYLAMIDE GEL ELECTROPHORESIS IS ONE OF THE MOST COMMONLY USED TOOLS for
proteome analysis, especially when the method is combined with peptide mapping and internal
sequence analysis procedures for identifying and characterizing proteins. Two main
approaches for digesting gel-separated proteins to generate peptide maps are (1) protein
blotting onto immobilizing membranes followed by on-membrane digestion (Simpson et al.
1989; Gevaert and Vandekerckhove 2000; see Protocol 14) and (2) digesting proteins directly
in the gel matrix and extracting the peptides (Ward et al. 1990). It is now generally
recognized that the overall recoveries of peptides from in-gel digestion methods are significantly
greater than those from on-membrane digestion strategies. In this protocol, adapted from
Simpson and Reid (1998), a general method for in situ digestion of gel-separated proteins is
provided. Because in-gel digestions might fail when the visualized protein is below ~10 µg of
protein/cm³, it is customary to combine several weakly stained Coomassie Blue gel spots and
reconcentrate them electrophoretically in a new gel, either a one-dimensional SDS-PAGE gel
(see Chapter 2) or by using the Pasteur pipette gel procedure for concentrating proteins
described in Chapter 6, Protocol 4.


MATERIALS


CAUTION: See Appendix 3 for appropriate handling of materials marked with <!>.


» Reagents


Buffers for in situ trypsin cleavage
Digestion buffer (0.05 M NH₄HCO₃ <!>)


Trypsin (0.1 µg/ml) in digestion buffer
Prepare a stock solution of trypsin (~0.1 µg/ml) in 1 mM HCl <!> and store at -20°C. Prior to
use, dilute the trypsin stock solution 1:1 with digestion buffer to ensure the correct concentration
and that the pH of the enzyme solution is 7.8.


Wash buffer
0.1 M NH₄HCO₃
50% acetonitrile <!>


Buffers for in situ Achromobacter lyticus protease I (Lys-C) cleavage
A. lyticus protease I (0.1 µg/ml) in digestion buffer
Digestion buffer
0.1 M Tris-HCl (pH 9.3)
Wash buffer
0.05 M Tris-HCl (pH 9.3)
50% acetonitrile


Trifluoroacetic acid (TFA) <!> (0.1% v/v) aqueous containing 60% acetonitrile
Prepare this reagent by adding 600 ml of acetonitrile to a 1-liter glass measuring cylinder and then
adjusting the volume to 1 liter with H₂O. Add 1.0 ml of TFA to achieve a final concentration of 0.1%
(v/v).


» Equipment


Lyophilizer, centrifugal (e.g., SpeedVac, Thermo Savant) 
Microfuge tubes 


Water bath preset to 30°C 


» Biological Sample


Proteins separated on a two-dimensional gel 


METHOD 


1. Excise the protein gel spots of interest and place in microfuge tubes. 


2. Remove excess Coomassie Blue stain by washing twice with 1 ml of either 0.1 M
NH₄HCO₃, 50% acetonitrile (for trypsin) or 0.05 M Tris-HCl (pH 9.3), 50% acetonitrile
(for Lys-C), each for 30 minutes at 30°C.


3. Dry each gel piece completely by centrifugal lyophilization. The gel piece should not stick 
to the walls of the microfuge tube when completely dry. 


4. Rehydrate the gel piece by adding 10 µl of digestion buffer, containing 0.5 µg of the
appropriate protease, directly onto the dried gel piece.


For other enzymes, see Table 7.4.


5. Wait ~10-20 minutes, until the solution has been absorbed by the gel piece.


6. If necessary, repeat Steps 4 and 5 to allow the gel piece to fully swell.


7. Add 200 µl of digestion buffer to fully immerse the gel piece.


8. Incubate for ~16 hours at 35°C.


9. Carefully remove the digestion buffer (now called the extract), and place it into a clean
microfuge tube.


The digestion buffer contains >80% of the extractable peptides.


10. Add 200 µl of 0.1% TFA, 60% acetonitrile to the gel piece.
11. Incubate the tube with the gel piece for 30 minutes at room temperature. 


12. Carefully remove the extract away from the gel piece and combine it with the extract from 
Step 9. 


13. Reduce the volume of the pooled extracts by centrifugal lyophilization. Do not dry the
pooled extracts completely; otherwise, sample loss may result.


The objective of this step is to remove acetonitrile from the pooled extracts and reduce the
volume for subsequent dilution and injection onto a capillary RP-HPLC system.


PROTOCOL 14 


On-membrane Proteolytic Digestion of 
Electroblotted Proteins 


THERE ARE TWO MAIN APPROACHES FOR DIGESTING PROTEINS to generate peptide maps for
gel-separated proteins: (1) protein blotting onto immobilizing membranes (see Chapter 6,
Protocol 2 and Chapter 4, Protocols 15 and 16) followed by on-membrane digestion (Simpson
et al. 1989; Gevaert and Vandekerckhove 2000), and (2) digesting proteins directly in the
gel matrix and extracting the peptides (Ward et al. 1990) (see Protocol 13). In this protocol, 
an adaptation of the Aebersold (1993) method for performing on-membrane protein diges- 
tion is provided. 


MATERIALS 
CAUTION: See Appendix 3 for appropriate handling of materials marked with <!>.


» Reagents


Acetic acid (1% aqueous) <!>
Amido Black 10B dye (0.1%) in H₂O/acetic acid/methanol <!> (45:10:45, v/v/v)
Digestion buffer
For trypsin (bovine or porcine), chymotrypsin, LysC, and AspN, use 100 mM NH₄HCO₃ <!> (pH
7.9), 10% (v/v) acetonitrile <!>, at 37°C overnight. Use an enzyme concentration of ~100 ng of
enzyme/µl of digestion buffer, irrespective of the amount of substrate.
NaOH (200 mM) <!>
Ponceau S dye (0.1%) in 1% aqueous acetic acid
PVP-40 (0.5% w/v) in 100 mM acetic acid
PVP-40 is used to prevent adsorption of the protease to the nitrocellulose during digestion. This
detergent can be replaced by 1% Zwittergent 3-16 (Calbiochem) (Lui et al. 1996), which is fully
compatible with downstream processing methods such as RP-HPLC (Zwittergent 3-16 is UV
transparent) and MALDI-TOF mass spectrometry.
WARNING: Zwittergent 3-16 is not compatible with nanoseparation techniques such as capillary
RP-HPLC because the detergent competes with the peptides for the binding sites on the
reversed-phase column.


» Equipment


Electroblotting apparatus 
Microfuge tubes (1.5, 0.5, 0.2 ml)
Nitrocellulose or PVDF membrane 
RP-HPLC column 


» Biological Sample 
Gel containing the proteins of interest separated by electrophoresis


METHOD


Electroblotting and Staining the Proteins 


1. Electroblot the proteins from the gel onto a nitrocellulose or PVDF membrane. 


Typically, this takes 2 hours for 0.5-mm-thick gels. For proteins that are difficult to transfer,
add up to 0.005% SDS <!> to the transfer buffer. Nitrocellulose membranes are preferred
to PVDF membranes because the hydrophobic surface of PVDF limits the recovery of
peptide fragments, whereas nitrocellulose gives higher yields of recovered peptides.


It is not possible to perform on-membrane digestion (PVDF) after amino-terminal sequence
analysis of the protein. Because nitrocellulose is not inert to Edman chemistry reagents and
solvents, this membrane cannot be used for direct amino-terminal sequence analysis. Neither
the Amido Black nor the Ponceau S staining procedure interferes with proteolytic digestion,
the extraction of peptides from the membrane, or subsequent RP-HPLC analysis of peptides.


2. Stain the membrane with either Amido Black or Ponceau S.
For staining with Amido Black 


a. Immerse the nitrocellulose membrane or PVDF membrane in 0.1% Amido Black 10B
for 1-3 minutes.


b. Rapidly destain with several washes of H₂O/acetic acid/methanol.


c. Rinse the destained blots thoroughly with deionized H₂O to remove any excess acetic
acid.


d. Cut out the stained protein bands (or for two-dimensional gel spots, up to 40 spots
from identical gels may be required) and transfer these bands to 1.5-ml microfuge
tubes for immediate processing (begins with Step 3) or for storage at -20°C.


For staining with Ponceau S


a. Immerse the nitrocellulose membrane in 0.1% Ponceau S for 1 minute.


b. Gently agitate the blot for 1-3 minutes in 1% aqueous acetic acid to remove excess
stain.


c. Cut out the protein bands of interest and transfer them to microfuge tubes.


d. Destain the protein bands by washing the membrane pieces with 200 mM NaOH for
1-2 minutes.


e. Wash the membrane pieces with deionized H₂O and process them immediately
(begins with Step 3) or store them wet at -20°C (avoid excessive drying).


Digestion of the Membrane-bound Protein 


3. Add 1.2 ml of 0.5% (w/v) PVP-40 in 100 mM acetic acid to each tube. 
4. Incubate the tube for 30 minutes at 37°C. 


5. Centrifuge the tube at ~1000g for 5 minutes.


6. Remove the supernatant solution and discard.


7. Add ~1 ml of H₂O to the tube.


It is essential to remove excess PVP-40 before RP-HPLC peptide mapping because of the
strong UV absorbance of this detergent. Moreover, breakdown products of PVP-40 also produce
major contaminant peaks in electrospray ionization mass spectrometry (ESI-MS).


8. Vortex the tube for 5 seconds.


9. Repeat Steps 5 and 6.


10. Repeat Steps 7-9 five more times.


11. Cut the nitrocellulose strips into small pieces (~1 x 1 mm) and place them in a fresh tube
(0.5- or 0.2-ml tube).


12. Add the minimal quantity of digestion buffer (10-20 µl) to submerge the nitrocellulose
pieces.


13. After digestion, typically 16 hours or overnight at 37°C, load the total reaction mixture
onto an appropriate RP-HPLC column for peptide fractionation (or store the peptide
mixture at -20°C until use) (see Protocol 15 and Chapter 5).


PROTOCOL 15 


Reversed-phase HPLC of Peptides: 
Standard Conditions


IN THIS PROTOCOL, ONLY NONSPECIALIZED COLUMNS, MOBILE PHASES, and instrumentation that
are readily available and easily operated are described. For more details concerning
chromatographic theory, see Mant and Hodges (1991) and Snyder and Kirkland (1979), and for
HPLC troubleshooting, see www.proteinsandproteomics.org.


EVALUATING THE RP-HPLC System 


It is extremely important to evaluate the RP-HPLC system (both HPLC hardware and
column) before using it to analyze a mixture of peptides from a biological sample.
This is especially true when a new column enters the laboratory, and applies to both
beginners and more experienced chromatographers who wish to obtain reproducible peptide
maps.

When using a new column, make sure first to read the manufacturer's instructions and
recommendations for proper conditioning, use, and storage of the column. Often, subtle
operating conditions that are crucial for correct column usage are overlooked, resulting in
irreversible column damage and the possible loss of an important sample. Once the operating
procedures and column limitations (e.g., operating pressure and stability with
respect to pH) are understood, perform a test separation using a set of standards (e.g.,
peptide or protein mixture) that have been designed to evaluate column performance. For
this exercise, it is essential to reproduce the standard set of operating conditions described
in this protocol before proceeding with an analysis of the test sample. If an equivalent elution
profile cannot be obtained on the new column (see below) using these standards and
standard operating conditions, consider the new column suspect and discuss the problem
with the manufacturer. (It is easier to return a column that fails to perform with a recognized
set of standards than to report to the manufacturer only that the column "cannot separate"
an ill-defined complex mixture of interest.) These standard
operating conditions should be used on a regular basis (and carefully logged) to monitor
column performance and the lifetime of the column.

Because stationary phases differ widely from one manufacturer to another (see Table
5.1) in terms of their physical characteristics (the functional ligand used, ligand density,
base support [matrix], particle size, and pore size, as well as column dimensions), standard
protein/peptide separations are a useful diagnostic of these differences (and can be
exploited in a purification strategy). For a review of current columns for RP-HPLC separations
in highly aqueous mobile phases, see Majors and Przybyciel (2002). If samples are
to be collected for further characterization, careful attention should be given to the "dead
volume" of the plumbing between the flow cell and outlet (see below).

For useful RP-HPLC Web Site tutorials, see Chapter 7, and for HPLC troubleshooting
and accessories, see http://www.ionsource.com.


MATERIALS


TIPS FOR PREPARING HPLC FOR PEPTIDE MAPPING
* Read the manufacturer's instructions for correct column usage.


* Ensure that the HPLC system is in good working condition by running a blank (i.e., no
column installed or sample injected).


* Evaluate the chromatographic system by running a set of "known standards."


CAUTION: See Appendix 3 for appropriate handling of materials marked with <!>.


» Reagents


Acetonitrile (HPLC grade) (J.T. Baker, Mallinckrodt, or equivalent) <!>
Deionized H₂O (HPLC grade, 18 MΩ) (Milli-Q, Millipore, or equivalent)
Protein standards (in order of their elution from the suggested reversed-phase columns)
Ribonuclease B (R 7884, Sigma)
Hen egg lysozyme (L 6876, Sigma)
Bovine serum albumin (A 3675, Sigma)
Carbonic anhydrase (C 3934, Sigma)
Myoglobin (M 0630, Sigma)
Ovalbumin (albumin chicken egg) (high-purity grade from Sigma, A 7641)


Dissolve the protein standards (1 mg of each protein/ml) in H₂O, and divide them into 100-µl
aliquots. These standards can be stored frozen for several weeks, but do not freeze-thaw them
more than five times. Upon thawing aliquots to use as HPLC controls, dilute them 1:10 with
Solvent A just prior to use (see Step 2).


Trifluoroacetic acid (TFA, HPLC grade) (Pierce or equivalent) <!>


» Equipment


HPLC chromatographic system
The system should be equipped with programmed gradient elution, UV detection at 210-220 nm,
components for acquisition of chromatographic data, and integration of peak areas.


The system must be capable of accurate and reproducible solvent delivery and gradient formation. 
For a schematic of a generic HPLC system, see Figure 7.10A (at the end of the protocol). 


Polypropylene tubes (1.5 ml) with tight-fitting caps 


Reversed-phase column (100 Å or 300 Å pore size)
For a description, see Chapter 5. Use a 5-10-µm particle size packed into a column of 100 x 4.6-mm
I.D. or cartridge of 100 x 2.1-mm I.D.


Commonly used columns for samples of >50 µg used in the author's laboratory:
Brownlee RP-300 (100 x 2.1 mm, 7 µm, 300 Å) (C4) (from Alltech, http://www.alltechweb.com)
Vydac 218TP3215 (150 x 2.1 mm, 3 µm, 300 Å) (C18) (Vydac)
For samples >100 µg, use the same support, but packed into 4.6-mm I.D. columns.


» Biological Sample


Peptide mixture generated by digestion of protein (as in Protocol 5, 6, 7, 8, or 13)


» Solvents


SOLVENT PREPARATION 


If both Solvents A and B are 0.1% (v/v) TFA, then the absorbance at 214 nm of Solvent
B will be slightly higher (20-40 milli-absorption units [mAU]) than that of Solvent A,
due to the contributing absorbance of acetonitrile. This will result in a rising baseline
during the development of a gradient from 0% to 60% acetonitrile. This does not present
a problem when working with 50-100-µg amounts of material (using a 4.6-mm I.D.
column). The rising baseline, however, can become a serious problem when
working at low-microgram levels using microbore columns (≤2.1 mm I.D.). The baseline
will be off-scale, and late-eluting peaks cannot be detected without adjusting the
baseline, which is extremely difficult to accomplish during the course of a chromatographic
run (10 ng of protein/peptide has an absorbance at 214 nm of ~1 mAU).

To overcome this potential problem, it is recommended that the amount of TFA in
Solvent B be slightly less than that in Solvent A (e.g., 0.09% [900 µl of TFA/1 liter]
compared with 0.1% [1000 µl of TFA/1 liter]). By these means, the amount of TFA can be
adjusted carefully, by trial and error, to achieve a flat baseline. For high-sensitivity
work (submicrogram levels of peptide/protein), where the expected peak heights are
10-20 mAU on a 2.1-mm I.D. column (1 µg = 100 mAU and 10 ng = 1 mAU on a 2.1-mm
I.D. column), 0.085% or 0.086% (v/v) TFA in Solvent B may be needed. (The concentration
of TFA used in Solvent B will depend on the quality, i.e., UV transmittance, of
the acetonitrile used.)
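

The TFA percentages quoted above translate directly into pipetting volumes. The short sketch below tabulates them for 1 liter of solvent; it is only an illustration of the arithmetic, and the function name `tfa_microliters` is hypothetical. The percentages are those mentioned in the text, and the right value for a given batch of acetonitrile still has to be found by trial and error as described.

```python
def tfa_microliters(percent_vv, final_volume_liters=1.0):
    """Volume of undiluted TFA (µl) giving percent_vv (v/v) in the stated volume."""
    return percent_vv / 100.0 * final_volume_liters * 1_000_000


# Concentrations mentioned in the text for Solvent A and Solvent B.
candidates = [0.1, 0.09, 0.086, 0.085]
volumes = {pct: tfa_microliters(pct) for pct in candidates}

for pct, microliters in volumes.items():
    print(f"{pct}% (v/v) TFA: {microliters:.0f} µl per liter")
```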


If solvents are not degassed, then bubble formation can occur during a chromatographic
run as a result of out-gassing. Such bubbles can lodge in the flow cell, resulting in an
erratic detector signal (i.e., baseline). To minimize this risk, it is recommended that
Solvent B be a "percentage buffer," i.e., containing H₂O (e.g., 60% acetonitrile, 40% H₂O
containing 0.1% TFA). Additionally, solvents should be degassed by purging with high-purity
helium or applying constant vacuum over a period of ~30 minutes.

With alcohol-type solvents (e.g., methanol and ethanol), a slow buildup of esters occurs
at room temperature, resulting in an increased baseline. For this reason, solvents must be
prepared every 1-2 days.


Solvent A (aqueous 0.1% v/v trifluoroacetic acid [TFA]; 14 mM)


Use a micropipettor to add 1.0 ml of undiluted TFA to 1 liter of H₂O in a stoppered glass measuring
cylinder. Mix thoroughly and then pour the diluted TFA into a clean liquid chromatograph/
HPLC reservoir bottle that has been washed extensively with Milli-Q H₂O.

EXPERIMENTAL TIP: It is good practice to set aside a 1-liter glass cylinder to be used for this pur- 
pose only. 


Solvent B (60% v/v acetonitrile and 40% H₂O containing 0.1% v/v TFA)


Add 600 ml of acetonitrile to a 1-liter glass-stoppered cylinder and adjust to 1 liter with H₂O. Add
1 ml of undiluted TFA (final concentration 0.1% v/v TFA) and mix the solution thoroughly.


EXPERIMENTAL TIP: Due to increasing accumulation of UV-absorbing contaminants, use ace- 
tonitrile-based solvents for no longer than 5 days, and alcohol/acid-based solvents for no more than 
24 hours. 


METHOD


1. Run a blank or “control chromatogram” (in which no sample is loaded in Step 1). 
Typically, this is the first chromatographic run of the day. 


The recommended standard chromatographic conditions for RP-HPLC system evaluation
are as follows:


Linear gradient from 0% to 100% B, where Solvent A is aqueous 0.1% (v/v) TFA, and
Solvent B is 0.09% (v/v) TFA in 60% aqueous acetonitrile.


Gradient rate: 1% B/minute (i.e., 0-100% Solvent B in 60 minutes).
Flow rate: 1 ml/min for a 4.6-mm I.D. or 0.1 ml/min for a 2.1-mm I.D. column.
Temperature: 45°C.
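
As an illustration of the linear gradient just described, the sketch below computes the mobile-phase composition at any time during the run. It assumes an ideal gradient with no system delay volume; the function names are hypothetical and not taken from any instrument software.

```python
def percent_b(t_min, gradient_min=60.0, start_b=0.0, end_b=100.0):
    """Percentage of Solvent B at time t_min for an ideal linear gradient."""
    if t_min <= 0:
        return start_b
    if t_min >= gradient_min:
        return end_b
    return start_b + (end_b - start_b) * t_min / gradient_min


def percent_acetonitrile(t_min, acetonitrile_in_b=60.0):
    """Acetonitrile content (% v/v) of the mobile phase, given that
    Solvent B contains 60% acetonitrile and Solvent A contains none."""
    return percent_b(t_min) * acetonitrile_in_b / 100.0


if __name__ == "__main__":
    for t in (0, 15, 30, 45, 60):
        print(t, "min:", percent_b(t), "% B,", percent_acetonitrile(t), "% acetonitrile")
```

Under these assumptions, a peptide eluting at 30 minutes leaves the column in roughly 50% B, i.e., about 30% acetonitrile.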


Timetable for linear gradient chromatography


                                 Mobile-phase      Flow rate              Flow rate
Step number            Time      composition       (4.6-mm I.D. column)   (2.1-mm I.D. column)
1. Sample injection    0 min     100% Solvent A    1 ml/min               0.1 ml/min
2.                     60 min    100% Solvent B    1 ml/min               0.1 ml/min
3. Reequilibration     a         100% Solvent A    1 ml/min               0.1 ml/min


a is the time taken to pass 20 column volumes of Solvent A through the column. As an
approximate guide, column volumes can be calculated by dividing the internal volume of
the column by 2 to take into account the volume of the chromatographic packing. The internal
volume of the column can be determined by measuring the column length and internal
diameter and applying the equation $V = \pi r^{2} h$, where V = column volume, r = internal radius
of column, and h = length of the column.
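
The reequilibration time a can be estimated from the column dimensions. The sketch below applies the equation and the divide-by-2 rule of thumb given above; the function names are illustrative, and the 2.1-mm I.D. x 100-mm dimensions are those of the cartridge cited in the legend to Figure 7.9.

```python
import math


def internal_volume_ul(id_mm, length_mm):
    """Empty-tube volume V = pi * r^2 * h; with millimeters, mm^3 equals microliters."""
    radius_mm = id_mm / 2.0
    return math.pi * radius_mm ** 2 * length_mm


def column_volume_ul(id_mm, length_mm):
    """Approximate column volume: internal volume divided by 2 to allow for packing."""
    return internal_volume_ul(id_mm, length_mm) / 2.0


def reequilibration_min(id_mm, length_mm, flow_ml_min, n_volumes=20):
    """Time (minutes) needed to pass n_volumes column volumes at the given flow rate."""
    total_ml = n_volumes * column_volume_ul(id_mm, length_mm) / 1000.0
    return total_ml / flow_ml_min


if __name__ == "__main__":
    print(round(column_volume_ul(2.1, 100), 1), "ul per column volume")
    print(round(reequilibration_min(2.1, 100, 0.1), 1), "min for 20 column volumes")
```

For the 2.1-mm I.D. x 100-mm cartridge at 0.1 ml/min, this gives a column volume of about 173 µl and a reequilibration time of roughly 35 minutes.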

These conditions were used to produce the elution profile of the blank chromatogram shown
in Figure 7.9A.

Before commencing the blank run, pump 20 column volumes of Solvent B through the column
and then follow this with an equal volume of Solvent A (ensure that the baseline is stabilized,
i.e., remains flat). Adjust the flow rate to an operating flow rate of 1 ml/min (or 0.1
ml/min for 2.1-mm I.D. column) and wait until the baseline stabilizes (~5 minutes).


2. Run the protein standards (control run) as described in Step 1, but inject 100 µl of 0.1
mg/ml protein stock standards for the 2.1-mm I.D. column (i.e., 10 µg of each protein
standard; 10 µg ≈ 1000-2000 mAU at a flow rate of 0.1 ml/min). For a 4.6-mm I.D. column,
load 100 µl of 1.0 mg/ml protein stock standards, i.e., 100 µg of each protein standard
(100 µg ≈ 1000-2000 mAU at a flow rate of 1.0 ml/min).


These conditions were used to produce the elution profile of the control protein standards
chromatogram shown (only 2.1-mm I.D. column profile shown) in Figure 7.9B. An alternative
to protein standards is a tryptic digest of cytochrome c (Figure 7.9C).


3. Run the test peptide digest mixture, using the same conditions as in Step 2 above (see 
Figure 7.8 for peptide maps of IL-6). 


4. Collect the eluting peptide peaks in polypropylene tubes with tight-fitting caps and store
them at -20°C for further analysis.


If sample collection is to be performed manually, the dead volume of the plumbing from the
detector flow cell to the outlet tubing must be determined accurately.
FIGURE 7.9. Standard conditions for RP-HPLC of peptides. (A) Blank run; (B) 10 µg each protein standard
(in order of elution): ribonuclease B, cytochrome c, lysozyme, bovine serum albumin, carbonic anhydrase,
myoglobin, and ovalbumin; (C) tryptic digest of cytochrome c (10 µg) in aqueous 0.1% TFA.
Chromatographic conditions: linear 60-minute gradient of 0-100% Solvent B, where Solvent A is aqueous
0.1% TFA, and Solvent B is aqueous 0.09% TFA containing 60% acetonitrile. Flow rate was 0.1 ml/min and
column temperature was 45°C. Chromatographic support: Brownlee RP-300 2.1-mm I.D. x 100-mm cartridge.
HPLC system: Agilent 1100 HPLC equipped with diode-array detector. (Figure provided courtesy of
Robert L. Moritz.)


DETERMINING DEAD VOLUME OF THE PLUMBING FROM THE DETECTOR FLOW CELL
TO THE TUBING OUTLET


A good approximation of the dead volume (d µl) can be made using the following equation:
$d = \pi r^{2} h$, where h is the length of the tubing from the flow cell to the outlet (in
millimeters), and r is the radius (in millimeters) of the tubing.

The preferred method for determining the dead volume involves the injection of a visible
dye (aqueous 0.01% w/v Coomassie Blue) and accurately determining the time (using
a stopwatch) that elapses between the first appearance of dye in the flow cell (i.e., off-scale
detector signal) and the appearance of blue dye from the outlet tubing (easily visualized by spotting
the eluent onto white blotting paper). When performing this task, the column should be
disassembled from the HPLC and replaced by a comparable length of small-bore tubing
that is then connected to the flow cell.
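
Both estimates of the dead volume reduce to short calculations, sketched below. The geometric estimate follows the equation given above. Converting the stopwatch time from the dye method into a volume by multiplying by the flow rate is an assumption of this sketch (the text specifies only the timing), and the tubing length and delay used in the example are hypothetical.

```python
import math


def dead_volume_geometric_ul(radius_mm, length_mm):
    """d = pi * r^2 * h, with r and h in millimeters, giving microliters."""
    return math.pi * radius_mm ** 2 * length_mm


def dead_volume_from_dye_ul(delay_s, flow_ml_min):
    """Volume delivered during the measured delay between flow cell and outlet."""
    flow_ul_min = flow_ml_min * 1000.0
    return (delay_s / 60.0) * flow_ul_min


def collection_delay_s(dead_volume_ul, flow_ml_min):
    """Seconds to wait after a peak appears on the detector before collecting it."""
    flow_ul_min = flow_ml_min * 1000.0
    return dead_volume_ul / flow_ul_min * 60.0


if __name__ == "__main__":
    # Hypothetical example: 300 mm of 0.007-inch I.D. tubing (radius 0.0889 mm).
    print(round(dead_volume_geometric_ul(0.0889, 300), 2), "ul (geometric)")
    # Hypothetical example: dye takes 6 seconds to reach the outlet at 0.1 ml/min.
    print(round(dead_volume_from_dye_ul(6, 0.1), 2), "ul (dye method)")
```

When collecting manually, the delay returned by `collection_delay_s` is the interval between a peak registering at the detector and its emergence from the outlet at the operating flow rate.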


COLUMN STORAGE 


For long-term storage, columns should be flushed and stored with an aqueous organic solvent
(e.g., 30% v/v methanol/70% H2O). For overnight storage, pump the columns with
Solvent B at a very slow flow rate (e.g., 50 µl/min).
Cautionary Notes


* Routine for new column usage: Flush the HPLC system completely before attaching the column.
Prime both solvent lines with Solvents A and B, respectively, to ensure that there are no
bubbles in the system. Connect the solvent delivery tubing from the injector to the top end of
the column and the tubing from the column outlet to the flow cell. Pass 30 column volumes
of Solvent B through the column and then follow this with 50 volumes of Solvent A. The column
is now ready to be evaluated using the procedure described in Steps 1-4.


* Connection of columns to HPLC plumbing: Before connecting a column to the HPLC
plumbing, take care to ensure that column end-fittings, ferrules, and tubing depths are correctly
matched. If column end-fittings are used incorrectly, the column can be seriously damaged,
resulting in solvent leakage and/or high dead volumes (and hence poor chromatographic
performance). For a technical description of the interchangeability of column fittings
and possible problems that ensue from incorrect fittings, see Chapter 5 and the Upchurch catalog
or proceed to their Web Site at www.upchurch.com/Techinfo/interchange.asp.


SOLVENTS CONTAINING SALTS


Neutral pH solvent systems such as unbuffered 1% (w/v) NaCl/acetonitrile and 0.1 M sodium
phosphate (pH 7.4)/acetonitrile are commonly used in RP-HPLC peptide separations when
seeking selectivity other than that achieved with conventional acidic solvents such as 0.1%
TFA/acetonitrile (for an example, see the peptide mapping of IL-6 in Figure 7.8). Always
choose salts of the highest quality, such as Aristar-grade reagents from BDH. When using
salt-containing solvents, the percentage of organic solvent in Solvent B should not exceed 50%;
otherwise, there is a high risk that these salts will precipitate and clog the HPLC tubing (typically,
0.007- and 0.010-inch capillary tubing is used in modern HPLC systems), as well as the column.
Such damage can be very expensive to repair! To avoid possible precipitation of salts,
salt-containing solvents should be premixed by adding equal volumes of organic solvent to the buffer
salt solution (e.g., 2% w/v aqueous NaCl or 0.2 M sodium phosphate at pH 7.4) to give a final
concentration of 1% (w/v) NaCl/50% organic solvent or 0.1 M phosphate/50% organic solvent.
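
The premixing rule above is a simple dilution. The sketch below, offered only as an illustration, computes the final salt concentration and organic content of a premixed solvent and flags mixtures that exceed the 50% organic limit. It assumes that volumes are additive, which ignores the small volume contraction on mixing; the function names are hypothetical.

```python
def premix(stock_salt_conc, stock_volume_ml, organic_volume_ml):
    """Final salt concentration (same units as the stock) and % organic (v/v),
    assuming the two volumes are simply additive."""
    total_ml = stock_volume_ml + organic_volume_ml
    return {
        "salt_conc": stock_salt_conc * stock_volume_ml / total_ml,
        "organic_percent": 100.0 * organic_volume_ml / total_ml,
    }


def exceeds_organic_limit(organic_percent, limit_percent=50.0):
    """True if the organic content is above the recommended 50% ceiling."""
    return organic_percent > limit_percent


if __name__ == "__main__":
    nacl = premix(2.0, 500, 500)        # 2% w/v NaCl stock plus equal volume of organic solvent
    phosphate = premix(0.2, 500, 500)   # 0.2 M phosphate stock plus equal volume of organic solvent
    print(nacl, exceeds_organic_limit(nacl["organic_percent"]))
    print(phosphate, exceeds_organic_limit(phosphate["organic_percent"]))
```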


EQUIPMENT FOR GRADIENT ELUTION 


The system shown in Figure 7.10B (left-hand panel) involves the high-pressure mixing of two solvents to generate a
mobile-phase gradient by computer programming the delivery from the high-pressure system.
The output from two high-pressure pumps (reciprocating and/or displacement-type pumps 
are used for this purpose) is programmed into a low-volume mixing chamber before flowing 
into the column. In the low-pressure gradient former shown in Figure 7.10B (right-hand 
panel), solvent gradients are first formed by mixing two or more solvents at atmospheric pressure.
The relative amounts of
the various solvents can be selected by proportioning the time that the valves to the various
reservoirs are opened or closed, using an electronic gradient programmer and controller. After
the desired solvent mixture is generated, it is mixed and fed to a single reciprocating high-pressure
pump for pressurization before flowing into the column. Like the binary pump-type
device, useful gradients can be generated with two reservoirs containing solvents of different 
strengths. In general, the binary pump devices have a lower flow-path volume than the low- 
pressure gradient-forming devices and are considered to be technically superior. 
FIGURE 7.10. Generic HPLC systems. (A) Schematic diagram of an HPLC system. An HPLC system principally consists of four
components: a solvent delivery system, a separation device, a detection unit, and computer-controlled systems controller. (B) 
Solvent delivery systems. In the binary-solvent delivery system, each solvent is fed into a mixing chamber by a dedicated pump in a 
manner that allows formation of an accurate gradient with differing compositions of Solvent A and Solvent B (accomplished by 
differentially adjusting the flow rates of Solvent A and Solvent B delivery to achieve the desired solvent gradient). Because this type 
of gradient-forming system is generated at pressures greater than atmospheric pressure, it is referred to as a “high-pressure gradi- 
ent former." The basic principle of a proportioning-valve solvent delivery system is to "suck" the solvents from the solvent reservoirs 
through a proportioning valve. The desired solvent composition of the gradient is accomplished by accurately timing the delivery
of each solvent, i.e., volume per unit time, through the proportioning valve. Because the solvents are at atmospheric pressure, this
solvent delivery system is referred to as a “low-pressure gradient former.” Since the solvents in this system are drawn at atmospheric
pressure (or, in some cases, slightly negative pressure), these systems are prone to cavitation (i.e., air bubble formation), requiring
that the solvent reservoirs be constantly purged with an inert gas such as helium or, alternatively, on-line vacuum-sparged.
SDS-PAGE Peptide-mapping Procedure
(Cleveland Method)


MATERIALS 


ONE OF THE MOST QUOTED PROCEDURES FOR PEPTIDE MAPPING of proteins is the Cleveland
method, named after its originator (Cleveland et al. 1977). This method relies on partial 
hydrolysis of the protein yielding a considerable number of large peptides that can be readi- 
ly identified by SDS-PAGE (see Chapter 2, Protocol 1) or tricine-SDS-PAGE (see Chapter 2, 
Protocol 11). (Small peptides obtained with a "total digest" are typically poorly fixed in the 
gel and may be washed out and lost, thereby contributing nothing to the peptide map.) There 
are two main approaches for performing Cleveland peptide maps. One approach, adapted 
from Cleveland et al. (1977), and described in this protocol, relies on performing a partial 
digest of the protein in solution and then separating the peptide fragments by SDS-PAGE. In 
the other approach (see Protocol 17), sample proteins are first separated by gel electrophore- 
sis (either one- or two-dimensional gel electrophoresis), and then in situ hydrolysis is per- 
formed, which is followed by a second gel step to separate the peptides. 


Reagents


CAUTION: See Appendix 3 for appropriate handling of materials marked with <!>.


Buffers and acrylamide <!> solutions necessary for running SDS-PAGE 
Prepare buffers and an SDS-PAGE gel as described in Chapter 2, Protocol 1. 

Fixing/staining/destaining solutions for visualizing peptide fragments 
See Step 7. 

2-Mercaptoethanol <!> 


Protein standards of known molecular weights in SDS sample buffer 
For example, a low-molecular-weight kit from Bio-Rad or equivalent or peptide molecular-weight 
markers (Amersham Biosciences or equivalent). 


Proteinase of choice (see Table 7.4) 
The activity of many proteinases is hindered by SDS (e.g., trypsin, chymotrypsin, and thermolysin;
see Table 7.6). Endoproteinase Glu-C (or V8 protease), which can tolerate high concentrations of
SDS, is recommended for the Cleveland method.


SDS (10%) <!>
SDS-PAGE sample buffer 
0.125 M Tris-Cl (pH 6.8) 
0.5% SDS 
10% (v/v) glycerol 
0.001% bromophenol blue <!> 
Equipment
Polypropylene tubes (tightly capped)
Slab gel electrophoresis equipment (see Chapter 2)


Either use a uniform concentration 15%T, 2.6%C gel, or a concentration gradient 5-20%T (or
10-25%T) gel in the Laemmli buffer system.


Water bath (boiling) 


Biological Sample
Protein, purified sample (lyophilized or in solution) 


METHOD 


1. Prepare the protein samples (0.5 mg/ml) in SDS sample buffer and boil them for 2-5 
minutes. 


2. Add proteinase to each sample, typically to 1-100 µg/ml final concentration.


3. Incubate for a set period of time, typically 10-60 minutes at 37°C. 


Alternatively, load 10-30 µl (10-20 µg) of each protein to be compared into three separate
wells of an SDS-PAGE gel. Overlay samples with 10-20 µl of digestion buffer (1% aqueous
SDS, 1 mM EDTA, 1% glycerol, 0.125 M Tris-Cl buffer, pH 6.8) containing 0.005, 0.05, and
0.5 µg of proteinase, respectively. Carefully fill sample wells with running buffer and
electrophorese until the dye reaches the bottom of the stacking gel. Turn off the power and
incubate for 2 hours at 37°C (partial digestion). Then continue electrophoresis until the dye
reaches the bottom of the gel. Fix, stain, and destain gel (or electroblot onto PVDF or
nitrocellulose for western analysis).


4. Stop proteolysis by adding 2-mercaptoethanol to 10% final concentration and SDS to 2% 
final concentration, and boiling for 2 minutes. 


5. Load 20-30 µl of each sample (10-25 µg of peptide) onto the SDS-PAGE slab gel. Also
load standard proteins of known molecular weights in parallel tracks of the gel.


6. Carry out the electrophoresis until the bromophenol blue nears the end of the gel.


7. Detect the separated peptides by Coomassie Blue staining or silver staining (see Protocol 
9 or 11 and Chapter 2, Protocols 2 and 7). 


Radiolabeled peptides can be detected by autoradiography or fluorography. 
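
Step 4 specifies final concentrations rather than volumes. As a rough guide only, the sketch below solves for the volumes of undiluted 2-mercaptoethanol and of the 10% SDS stock needed to reach 10% 2-mercaptoethanol and 2% SDS in a sample that already contains 0.5% SDS from the sample buffer. It assumes additive volumes and a negligible proteinase volume; the function name and defaults are illustrative, not part of the original protocol.

```python
def stop_solution_volumes(sample_ul, sample_sds_pct=0.5, stock_sds_pct=10.0,
                          target_sds_pct=2.0, target_me_fraction=0.10):
    """Return (2-mercaptoethanol ul, SDS stock ul, final volume ul).

    Solves the two dilution constraints simultaneously:
      me_ul = target_me_fraction * final_ul
      sample_sds_pct * sample_ul + stock_sds_pct * sds_ul = target_sds_pct * final_ul
    where final_ul = sample_ul + me_ul + sds_ul.
    """
    denominator = stock_sds_pct * (1.0 - target_me_fraction) - target_sds_pct
    if denominator <= 0:
        raise ValueError("SDS stock is too dilute to reach the target concentration")
    final_ul = sample_ul * (stock_sds_pct - sample_sds_pct) / denominator
    me_ul = target_me_fraction * final_ul
    sds_ul = final_ul - sample_ul - me_ul
    return me_ul, sds_ul, final_ul


if __name__ == "__main__":
    me, sds, final = stop_solution_volumes(100.0)
    print(f"2-mercaptoethanol: {me:.1f} ul, 10% SDS: {sds:.1f} ul, final: {final:.1f} ul")
```

For a 100-µl digest, this works out to about 13.6 µl of 2-mercaptoethanol and 22.1 µl of 10% SDS under the stated assumptions.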


PROTOCOL 17 


In Situ SDS-PAGE Peptide-mapping Procedure 
(Cleveland Method) 


MATERIALS 


THERE ARE TWO MAIN APPROACHES FOR PERFORMING CLEVELAND peptide maps. One approach,
described in Protocol 16, relies on performing a partial digest of the protein in solution and 
then separating the peptide fragments by SDS-PAGE. In the other approach (this protocol), 
sample proteins are first separated by either one- or two-dimensional gel electrophoresis. The 
gel is briefly stained and destained, and the proteins are excised for peptide mapping. Peptide 
fragmentation (either enzymically or chemically) is performed in situ in the gel pieces, and 
the partial digest is separated in a second gel. 


Reagents


CAUTION: See Appendix 3 for appropriate handling of materials marked with <!>. 


Chemical cleavage reagent (for Method 2) 
See Tables 7.3 and 7.7. For peptide mapping after chemical cleavage at aspartyl-prolyl peptide
bonds, see Rittenhouse and Marcus (1984).


Coomassie Brilliant Blue (0.1% w/v) 
Dissolve the Coomassie Brilliant Blue in 50% (v/v) methanol <!> and 10% (v/v) acetic acid <!>. 
Ethanol containing 1 M Tris-Cl (pH 6.8) <!> 
Polyacrylamide peptide-mapping slab gel <!> 
Either a uniform concentration gel (e.g., 15%T, 2.6%C) or a concentration gradient gel (e.g.,
5-10%T or 10-25%T) prepared in Laemmli buffer with a conventional stacking gel (see Chapter 2,
Protocol 1).


Proteinase digestion buffer (for Method 1) 
0.125 M Tris-Cl (pH 6.8)
0.1% SDS <!> 
20% glycerol 
0.005% bromophenol blue <!> 
Proteinase solution (for Method 1) 
0.1% SDS 
10% glycerol 
0.005% bromophenol blue 
1-100 µg/ml proteinase in 0.125 M Tris-HCl (pH 6.8)
Endoproteinase Glu-C (or V8 protease), which can tolerate high concentrations of SDS, is recom- 
mended for the Cleveland method. 
Equipment


Light box and scalpel (or razor blade)
Polypropylene tubes
Slab gel electrophoresis equipment (see Chapter 2)
SpeedVac concentrator


Biological Sample
Polyacrylamide slab gel containing the separated proteins


Additional Reagents
Electrophoresis buffers (Laemmli system)
These buffers are necessary for running SDS-PAGE (see Chapter 2, Protocol 7).

Fixing/staining and destaining solutions
These solutions are needed for visualizing peptide fragments (see Protocol 9 or 11).


METHODS


Method 1: Enzymatic Fragmentation 


1. Fix the proteins that have been separated in the polyacrylamide gel. Stain with Coomassie
Brilliant Blue (5-10 minutes) and destain.

If the protein bands are too faint, recycle the gel through the staining and destaining
procedure again or lengthen the period of staining.


2. Place the gel on the light box. Excise the protein bands with a scalpel, and place the bands
in separate polypropylene tubes.


3. Soak the excised gel bands, containing the proteins to be compared, in 10 ml of ethanol
containing 1 M Tris-Cl (pH 6.8) for 30 minutes at room temperature to shrink the gel.

This facilitates loading the gel into the second slab gel. The washing time should be reduced
if the resulting peptide maps are too faint.


4. Cut the gel bands into fragments and place them in separate wells of the second slab gel
formed in the long (5 cm) stacking gel. Use a clean spatula to guide the fragments carefully
to the bottom of the wells.


5. Fill the regions around the gel fragments with proteinase digestion buffer delivered from
a syringe.


6. Overlay each sample with 10 µl of proteinase solution.

Use sufficient proteinase to complete the digestion. Try a 1:50 ratio of enzyme to protein
substrate initially.


7. Carry out electrophoresis until the bromophenol blue nears the bottom of the stacking
gel.


8. Switch off the electrical current for 20-30 minutes to allow proteinase digestion of the
protein samples.


9. Resume electrophoresis and continue until the bromophenol blue nears the bottom of
the slab gel.


10. Visualize the separated peptides by Coomassie Blue or silver staining (see Protocol 9 or
11). Alternatively, detect radiolabeled peptides using autoradiography or fluorography.
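
The 1:50 enzyme-to-substrate starting point in Step 6 can be translated into a proteinase concentration for the 10-µl overlay. The sketch below is illustrative only: it assumes a weight-to-weight ratio and a hypothetical estimate of the amount of protein in the excised band, and the function name is not from the original protocol.

```python
def proteinase_conc_ug_ml(protein_ug, substrate_per_enzyme=50.0, overlay_ul=10.0):
    """Proteinase concentration (ug/ml) giving a 1:substrate_per_enzyme (w/w)
    enzyme-to-substrate ratio when overlay_ul is applied to the well."""
    enzyme_ug = protein_ug / substrate_per_enzyme
    return enzyme_ug / (overlay_ul / 1000.0)


def within_recommended_range(conc_ug_ml, low=1.0, high=100.0):
    """Check against the 1-100 ug/ml range given for the proteinase solution."""
    return low <= conc_ug_ml <= high


if __name__ == "__main__":
    # Hypothetical band containing roughly 5 ug of protein.
    conc = proteinase_conc_ug_ml(5.0)
    print(round(conc, 1), "ug/ml; within range:", within_recommended_range(conc))
```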


Method 2: Chemical Fragmentation 


1. Fix the gel, stain with Coomassie Brilliant Blue (5-10 minutes), and destain.

If the protein bands are too faint, recycle the gel through the staining and destaining
procedure again or lengthen the period of staining.


2. Place the gel on the light box. Excise the protein bands with a scalpel, and place the bands
in separate 1.5-ml polypropylene tubes.


3. Dry the protein-containing gel pieces using a SpeedVac concentrator (or equivalent
device; e.g., air-dry).


4. Add ~10 µl of the appropriate chemical cleavage reagent (1 mg/ml) directly onto the
dried gel pieces, followed by 90 µl of cleavage reagent.


5. Incubate the mixture for the required period of time (for incubation conditions, see
Protocol 6, 7, or 8).


6. Carefully aspirate the peptide-containing supernatant liquor.

7. Dry the sample in a SpeedVac, add 50 µl of H2O, and dry the sample again.

8. Repeat the H2O wash and drying two more times.

9. Add 10-30 µl of SDS-PAGE sample buffer, and boil the samples for 5 minutes.


10. Load the samples onto the SDS-PAGE gel and proceed with the electrophoresis until the
bromophenol blue nears the bottom of the slab gel.


11. Visualize the separated peptides by Coomassie Blue or silver staining (see Protocol 9 or
11). Alternatively, detect radiolabeled peptides using autoradiography or fluorography.
NOMENCLATURE ON PROTEASES, PROTEINASES, AND PEPTIDASES


In 1984, the International Union of Biochemistry and Molecular Biology (IUBMB) recom- 
mended the use of the term peptidase for the subset of peptide bond hydrolases (Subclass EC 
3.4.). The widely used term protease is synonymous with the term peptidase. Proteases com- 
prise two groups of enzymes: the endoproteases and the exoproteases. 


* Endoproteases cleave peptide bonds at points within the polypeptide chain, generating
peptide fragments. They are used mainly for peptide-mapping studies. The endoproteases
include enzymes that have a high selectivity for cleaving peptide bonds exclusively adjacent
to charged amino acids (the basic amino acids arginine and lysine, e.g., trypsin) and the
acidic residues glutamic acid (e.g., Glu-C) and aspartic acid (e.g., Asp-N). For a more
extensive list of proteases, see Tables 7.3 and 7.4. Proteases like chymotrypsin, thermolysin,
or pepsin cleave adjacent to several amino acid residues and hence yield a more extended
fragmentation. For a list of endoproteases and their selectivities, see Table 7.3.


* Exoproteases remove amino acids sequentially from either the amino or carboxyl terminus,
respectively. They are of special importance for the removal of amino-terminal blocking
groups such as pyroglutamate (e.g., pyroglutamate aminopeptidase, 5-oxoprolyl-peptidase,
EC 3.4.19.3; Armentrout and Doolittle 1969) or an N-acetyl group (acylamino-acid-releasing
enzyme, acylaminoacyl peptidase, EC 3.4.19.1; Kobayashi and Smith 1987). For
the acylpeptide hydrolases, the neighboring amino acid residues affect the efficiency of
fragmentation considerably, and applications are generally restricted to peptides rather than
proteins. Similarly, the acyl amino-acid-releasing enzyme (AARE) (Mitta et al. 1989), used
to cleave N-acetylated amino-terminal residues, is restricted to peptides of ~40 residues in
length; hence, the polypeptide chain must be fragmented first and the amino-terminal peptide
isolated prior to AARE digestion. Cathepsin C repetitively cleaves dipeptides from the
amino terminus of a polypeptide chain. The carboxypeptidases are of particular importance
for the stepwise removal of amino acid residues from the carboxyl terminus.


Proteinase is also a term used as a synonym for endopeptidase. There are four classes of 
proteinases recognized by the IUBMB, and they have been grouped according to their cat- 
alytic mechanism: serine proteinases, cysteine proteases, aspartic proteinases, and metallo- 
proteinases. It has been suggested that this classification by catalytic types be extended by a 
classification by families based on the evolutionary relationships of proteases (Rawlings and 
Barrett 1993). Additionally, there is a section of the Enzyme Nomenclature that is allocated for 
proteases of unidentified catalytic mechanism. Although this indicates that the catalytic 
mechanism for this group of proteases has not been identified, the possibility remains that 
novel types of proteases may exist. 

Proteases are involved in a great variety of physiological processes. Their action can be 
divided into two different categories: 


* Limited proteolysis. This group of proteases cleaves only one or a limited number of pep-
tide bonds of a target protein, leading to the activation or maturation of the formerly inac- 
tive protein (e.g., conversion of prohormones to hormones). 


* Unlimited proteolysis. Proteases belonging to this group degrade proteins totally into their 
amino acid constituents. Proteins to be degraded are usually first conjugated with ubiqui- 
tin, a modification that marks them for rapid hydrolysis by the proteasome in the presence 
of ATP (for a review of ubiquitinylation, see Hershko and Ciechanover 1998; for a review 
of degradation, see Baumeister et al. 1998). Another pathway consists of the compartmen- 
tation of proteases, e.g., in lysosomes. Proteins transferred into this compartment undergo 
rapid proteolytic degradation (Baumeister et al. 1998). 
MASS SPECTROMETRY (MS)-BASED METHODS FOR THE IDENTIFICATION of gel-separated proteins
are fundamental platform technologies for proteomics. Indeed, MS has essentially replaced
the Edman degradation procedure, the classical technique for protein identification (see
Chapter 6), because it is much more sensitive, can cope with protein mixtures, and offers
much higher sample throughput. Current MS-based proteomic strategies rely primarily on
digestion of either one-dimensional (1D) (see Chapter 2) or two-dimensional (2D) (see
Chapter 4) gel-separated proteins into peptides using a sequence-specific protease such as
trypsin. The reasons for analyzing proteolytically derived peptides rather than intact proteins
by MS are that proteins are inherently more difficult to elute from polyacrylamide gels and
to analyze by MS. (Generally, the molecular weight of the protein alone is not sufficient for
unambiguous protein identification. This problem is exacerbated when posttranslational
modifications must be taken into account. Methods and instrumentation for the fragmentation
of whole-protein ions have not been readily available until recently; see information
panel on “TOP-DOWN” PROTEIN SEQUENCE ANALYSIS USING MS/MS.) In contrast, peptides are
readily eluted from gels following in situ proteolysis (see Chapter 7, Protocol 13), and the
masses of even a small subset of tryptic peptides from a protein, or sequence information 
obtained from the fragmentation of selected peptides, usually provide sufficient information 
to allow unambiguous protein identification. 

The steps typically taken during an MS-based proteomics analysis are illustrated in 
Figure 8.1. Although the proteomic strategy outlined in Figure 8.1 is based on the character- 
ization of proteolytically derived peptides for subsequent protein identification (referred to 
as a “bottom-up” MS approach), direct MS/MS of intact protein molecular ions, which 
avoids the need to digest proteins prior to MS analysis, can also provide sufficient fragment 
masses for database searching (Mortz et al. 1996; Meng et al. 2001; Reid and McLuckey 2002;
also see the information panel on “TOP-DOWN” PROTEIN SEQUENCE ANALYSIS USING MS/MS). This 
latter approach to protein characterization is referred to as “top-down” protein characteriza- 
tion. More recently, a comprehensive method for proteome analysis that integrates both 
“bottom-up” and “top-down” approaches has been described (see VerBerkmoes et al. 2002). 


GENERAL PRINCIPLES 


Mass spectrometry, also called mass spectroscopy, is an instrumental approach that allows for 
the mass measurement of ions derived from molecules. Mass spectrometers are capable of 
forming, separating, and detecting molecular ions based on their mass-to-charge ratio (m/z).
(Note that, formerly, e and not z was used to denote the charge on an ion. Accordingly, many
earlier publications in the literature represent the mass-to-charge ratio of an analyte as m/e. 
Here, the term m/z is used to conform to IUPAC recommendations.) Mass spectrometers 
consist of a set of essential modular components (Figure 8.2): 


* Ionization source. The ionization source converts and transfers molecules (analytes) into
gas-phase ions. 


* Mass analyzer. This device is used for separating gas-phase ions. Ions of a particular m/z 
value (for a definition of m/z, see the panel on GLOSSARY OF MASS SPECTROMETRY TERMS
below) are separated one from another using physical properties such as electric or mag- 
netic fields. Typically, the physical field in the mass analyzer is changed as a function of 
time. The major types of analyzers are quadrupole analyzers, quadrupole ion-trap analyz- 
ers, Fourier transform-ion cyclotron resonance (FT-ICR) mass analyzers, and time-of- 
flight (TOF) analyzers. 


* Detector. Ions of a particular m/z value pass from the mass analyzer to strike the detector.
The magnitude of current produced at the detector as a function of time is used to determine
the intensity (abundance) and m/z value of the ion. In general, the relative abundances
of ions generated in an ionization source are measured in a detector, and the results
are displayed with m/z values as abscissae and ion abundances as ordinates. Mass spectra are
usually depicted in either a normalized or percentage relative abundance (%RA) format or
as a percentage of total ion current (% TIC). In a normalized MS spectrum, the largest peak
(i.e., the most abundant ion, which is not necessarily the analyte being studied) in a spec- 
trum is called the base peak and its height is adjusted to 100 units. The relative heights of 
all other peaks in the spectrum are normalized to this base peak and thus lie between 0 and 
100 units. 


* Data recorder/processor. 
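
The two display conventions described for the detector output (%RA and % TIC) can be illustrated with a short calculation. The sketch below is a generic example, not the method of any particular instrument software; the peak list is hypothetical.

```python
def relative_abundance(intensities):
    """Normalize intensities so that the base peak (largest value) equals 100."""
    base_peak = max(intensities)
    return [100.0 * value / base_peak for value in intensities]


def percent_tic(intensities):
    """Express each intensity as a percentage of the total ion current."""
    total = sum(intensities)
    return [100.0 * value / total for value in intensities]


if __name__ == "__main__":
    # Hypothetical peak list: (m/z, raw intensity in arbitrary units).
    peaks = [(500.3, 200.0), (650.8, 800.0), (842.5, 400.0), (1045.6, 600.0)]
    raw = [intensity for _, intensity in peaks]
    for (mz, _), ra, tic in zip(peaks, relative_abundance(raw), percent_tic(raw)):
        print(f"m/z {mz}: {ra:.1f} %RA, {tic:.1f} % TIC")
```

In the %RA format the base peak is always 100 and every other peak lies between 0 and 100, whereas the % TIC values sum to 100 across the spectrum.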
FIGURE 8.2. Modular components of a mass spectrometer. (Adapted, with permission, from www.asms.org
[Fig 1] the American Society for Mass Spectrometry [©1998 ASMS; What is Mass Spectrometry, 3rd edi- 
tion].) 


Mass spectrometers are able to produce two types of mass information: They can deter- 
mine both the mass of a molecule and the masses of product ions derived from the parent 
molecule following fragmentation, data that are especially valuable for determining the 
amino acid (or carbohydrate) sequence of biomolecules (see the information panel on FRAG- 
MENTATION MECHANISMS OF PROTONATED PEPTIDES IN THE GAS PHASE). 


FIGURE 8.1. Protein identification and characterization by mass spectrometry. Using global protein expression approaches, proteins 
are separated by 2D gel electrophoresis. Following visualization by Coomassie Brilliant Blue, fluorescence staining, or silver staining 
(for staining protocols, see Chapter 4), proteins of interest are then excised, alkylated, and subjected to in-gel proteolysis using
trypsin (see Chapter 7, Protocol 13). For differential expression proteomics, quantitation can be accomplished by first isotopically
labeling proteins, either in vivo or in vitro (see also Chapter 1). For cell-mapping proteomics or protein-interaction studies, proteins
of interest are obtained by (1) classical “pull-down” experiments using an antibody directed toward the protein of interest; (2) tag- 
ging a gene of interest with a small-peptide epitope that can be used to affinity purify the protein following transfection of the tagged 
gene into cells; or (3) isolation of macromolecular complexes (see Chapter 10). Separation of protein complexes is carried out by 
either 1D electrophoresis (Chapter 2) or, in the case of immunoaffinity approaches, by 2D electrophoresis (see Chapter 4). Visualized 
proteins of interest are processed using the same method as described for global profiling. A hierarchical MS approach is outlined, 
where tryptic digests are first analyzed in high-throughput using matrix-assisted laser desorption/ionization (MALDI)-quadrupole
time-of-flight (qTOF) instrumentation. This approach should yield many protein identifications by interrogation of the databases
using peptide ion masses and peptide fragment ion masses. For proteins that remain unidentified by this approach, tryptic digests
are subjected to low-throughput ESI methods. (Reproduced, with permission, from Simpson and Dorow 2001 [©Elsevier Science].)
AN HISTORICAL OVERVIEW OF MASS SPECTROMETRY


The fundamental physics behind modern mass spectrometry originated more than 100 years 
ago with the seminal work of J.J. Thomson (Cavendish Laboratory, University of 
Cambridge), which led to the discovery of the electron in 1897 during investigations of cath- 
ode rays. While Thomson was measuring the effects of electric and magnetic fields on ions 
generated by residual gases in cathode ray tubes, he noticed that the ions moved through par- 
abolic trajectories that were proportional to their “mass-to-charge” values. This observation 
led to the construction of the first mass spectrometer (then called a parabola spectrograph). 
In 1906, Thomson received the Nobel prize in Physics, “in recognition of the great merits of 
his theoretical and experimental investigations on the conduction of electricity by gases.” 

More sophisticated mass spectrometers were subsequently designed and constructed by 
Arthur J. Dempster (1918) and Francis W. Aston (1919) (Grayson 2002). Later, Alfred Nier 
incorporated these developments along with advances in vacuum technology and elec- 
tronics to greatly decrease the instrument's size. During the period from the late 1930s to 
the early 1970s, further accomplishments were made in the field of mass spectrometry. For 
example, in 1946, W.F. Stephens proposed the concept of time-of-flight (TOF) analyzers 
that separated molecular ions by measuring their velocities as they moved in a straight 
path toward a detector. In the mid 1950s, Wolfgang Paul developed the quadrupole mass 
analyzer, which separates molecular ions by use of an oscillating electrical field. Another 
Paul innovation was the quadrupole ion trap, which is a device specifically designed to trap 
and measure ions. The first ion trap became available commercially in 1983, and the 
quadrupole and quadrupole ion trap (along with TOF) are now the most widely used mass 
analyzers. For his innovative work with the quadrupole mass analyzer and the ion 
trap device, Paul was awarded the 1989 Nobel prize in Physics. 

Fast atom bombardment ionization (FAB), developed in the early 1980s (Barber et al. 
1981; Hunt et al. 1981), and liquid secondary ion mass spectrometry (LSIMS) were 
the first soft-ionization MS techniques that could be used to ionize large, nonvolatile, polar 
compounds such as proteins and peptides (Dell and Morris 1982; Barber et al. 1985; 
Desiderio and Katakuse 1983; Cottrell and Frank 1985; Hyver et al. 1985; Biemann et al. 
1986). In FAB, the analyte is dissolved in a nonvolatile liquid matrix, such as glycerol, and 
placed under vacuum in the instrument. The sample is bombarded with a stream of fast neu- 
tral atoms (usually formed by ionization/neutralization of argon or xenon) to induce a shock 
wave within the solution that ejects analyte ions already present in the solution into the gas- 
phase. In LSIMS, the sample is also dissolved in a liquid matrix. However, the sample is then 
bombarded with heavy ions, such as those formed from cesium, instead of neutral atoms, as 
in FAB. Ions of 10,000 daltons and above can be observed with these techniques. 

In 1988-1989, two other soft-ionization techniques were introduced that allowed the 
transfer of large biomolecules, such as proteins and peptides, into the gas phase: matrix-
assisted laser desorption/ionization (MALDI) (Karas and Hillenkamp 1988) and electrospray 
ionization (ESI) (Fenn et al. 1989). MALDI uses a pulsed laser to desorb analytes 
from a solid matrix containing a highly UV-absorbing substance, producing primarily 
singly charged ions. With ESI, a high-voltage potential is applied to a liquid as it passes 
through a small capillary. Ions are desorbed into the gas phase following evaporation of the 
droplet as it exits the capillary. A unique feature of electrospray, compared to other ionization 
techniques, is that both singly and multiply charged ions can be formed from a single 
analyte molecule population. The development of MALDI and ESI, along with the 
development of mass spectrometry instrumentation with improved performance characteristics 
(e.g., improved mass range and resolution) during the last 15 years, has been fundamental 
to the advancement of mass spectrometry as a core technology in proteomics. 

For an excellent historical perspective of mass spectrometry, see Grayson (2002). 
GLOSSARY OF MASS SPECTROMETRY TERMS 


* Mass-to-charge ratio (m/z): Mass spectrometers measure the mass-to-charge values of 
molecular ions. For peptides, these ions are typically formed by the addition of 
one proton (in the case of MALDI) or more protons (in the case of ESI). 
Thus, a peptide with a molecular mass of 2000 daltons or atomic mass units (amu) will 
have an m/z value of 2001 ([M+H]⁺) after ionization by the addition of one proton and 
an m/z value of 1001 with the addition of two protons ([M+2H]²⁺). 




* Resolution: Resolution can be defined as the ability to separate and measure the masses 
of ions of similar, but not identical, molecular mass. Mass resolution is generally 
expressed as the ratio $m/\Delta m$ and has historically been defined in two ways (Figure 8.3): 

1. The “10% valley” definition, in which a pair of adjacent ion peaks at masses $m_1$ and $m_2$ 
(in atomic mass units or daltons) are resolved so that a valley of 10% is observed 
between them, relative to the less abundant ion of the pair ($\Delta m$ is defined as 
the mass difference between $m_1$ and $m_2$). 

2. The “full-width, half-maximum” (FWHM) definition, where the width of an ion peak at 
mass $m$, at 50% relative abundance, is used to define $\Delta m$. 

Note that the resolution values determined using the FWHM definition are about 
twice those calculated using the 10% valley definition. For example, a resolution 
of 1000 obtained using the 10% valley definition will correspond to a resolution of 
2000 using the FWHM definition. Hence, it is essential to know what definition is 
being applied when comparing resolution values for different mass spectrometers. The 
influence of resolution on the ability to separate isotopic peaks in molecular ion clusters 
is illustrated for the [M+H]⁺ ions of angiotensin (~1283 daltons), the reduced A and B 
chains of bovine insulin (~2340 and ~3400 daltons, respectively), bovine insulin (~5734 
daltons), and horse heart myoglobin (~16,952 daltons) in Figures 8.4 and 8.5. An example 
of the utility of high resolution performance for the analysis of peptide mixtures is 
given in Figure 8.6, where the separation of two peptides of ~1846-dalton molecular 
mass, which differ in mass by only 0.26 dalton, is shown. 
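
The relationship between resolution and peak width can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function names are hypothetical, the factor of 2 is the approximate FWHM/10%-valley relationship quoted above, and treating two peaks as "separated" once the peak width equals their mass difference is a simplification, not a criterion taken from this chapter.

```python
def peak_width(mz: float, resolution: float) -> float:
    """Peak width (delta m) implied by resolution = m / delta m."""
    return mz / resolution


def fwhm_from_valley(valley_resolution: float) -> float:
    """Approximate FWHM resolution for a given 10% valley resolution."""
    return 2.0 * valley_resolution


def resolution_needed(mz_a: float, mz_b: float) -> float:
    """Resolution at which delta m equals the spacing of two peaks."""
    mean_mz = (mz_a + mz_b) / 2.0
    return mean_mz / abs(mz_b - mz_a)


if __name__ == "__main__":
    # Monoisotopic [M+H]+ values quoted for the two peptides of Figure 8.6.
    pep_a, pep_b = 1845.79, 1846.05
    print(f"resolution needed: {resolution_needed(pep_a, pep_b):.0f}")
    for r in (2000, 5000, 10000, 20000):
        print(f"R = {r:>6}: peak width = {peak_width(pep_a, r):.3f} dalton")
```

Under this simplification, the two peptides of Figure 8.6 begin to separate at a resolution of roughly 7000, which is consistent with the figure showing them merged at 2,000 and 5,000 and distinct at 10,000 and 20,000.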


* Monoisotopic mass versus average mass: It is important when expressing the molecular 
weight of a compound to appreciate how it was calculated. A common way of defining 
FIGURE 8.4. Resolution: The hypothetical [M+H]⁺ molecular ion clusters for angiotensin (molecular mass, 1283) (a), reduced 
insulin A chain (molecular mass, 2340) (b), and reduced insulin B chain (molecular mass, 3400) (c) are shown at various resolutions. 
Monoisotopic mass, peak top mass, and average mass are indicated by the dark blue, gray, and light blue lines, respectively. 
FIGURE 8.5. Resolution: The hypothetical [M+H]⁺ molecular ion clusters for bovine insulin (molecular mass 5734) (a), and horse 
heart myoglobin (molecular mass 16,952) (b) are shown at various resolutions. Monoisotopic mass, peak top mass, and average 
mass are indicated by the dark blue, gray, and light blue lines, respectively. 
molecular weight is monoisotopic mass. The monoisotopic mass of an element refers to 
the lightest stable isotope of that element. For example, the monoisotopic mass of carbon 
is 12.00000. (There are two major isotopes of carbon, namely, ¹²C and ¹³C, with 
masses of 12.00000 and 13.003355, with natural abundances of 98.92% and 1.08%, 
respectively.) Similarly, there are two major isotopes of nitrogen: ¹⁴N (14.003074; natural 
abundance, 99.63%) and ¹⁵N (15.000109; natural abundance, 0.37%). A list of common 
elements and their naturally occurring isotopic abundances is given in Table 8.1. 
The monoisotopic mass of an analyte is thus determined by summing the monoisotopic 
masses of each element present in the analyte. In practice, an ion comprises not only 
molecular species having the lightest isotopes of the elements present, but also a small 
percentage containing the heavier isotopes. The contribution of these heavier isotopes in 
a molecular ion cluster will clearly depend on the abundance-weighted sum of each element 
present in the analyte. Hence, in a mass spectrometer capable of resolving the constituent 
isotopes of an analyte ion, a molecular ion cluster will be observed (see Figures 
8.4 and 8.5). Another way of defining the molecular mass of an element is the chemical 
average mass, which is the sum of the abundance-weighted masses of all of its stable isotopes 
(e.g., 98.9% ¹²C and 1.1% ¹³C, to give the isotope-weighted average mass of 12.011 
for carbon). The average mass of an analyte is therefore determined by summing the 
chemical average masses of its component elements. Yet another mass term, the peak top 
mass, often referred to as the maximum mass, is simply the mass of the peak maximum 
of the unresolved molecular ion isotope cluster. An important consequence of the contribution 
of heavier isotope peaks in a molecular ion cluster relates to whether the investigator 
should use monoisotopic mass or the average mass when measuring and reporting 
molecular masses of analytes. Clearly, this decision will depend on both the mass of 
the analyte and the resolving capability of the mass spectrometer used to conduct the 
measurement. For instance, it can be seen in Figures 8.4 and 8.5 for peptides of mass 
~1750-1800 that peaks corresponding to the monoisotopic mass no longer represent the 
most abundant in the isotopic cluster, and that for analytes with masses greater than 
~8000, the monoisotopic mass has an insignificant contribution to the isotopic ion cluster 
envelope. 
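
The two mass definitions can be compared with a short Python sketch. It is a minimal illustration, assuming the isotope masses and abundances of Table 8.1 for five elements; the dictionary layout, the function names, and the choice of glycine (C2H5NO2) as a test compound are assumptions made here for illustration only.

```python
# (isotope mass, natural abundance in %) pairs copied from Table 8.1,
# listed from the lightest to the heaviest isotope of each element.
ISOTOPES = {
    "H": [(1.007825035, 99.985), (2.014101779, 0.015)],
    "C": [(12.000000000, 98.90), (13.003354826, 1.10)],
    "N": [(14.003074002, 99.63), (15.00010897, 0.37)],
    "O": [(15.99491463, 99.76), (16.9991312, 0.04), (17.9991603, 0.200)],
    "S": [(31.972070698, 95.02), (32.971458428, 0.75),
          (33.967866650, 4.21), (35.967080620, 0.02)],
}


def monoisotopic_mass(formula: dict) -> float:
    """Sum of the lightest-isotope masses of every atom in the formula."""
    return sum(n * ISOTOPES[el][0][0] for el, n in formula.items())


def average_mass(formula: dict) -> float:
    """Sum of the abundance-weighted (chemical average) element masses."""
    total = 0.0
    for el, n in formula.items():
        weight = sum(abundance for _, abundance in ISOTOPES[el])
        mean = sum(mass * abundance for mass, abundance in ISOTOPES[el]) / weight
        total += n * mean
    return total


if __name__ == "__main__":
    glycine = {"C": 2, "H": 5, "N": 1, "O": 2}  # illustrative compound
    print(f"carbon average mass:  {average_mass({'C': 1}):.4f}")
    print(f"glycine monoisotopic: {monoisotopic_mass(glycine):.5f}")
    print(f"glycine average:      {average_mass(glycine):.5f}")
```

For a molecule this small the two values differ by only a few hundredths of a dalton; the gap widens with the number of atoms, which is why the choice between them matters for peptides and proteins.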


* Mass accuracy: The accuracy of a mass measurement can be presented as a percentage 
of the measured mass (e.g., molecular mass = 1000 ± 0.01%) or as parts-per-million 
(e.g., molecular mass = 1000 ± 100 ppm) (Carr and Burlingame 1996). Clearly, as the 
mass of an analyte increases, the absolute mass error corresponding to the ppm or percent 
error will also increase proportionally. For example, a mass accuracy of 0.01% (or 
100 ppm) will correspond to a mass error of 0.1 dalton at m/z 1000, 0.5 dalton at m/z 
5000, and 5 daltons at m/z 50,000. Mass accuracy can be significantly affected by how 
data are treated. For example, in the mass range 1000 to ~5000, pronounced errors can 
be introduced if unresolved isotope clusters must be measured. As illustrated in Figures 
8.4 and 8.5, for the mass range 1000-15,000, an unresolved isotopic cluster will be asymmetric 
in shape due to the intensities of the underlying isotopic masses in the molecular 
ion clusters. In this situation, if the mass to be reported is the average mass, the centroid 
of the distribution should be determined and used instead of the measured peak 
top mass since these will not be the same in this mass range. Note that as the mass of the 
analyte increases, the peak profile becomes more symmetrical and the average mass and 
peak top mass become almost identical (see Figure 8.5b,c). In general, a resolution of at 
least several thousand (FWHM) is required for good mass measurement when working 
in the mass range of 1000 to ~5000 daltons. 
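
The conversion between relative and absolute mass error is simple arithmetic, sketched below in Python. The function names are hypothetical and are introduced here only for illustration.

```python
def percent_to_ppm(percent: float) -> float:
    """Convert a percentage error to parts-per-million (1% = 10,000 ppm)."""
    return percent * 1.0e4


def ppm_to_daltons(mz: float, ppm: float) -> float:
    """Absolute mass error (in daltons) for a relative error in ppm at a given m/z."""
    return mz * ppm / 1.0e6


def ppm_error(measured: float, theoretical: float) -> float:
    """Relative error of a measurement, in ppm of the theoretical value."""
    return (measured - theoretical) / theoretical * 1.0e6


if __name__ == "__main__":
    ppm = percent_to_ppm(0.01)
    for mz in (1000.0, 5000.0, 50000.0):
        print(f"m/z {mz:>7.0f}: {ppm:.0f} ppm = {ppm_to_daltons(mz, ppm):.1f} dalton")
```

Running the loop reproduces the figures quoted in the text: the same 100 ppm corresponds to an absolute error that grows in proportion to m/z.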


FIGURE 8.6. Resolution: Issues associated with the analysis of peptides in mixtures. The hypothetical mass spectra of two peptides 
(AEGWNFQDEHGEDRR, monoisotopic [M+H]⁺ ion m/z of 1845.79, and VSAYVKPMITHALPYR, monoisotopic [M+H]⁺ ion 
m/z of 1846.05), whose masses differ by 0.26 dalton, are shown at a resolution of 2,000 (a), 5,000 (b), 10,000 (c), and 20,000 (d). 


MODERN IONIZATION METHODS: HOW IONS ARE FORMED 


The ability to create gas-phase ions from nonvolatile, polar, or charged molecules is quintes- 
sential to the mass spectrometric analysis of all biomolecules. Ionization (the act of placing a 
charge on a neutral molecule), in general, can occur by either of two fundamental processes: 
the loss/gain of an electron or the loss/gain of a charged particle (e.g. a proton), generating 
odd or even electron ions, respectively. For a general review, see Johnstone and Rose (1996), 
and for ESI and MALDI details, see Gaskell (1997) and Busch (1995), respectively. 
Traditionally, the transfer of small organic molecules into the gas phase has been accom- 
plished by thermal vaporization followed by electron impact (EI) or chemical ionization (CI) 
(Harrison and Cotter 1990; Harrison 1992). The major obstacle in applying these approach- 
es to biological mass spectrometric analysis, however, was the difficulty of vaporizing such 
molecules into the mass spectrometer without extensive thermal decomposition (Daves 
1979). Throughout 1950-1970, several workers demonstrated that the polarity of di-, tri-, and 
tetrapeptides could be reduced by acylation, esterification, and reduction, or N- and O-permethylation 
(Biemann 1962, 1978; Morris 1972; Morris et al. 1973). By these means, small 
peptides could be introduced into the gas phase for sequence analysis. An excellent personal 
recollection from Klaus Biemann discusses these developments (Biemann 1995). 
TABLE 8.1. Common elements and their naturally occurring isotopic relative abundance values 

                                                Natural
Element           Isotope   Massᵃ               abundance (%)ᵇ
Hydrogen, H       ¹H          1.007 825 035      99.985
                  ²H          2.014 101 779       0.015
Carbon, C         ¹²C        12.000 000 000      98.90
                  ¹³C        13.003 354 826       1.10
Nitrogen, N       ¹⁴N        14.003 074 002      99.63
                  ¹⁵N        15.000 108 97        0.37
Oxygen, O         ¹⁶O        15.994 914 63       99.76
                  ¹⁷O        16.999 131 2         0.04
                  ¹⁸O        17.999 160 3         0.200
Fluorine, F       ¹⁹F        18.998 403 22      100
Sodium, Na        ²³Na       22.989 767 7       100
Silicon, Si       ²⁸Si       27.976 927 1        92.23
                  ²⁹Si       28.976 494 9         4.67
                  ³⁰Si       29.973 770 1         3.10
Phosphorus, P     ³¹P        30.973 762 0       100
Sulfur, S         ³²S        31.972 070 698      95.02
                  ³³S        32.971 458 428       0.75
                  ³⁴S        33.967 866 650       4.21
                  ³⁶S        35.967 080 620       0.02
Chlorine, Cl      ³⁵Cl       34.968 852 728      75.77
                  ³⁷Cl       36.965 902 619      24.23
Potassium, K      ³⁹K        38.963 707 4        93.2581
                  ⁴⁰K        39.963 999 2         0.012
                  ⁴¹K        40.961 825 4         6.7302
Bromine, Br       ⁷⁹Br       78.918 336 1        50.69
                  ⁸¹Br       80.916 289          49.31
Iodine, I         ¹²⁷I      126.904 473         100

Adapted from Burlingame and Carr (1996). 
ᵃ Wapstra et al. (1985). Standard errors are omitted from the present table. 
ᵇ Weast et al. (1989). 


In the early 1980s, fast atom bombardment (FAB) (Morris et al. 1981; Barber et al. 1982; 
Seifert and Caprioli 1996) provided a major step forward in the development of MS-based 
procedures for peptide and protein characterization. However, it was the commercial intro- 
duction of two new ionization methods after 1988 that made MS routinely available to 
researchers in the biological arena: MALDI, discovered by Karas and Hillenkamp (1988), and 
ESI, developed by Fenn and co-workers (1989). 


Matrix-assisted Laser Desorption/Ionization 


MALDI (Karas and Hillenkamp 1988; Beavis and Chait 1996) is performed by embedding 
analyte molecules in an excess of a specific wavelength-absorbing matrix (usually UV at 337 nm; 
typically, a 1:1000 molar ratio of analytes to matrix is employed), which is dried to produce 
a cocrystallized mixture. Typical matrices that are amenable to peptide and protein analysis 
include α-cyano-4-hydroxy-cinnamic acid and 3,5-dimethoxy-4-hydroxy-cinnamic (sinapinic) 
acid (see Table 8.8 in Protocol 1). The chemical structures of some commonly used 
matrices are shown in Figure 8.7. Ions are produced by bombarding the sample with short-duration 
(1-10 ns) pulses of UV light from a nitrogen laser. The interaction of the laser pulse 
with the sample results in ionization (usually protonation) of both matrix and analyte molecules 
via an energy transfer mechanism from the matrix to the embedded analyte, rather 
than by direct laser ionization. A high potential electric field (typically +30 kV) applied 
between the sample probe and an orifice is used to accelerate the ions into the mass analyzer 
(see Figure 8.8) (Beavis and Chait 1989a,b,c; Cohen and Chait 1996; Kussmann and 
Roepstorff 2000). MALDI primarily produces singly charged ions ([M+H]⁺ or [M-H]⁻). An 
interesting model to explain why only singly charged molecular ions survive the MALDI ionization 
process, which can produce multiply charged molecular ions under certain circumstances, 
has recently been proposed by Karas et al. (2000). 


Electrospray Ionization 


ESI (Fenn et al. 1989; Banks and Whitehouse 1996) is unique compared to other MS ioniza- 
tion techniques in that both singly and multiply charged ions may be formed. To form ions 
by electrospray, a fine spray of charged droplets is created from a solution introduced 
through a small-diameter needle in the presence of a strong electric field (typically 3-5 kV) 
(see Figure 8.9). The charged droplets are desolvated by the application of a countercurrent 
flow of gas and/or heat causing the droplet to evaporate. When the surface charge density of 
the droplet exceeds the Rayleigh stability limit, electrostatic repulsion greater than the surface 
tension of the droplet results in the explosion of the droplet and expulsion of gas-phase ions. 
The most striking feature of electrospray is that a family of multiple charge states of the ion 
can arise from a single precursor. The extent of multiple charging of an analyte ion is influ- 
enced by factors such as the composition and pH of the electrospray solvent, as well as the 
chemical nature of the analyte. For peptides (<2000 daltons), ESI typically generates singly or 
doubly charged ions, whereas for proteins (>2000 daltons), this ionization process usually 
gives rise to a series of multiply charged species ("charge envelope"). These multiple charge 
states can be used to provide independent verification of the mass or to provide access to dif- 
ferent reaction channels in tandem MS experiments through the selection of different charge 
states for fragmentation (see the information panel on FRAGMENTATION MECHANISMS OF PRO- 
TONATED PEPTIDES IN THE GAS PHASE). Importantly, this charging phenomenon has the effect of 
lowering the m/z values of the analyte to a range that can be readily measured by many dif- 
ferent types of mass analyzers. Thus, quadrupole and ion-trap mass analyzers, which gener- 
ally have an upper limit of 2,000-10,000 m/z, may be used to determine the mass of proteins 
of much larger molecular mass. For instance, Figure 8.10 shows a hypothetical ESI mass spec- 
trum of horse heart apomyoglobin (average molecular mass, 16,951.5 daltons). The molecu- 
lar ions observed in the spectrum correspond to that of apomyoglobin with a varying num- 
ber of protons attached. (Note that the charge envelope of a protein profoundly changes 
when the protein becomes unfolded [or denatured].) 

Using an instrument that is capable of high resolution (>10,000), it is often possible to 
determine the charge state of a particular ion in a mass spectrum as well as the molecular 
mass of the parent molecule (provided the molecular mass is somewhat less than the resolv- 
ing power of the instrument), by analyzing the isotopic cluster of an individual ion (see 
Figure 8.10b). Figure 8.11 shows the mass spectrum of a hypothetical charge state distribu- 
tion for a typical tryptic peptide (2484.95 daltons), whose predicted m/z values of the charge 
states can be calculated using the simple formula shown in the legend to Figure 8.11. 
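
Reading the charge state from an isotopic cluster can be sketched as follows. This is a minimal illustration, assuming that adjacent isotopic peaks differ by about 1 dalton in mass (so by about 1/z on the m/z scale) and that each charge is a proton of mass 1.008 daltons; the function names are hypothetical, and the example values are the isotope spacing and m/z quoted for the +12 charge state of myoglobin in the legend to Figure 8.10 and its information panel.

```python
PROTON_MASS = 1.008  # daltons; value used in the legend to Figure 8.11


def charge_from_isotope_spacing(spacing_mz: float) -> int:
    """Charge state z from the m/z spacing of adjacent isotopic peaks.

    Neighboring isotopic peaks differ by about 1 dalton in mass,
    so they are about 1/z apart on the m/z axis.
    """
    return round(1.0 / spacing_mz)


def neutral_mass(mz: float, z: int, proton: float = PROTON_MASS) -> float:
    """Molecular mass M of the neutral molecule for an [M+zH]z+ ion."""
    return z * (mz - proton)


if __name__ == "__main__":
    z = charge_from_isotope_spacing(0.08355)
    print(f"charge state: {z}")
    print(f"molecular mass: {neutral_mass(1413.63, z):.1f} daltons")
```

The approach only works when the instrument resolves the individual isotopic peaks, which is why the text ties it to high-resolution instruments.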

All commercial mass spectrometers that are equipped with ESI capability provide a 
deconvolution algorithm for mathematically processing the charge state and/or isotopic 
envelope in order to obtain a statistically averaged (and more accurate) molecular mass. 
Additional mathematical methods such as smoothing, background subtraction, and noise fil- 
tering options are also incorporated in most commercial algorithms (Bonner and Shushan 
1995; Horn et al. 2000b). 
FIGURE 8.7. Chemical structures of some commonly used matrices. 

[Figure panel labels: α-cyano-4-hydroxycinnamic acid (Figure 8.7); UV laser (λ 337 nm), MALDI plate, cocrystals of analyte and matrix, mass analyzer (Figure 8.8).] 


FIGURE 8.8. Schematic representation of the MALDI process. To enable MALDI, an analyte is first dissolved 
(or mixed) with an excess amount of matrix (which has a strong absorption at the laser wavelength) 
and then dried. Upon laser irradiation of the cocrystallized analyte/matrix sample, a plume of neutral molecules 
and ions is desorbed. By applying an electrostatic potential between the sample slide and a sampling 
orifice, the desorbed ions are then guided into the mass analyzer by a series of lenses. Because MALDI generates 
mainly singly charged ions (unlike ESI, which yields multiply charged ions), this MS approach is particularly 
suitable for high-throughput and for analysis of complex mixtures. 
FIGURE 8.9. Schematic representation of the ESI process. An electrospray is formed by the application of a 
high-voltage potential (3-5 kV) between a solution containing an analyte passing through a fine capillary, 
and a counter electrode situated ~0.5-1.0 cm from the needle. A spray of fine charged droplets that contain 
the analyte and solvent molecules is generated, which can be desolvated to yield gas-phase ions. These ions 
are then transferred into the mass analyzer. A unique feature of ESI is that highly multiply charged ions can 
be produced from these droplets, thereby allowing analytes such as proteins or large peptides to be readily 
analyzed by mass spectrometers with limited m/z range (e.g., 2000). 
FIGURE 8.10. Hypothetical ESI mass spectra of horse heart myoglobin (mass 16,951.5 daltons). (Panel a) Hypothetical multiple 
charge state distribution of myoglobin. A multiply charged ion series ranging from +9 (m/z 1884.48) to +27 (m/z 628.83) is shown. 
A simple method for calculation of the charge states of individual ions in the distribution, and determination of the mass of the 
protein, is described in the text. (Panel b) Hypothetical isotopic distribution of the [M+12H]¹²⁺ charge state of myoglobin at a resolution 
of 25,000. The mass difference of 0.08355 dalton between adjacent isotopic peaks readily allows determination of the 
charge state. 
CALCULATION OF THE CHARGE STATE OF A MULTIPLY CHARGED ION AND THE MASS OF ITS 
PARENT PROTEIN 


In general, the charge state of a multiply charged ion and the molecular mass of the protein 
from which it has been derived can be calculated as follows: $P = m/z$, where $P$ is a peak 
in a mass spectrum with mass-to-charge value $m/z$ and protonation (charge) state $z$. The 
charge state of an ion is defined as the number of charges a molecular ion carries such that 
a singly charged ion has a charge state of 1, a doubly charged ion has a charge state of 2, 
a triply charged ion has a charge state of 3, and so on. Two adjacent peaks in the ESI spectrum 
separated by a single charge can be denoted $P_1$ and $P_2$, corresponding to $(m/z)_1$ and 
$(m/z)_2$, respectively. Taking the mass of a proton as approximately 1 dalton, 

$$P_1 = \frac{M + z_1}{z_1}$$

$$P_2 = \frac{M + (z_1 - 1)}{z_1 - 1}$$

where $M$ is the molecular mass of the molecule and $z_1$ is the unknown charge of the ion at $P_1$. 
Thus, two successive ion peaks yield two equations and two unknowns, which can be solved to reveal the 
molecular mass of the analyte. For the example shown in Figure 8.10a for the multiple charge 
state distribution of horse heart apomyoglobin, if $P_1 = 1304.95$ and $P_2 = 1413.63$, then 

$$1304.95\, z_1 = M + z_1$$

$$1413.63\, (z_1 - 1) = M + (z_1 - 1)$$

Solving for $M$ and $z_1$ for the ion at m/z 1304.95 yields $M$ = 16,951.35 daltons and the charge 
state $z_1 = 13$. The ion at m/z 1413.63 therefore corresponds to a charge state of $z = 12$ and 
mass $M$ = 16,951.56 daltons. Typically, the experimental molecular masses observed for 
each pair of molecular ions in an ESI spectrum are averaged in order to obtain a more 
accurate value. In this example, this yields an average mass of 16,951.455 daltons. 
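
The two-equation calculation in this panel can be sketched in Python. This is a minimal illustration, assuming (as the equations do) a proton mass of 1 dalton; the function name and its return layout are hypothetical. Eliminating M between the two equations gives z1 = (P2 - 1)/(P2 - P1), which is then rounded to the nearest integer.

```python
def charge_and_mass(p1: float, p2: float, proton: float = 1.0):
    """Charge state and molecular mass from two adjacent ESI peaks.

    p1 is the peak at lower m/z (charge z1) and p2 the next peak at
    higher m/z (charge z1 - 1). Solving
        p1 * z1       = M + z1 * proton
        p2 * (z1 - 1) = M + (z1 - 1) * proton
    for z1 gives z1 = (p2 - proton) / (p2 - p1).
    Returns (z1, M from p1, M from p2).
    """
    z1 = round((p2 - proton) / (p2 - p1))
    mass_from_p1 = z1 * (p1 - proton)
    mass_from_p2 = (z1 - 1) * (p2 - proton)
    return z1, mass_from_p1, mass_from_p2


if __name__ == "__main__":
    # Apomyoglobin peaks used in the worked example above.
    z1, m1, m2 = charge_and_mass(1304.95, 1413.63)
    print(f"z1 = {z1}, M = {m1:.2f} and {m2:.2f} daltons")
    print(f"average M = {(m1 + m2) / 2:.3f} daltons")
```

Rounding to an integer charge before computing the mass is what makes the estimate robust to small errors in the measured peak positions.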
To calculate the m/z values of predicted charge states in a mass spectrum for any given peptide 
when the molecular weight is known: 

$$(m/z)_n = \frac{M + nX}{n}$$

where $M$ is the molecular mass of the peptide, $X$ is the mass of the cation added to the molecule 
(usually a proton, thus $X$ has a mass of 1.008 daltons), and $n$ is an integer representing 
the number of charges. 


FIGURE 8.11. (Panel a) Hypothetical ESI mass spectrum of a typical tryptic peptide with a molecular mass of 2484.95 daltons, acquired 
on a quadrupole ion-trap mass spectrometer. The m/z of the singly protonated ion ([M+H]⁺) of 2485.95 is beyond the mass range of 
the mass spectrometer (up to m/z 2000). However, the +2, +3, and +4 charge states of the peptide can be observed. (Panel b) Simple 
method for calculation of the m/z ratios of various charge states of a peptide of known molecular weight. For example, the m/z ratio 
of the +3 charge state of the peptide sequence shown in panel a can be readily calculated: m/z = [2484.95 + (3 × 1.008)]/3 = 829.32. 
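
The formula in this legend translates directly into code. The sketch below is illustrative only: the function name is hypothetical, and the m/z 2000 upper limit is the instrument range quoted in the legend for the quadrupole ion trap.

```python
def charge_state_mz(mass: float, n: int, cation_mass: float = 1.008) -> float:
    """m/z of the ion formed by adding n cations (protons by default) to a molecule."""
    return (mass + n * cation_mass) / n


if __name__ == "__main__":
    peptide_mass = 2484.95   # tryptic peptide of Figure 8.11
    mz_limit = 2000.0        # upper m/z limit quoted for the ion trap
    for n in range(1, 5):
        mz = charge_state_mz(peptide_mass, n)
        status = "observable" if mz <= mz_limit else "beyond m/z range"
        print(f"+{n}: m/z {mz:.2f} ({status})")
```

Listing the charge states this way shows at a glance why the singly protonated ion is missing from panel a while the +2, +3, and +4 ions are present.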




Often, the question arises: Which instrument should I buy? Although the ionization 
techniques are complementary, a choice must often be made. To assist in this decision-making 
process, Table 8.2 provides a summary of various advantages and disadvantages of 
MALDI and ESI. 


TABLE 8.2. Advantages and disadvantages of ESI-MS and MALDI-MS 


Advantages 

ESI 

* Practical mass limit ~70,000 daltons. 
* Good sensitivity. Femtomole to low picomole sensitivity is typical. 
* Softest ionization. Capable of observing biologically native noncovalent interactions. 
* Easily adaptable to microbore liquid chromatography. Capable of directly analyzing LC 
  effluent at a flow rate of 50 nl/min. 
* No matrix interference. 
* Easy adaptability to triple quadrupole analysis, conducive to structural analysis (for 
  various scan modes, see Figure 8.12). 
* Multiple charging, allowing for the analysis of high-mass ions with a relatively 
  low-m/z range instrument. 
* Multiple charging, giving better mass accuracy through averaging. 
* Excellent for determining peptide modifications. 
* Sample preparation: Samples can be analyzed directly from salt-free solution or via 
  RP-HPLC (coupled on line). 
* MS/MS capabilities (e.g., peptide sequencing) are excellent. 

MALDI 

* Practical mass limit ~300,000 daltons. Species of much greater mass have been reported. 
* Typical sensitivity on the order of low femtomole to low picomole. Reports have indicated 
  that attomole sensitivity is possible. 
* Soft ionization with little to no fragmentation observed. 
* Suitable for the analysis of complex mixtures. 
* Sample preparation: Samples are added directly to appropriate matrix. 


Disadvantages 

ESI 

* Low salt tolerance. Typically, samples are desalted prior to analysis using RP-HPLC, 
  either “on-line” (see Protocol 6) or “off-line” (see Protocol 7) (see also Chapter 5). 
* Difficulty in cleaning overly contaminated instrument due to high sensitivity for 
  certain compounds. 
* Low tolerance for mixtures. Simultaneous mixture analysis can be poor. The purity of 
  the sample is important. To overcome this problem, samples are generally introduced 
  into the instrument via on-line RP-HPLC (see Protocol 5). 
* Multiple charging, which can be confusing, especially with mixture analysis. 
* Quantitation requires internal standard. 

MALDI 

* Generally low resolution. Some MALDI instruments are capable of high resolution; 
  however, this is only in a relatively low-mass range and is accomplished at the 
  expense of sensitivity. 
* Matrix background, which can be a problem for compounds below a mass of 1000 daltons. 
  This background interference is highly dependent on the matrix material. For a list 
  of commonly used matrices, see Table 8.8. 
* MS/MS capability is minimal unless the instrument is equipped with a reflectron 
  device (post-source decay). 
* Possibility of photodegradation by laser desorption/ionization. 
* Not possible to study noncovalent interactions. 
* Quantitation requires internal standard. 
* Poor for determining peptide modifications (especially linear mode MALDI instruments; 
  however, MALDI instruments with reflectron capability are good for determining some 
  peptide modifications, e.g., phosphorylation). 
* Low tolerance for various excipients (e.g., phosphate buffers, salts >150 mM); see 
  Table 8.4. 
TANDEM MASS SPECTROMETRY 


The revolution in biological mass spectrometry has its origins with the development in the 
1980s of ionization techniques suited to biomolecular analysis. As these “soft ionization methods” 
produce pseudomolecular ions (e.g., protonated [M+H]⁺ ions) with little or no fragmentation, 
renewed interest has occurred in the use of tandem mass spectrometry (MS/MS) 
methods to induce fragmentation and thereby obtain structural information. The mass analyzer 
types that are commonly configured for MS/MS experiments are shown in Figures 8.12 
through 8.17. 

In broad terms, MS/MS can be described as a series of events consisting of mass selection 
of a precursor ion in a first stage of analysis, an intermediate reaction event, followed by analysis 
of the product ions in a second stage of analysis (Busch et al. 1988; de Hoffmann 1996). (A 
reaction event is defined as an event that converts a precursor ion to a product ion having a 
different m/z value.) With the development of multistage tandem mass spectrometric (MSⁿ) 
experiments (Busch et al. 1988; McLuckey et al. 1995), the structure of product ions can be 
further interrogated using additional stages of MS. Note that although the term “tandem mass 
spectrometry” has traditionally been used to describe reaction events involving energetic dissociation 
of the mass-selected ion, many alternative reaction types that also fit the definition 
of a tandem mass spectrometric reaction sequence are commonly used to obtain information 
about the structure and reactivity of ions in the gas phase. Thus, an MS/MS (or MSⁿ) experiment 
may also include, for example, ion-molecule, ion-ion, and ion-mobility reaction events. 

In the context of proteomics, the major type of intermediate reaction event employed in 
an MS/MS experiment involves energetic dissociation of the ion to cause fragmentation, and 
thereby derives structural information relating to the sequence of the peptide. (Note that 
from here on, the term “product ion” is used interchangeably with the more technically correct 
“fragment ion.”) Many methods have been used to deposit energy into an otherwise stable 
mass-selected ion to induce fragmentation (Busch et al. 1988; McLuckey and Goeringer 
1997). These include: 

* Collisions with an inert gas (known as collision-induced dissociation [CID] or collisionally 
  activated dissociation [CAD]) (Hayes and Gross 1990; McLuckey 1992). 

* Collisions with a surface (known as surface-induced dissociation or SID) (Dongre et al. 1996). 

* Interaction with photons (e.g., via a laser) resulting in photodissociation (Bowers et al. 
  1984; Little et al. 1994). 

* Thermal/black body infrared radiative dissociation (BIRD) (Price and Williams 1997). 

* Electron-capture dissociation (ECD), whereby multiply charged cations are allowed to 
  interact with low-energy electrons in an FT-ICR mass spectrometer (Zubarev et al. 1998). 


By far the most widely used technique for performing energetic dissociation is CID. The 
process of CID involves collisional activation of the ion, by energetic collisions with a neutral 
target gas, to convert translational energy into internal energy, thus placing the ion into an 
activated (excited) state, followed by unimolecular dissociation of the activated ion to yield 
products (McLuckey 1992). 
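
The conversion of translational energy into internal energy described above can be put in numbers. For a single collision with a stationary target gas, the energy available for conversion is bounded by the center-of-mass collision energy, E_com = E_lab x m_gas / (m_gas + m_ion). The following is a minimal sketch under that single-collision assumption; the function name and the example ion mass and laboratory energy are illustrative choices and do not come from the text.

```python
def center_of_mass_energy(e_lab_ev, ion_mass_da, gas_mass_da):
    """Upper limit (eV) on the energy convertible to internal energy
    in one collision between a moving ion and a stationary gas atom."""
    if ion_mass_da <= 0 or gas_mass_da <= 0:
        raise ValueError("masses must be positive")
    return e_lab_ev * gas_mass_da / (gas_mass_da + ion_mass_da)


ARGON_DA = 39.948    # average atomic mass of argon
HELIUM_DA = 4.0026   # average atomic mass of helium

# Illustrative values (assumed): a 1000-Da peptide ion at 30 eV laboratory energy.
e_argon = center_of_mass_energy(30.0, 1000.0, ARGON_DA)
e_helium = center_of_mass_energy(30.0, 1000.0, HELIUM_DA)
print(f"argon target:  {e_argon:.2f} eV per collision")
print(f"helium target: {e_helium:.2f} eV per collision")
```

Because only a small fraction of the laboratory energy is available per collision for a large peptide ion, low-energy CID generally relies on many sequential collisions, and a heavier target gas deposits more energy per collision than a lighter one.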

Both low-energy (<100 eV) (Hunt et al. 1986) and high-energy (keV) (Biemann 1990)
collisional activation processes can be observed, depending on the instrumentation used. A
well-defined nomenclature scheme (which is summarized in Table 8.3) has been proposed
for the product ions arising from the fragmentation of protonated peptides. Product ions
resulting from backbone cleavage of the αC-C, the C-N amide linkage, or the N-αC bond are
called a-, b-, and c-type ions, respectively, if the charge is retained on the amino-terminal
fragment, or x-, y-, and z-type ions, respectively, if the charge is retained on the carboxy-ter-
minal fragment. The product ions are numbered according to their position from their
respective terminal end. Under high-energy CID conditions, d-, v-, and w-type ions corre-
sponding to side-chain cleavages may also be formed. In general, the most commonly
observed product ions resulting from low-energy CID belong to the b- and y-type ion series.
These ions can be regarded as being related to each other in that they both arise via cleavage
of the amide bond (for a review of peptide fragmentation, see the information panel on FRAG-
MENTATION MECHANISMS OF PROTONATED PEPTIDES IN THE GAS PHASE).


FIGURE 8.12. Schematic representation of triple quadrupole MS/MS scan modes. The product ion scan: An
ion of interest is mass-selected in the first mass analyzer (Q1) and is transferred to the collision cell (Q2)
where collision-induced dissociation (CID) takes place. The resultant product ions are then separated by
scanning the second mass analyzer (Q3). The observed signal is therefore the result of the mass-analyzed
product ions derived from a mass-selected precursor ion. The precursor ion scan: The first mass analyzer is
scanned to sequentially transmit all ions into the collision cell for fragmentation. The second mass analyz-
er is set to only transmit a single m/z product ion of interest to the detector. Q3 and Q1 are linked so that
the precursor mass in Q1 that gives rise to a product ion signal in Q3 may be determined upon detection of
any product ion signal. The mass spectrum obtained therefore reveals all precursor ions that fragment to
yield a common product ion. The neutral loss scan: Mass analysis in both Q1 and Q3 mass analyzers is per-
formed where the scan range of Q3 is linked to Q1 by a constant mass difference. As for the other scan
modes, fragmentation of precursor ions occurs in the collision cell region. In this scan mode, the mass spec-
trum obtained reveals those precursor ions that fragment via a constant neutral loss (or gain).


TABLE 8.3. Nomenclature for product ions arising from the fragmentation of protonated peptides

                               Product ion nomenclature
Site of backbone cleavage      If charge is retained on      If charge is retained on
                               amino-terminal fragment       carboxy-terminal fragment
αC-C bond                      a                             x
C-N amide bond                 b                             y
N-αC bond                      c                             z
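
To make the b- and y-type entries of Table 8.3 concrete, the following sketch computes singly protonated b- and y-ion m/z values for a peptide sequence from standard monoisotopic residue masses. It is a minimal illustration, assuming unmodified residues and a charge of +1; the function name and the example sequence PEPTIDE are not from the text.

```python
# Monoisotopic residue masses (Da) of the 20 common amino acids.
RESIDUE_MASS = {
    "G": 57.02146, "A": 71.03711, "S": 87.03203, "P": 97.05276,
    "V": 99.06841, "T": 101.04768, "C": 103.00919, "L": 113.08406,
    "I": 113.08406, "N": 114.04293, "D": 115.02694, "Q": 128.05858,
    "K": 128.09496, "E": 129.04259, "M": 131.04049, "H": 137.05891,
    "F": 147.06841, "R": 156.10111, "Y": 163.06333, "W": 186.07931,
}
PROTON = 1.007276   # mass of a proton (Da)
WATER = 18.010565   # monoisotopic mass of H2O (Da)


def b_y_ions(peptide):
    """Return ({i: m/z of b_i}, {i: m/z of y_i}) for singly charged ions."""
    masses = [RESIDUE_MASS[aa] for aa in peptide]
    n = len(masses)
    b_ions, y_ions = {}, {}
    for i in range(1, n):
        # b_i: first i residues from the amino terminus, plus one proton.
        b_ions[i] = sum(masses[:i]) + PROTON
        # y_i: last i residues from the carboxyl terminus, plus water and a proton.
        y_ions[i] = sum(masses[n - i:]) + WATER + PROTON
    return b_ions, y_ions


b, y = b_y_ions("PEPTIDE")
for i in sorted(b):
    print(f"b{i} = {b[i]:.4f}    y{i} = {y[i]:.4f}")
```

Because a b ion and its complementary y ion together account for the whole peptide, b_i + y_(n-i) equals the [M+H]+ value plus one proton mass, which is a convenient internal check when assigning a spectrum. Leucine and isoleucine share the same residue mass here, which is why they cannot be distinguished from b- and y-type ions alone.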
MASS ANALYZER CONFIGURATIONS USED IN TANDEM MASS SPECTROMETRY


Critical to the development of techniques for probing the gas-phase structure of biomole-
cules have been advances in the development of mass analyzers that not only are more versa-
tile, but offer ever-increasing levels of sensitivity (i.e., decreasing detection limits) (Brunnee
1987; Jennings and Dolnikowski 1990). Mass analyzers that are applicable to MS/MS fall into
two basic categories: tandem-in-space and tandem-in-time (Busch et al. 1988).


* Tandem-in-space mass spectrometers have discrete mass analyzers for each stage of mass
spectrometry. Examples include double-focusing-sector analyzers, hybrid combinations of 
sector and quadrupole analyzer instruments, time-of-flight, and triple quadrupole instru- 
ments. 


* Tandem-in-time mass spectrometers have only one mass analyzer. Each stage of mass spec- 
trometry takes place in the same region, but is separated in time by a sequence of events. 
Examples of tandem-in-time mass spectrometers include FT-ICR and quadrupole ion-trap 
mass spectrometers. 


Tandem-in-Space Mass Spectrometers 
Triple quadrupole 


The triple-quadrupole mass spectrometer (Yost and Boyd 1990; Wysocki 1992) consists of
three quadrupoles connected in series (see Figure 8.12). Both the first (Q1) and third (Q3)
quadrupoles have a combination of DC (variable voltage) and RF (constant frequency, vari- 
able amplitude) applied to the four rods, but adjacent rods have opposite polarity. If the mag- 
nitudes of the DC and RF voltages are increased while maintaining a constant RF-DC ratio, 
stable trajectories are created for ions of sequentially increasing m/z. Thus, a mass range may 
be “scanned” in order to obtain a spectrum. For MS-only scans, a fixed RF-only voltage is 
applied to the second (Q2) and third (Q3) quadrupoles, resulting in the transmission of a 
wide m/z range of ions that occurs simultaneously through these sections of the instrument. 

For simple MS/MS experiments, the RF applied to Q1 is maintained at a frequency cor- 
responding to the stable trajectory of only one ion, enabling it to be selectively isolated. All 
other ions experience unstable trajectories and are lost. CID occurs in the second quadrupole 
region, where an offset DC voltage is applied to ions entering the collision cell (typically an 
enclosed octopole in commercial instruments), so that the ions experience a net potential,
and hence increase in kinetic energy, resulting in energetic collisions with a heavy back- 
ground gas of argon, and subsequent fragmentation. Ions exiting Q2 then pass through 
quadrupole Q3, which is scanned (in the same manner as described in the preceding para- 
graph) in order to analyze the product ion population and obtain the mass spectrum. To date, 
only ESI sources are available for commercial triple-quadrupole instruments. 

In addition to the product ion scan event described above, two other scan modes are com-
monly performed on triple-quadrupole instruments. In a precursor ion scan, Q1 can be
scanned and, following a reaction event in Q2, detection of a single product ion can be per-
formed in Q3. As long as Q3 and Q1 are linked, the precursor mass in Q1 that gives rise to a
product ion signal in Q3 may be determined. Alternatively, both Q1 and Q3 may be scanned
at the same time, where Q3 is offset by a constant value below the mass of Q1. If Q3 and Q1
are linked, after a reaction event in Q2, a neutral loss scan may be obtained. Similarly, if Q3 is
offset by a constant value above the mass of Q1, and the other conditions hold, then a neu-
tral gain scan may be obtained. For further information concerning the use of scan modes for
detecting specific diagnostic fragment ions, see the panel below on USE OF SCAN MODES FOR
DETECTING PEPTIDE IONS THAT PRODUCE DIAGNOSTIC FRAGMENT IONS UPON CID and Figure 8.12.


USE OF SCAN MODES FOR DETECTING PEPTIDE IONS THAT PRODUCE DIAGNOSTIC FRAGMENT
IONS UPON CID


A salient feature of triple-quadrupole mass spectrometers is the highly sensitive scan mode 
feature (see Figure 8.12). For example, it is possible to use the precursor ion scan (also 
known as a parent ion scan) to detect in a mixture of peptides only those precursors that 
efficiently produce a specific product ion upon dissociation of the parent ion commonly 
performed in the collision cell. Using this approach, the first quadrupole (Q1) is used for 
scanning and the third quadrupole (Q3) is fixed at the m/z of a specific diagnostic prod-
uct ion (e.g., for phosphorylation, at m/z 79, the mass of the PO₃⁻ ion; for detection of cys-
teine residues that have been S-alkylated using the alkylating reagent 4-vinylpyridine
[Moritz et al. 1996], at m/z 106, the mass of the S-pyridylethyl ion). If these fragment ions
are produced in the collision cell (Q2), they are transmitted to the detector. Thus, when the
detector registers a signal, the m/z recorded is not that of Q3 (which is fixed at the m/z of the
specific diagnostic fragment ion, and is always the same) but that of Q1, which has just
transmitted the intact S-pyridylethylated peptide ion to Q2 and Q3. This task is accom-
plished by coupling Q1 data to Q3. Hence, only when an S-alkylated peptide ion is trans-
mitted through Q1 is the m/z 106 product ion produced in Q2 and a signal recorded at the
detector. This scan mode can be used for any structure that efficiently produces a specific 
diagnostic product ion. 
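
The three triple-quadrupole scan modes differ only in how the Q1 and Q3 set points are linked. The sketch below lists the (Q1, Q3) m/z pairs that an instrument would step through in each mode. It is a simplified illustration, not instrument control code: the function names and step size are hypothetical, and the neutral loss offset assumes that the product ion keeps the charge of the precursor.

```python
def _steps(start, stop, step):
    """Evenly spaced m/z set points from start to stop inclusive."""
    n = int(round((stop - start) / step)) + 1
    return [start + i * step for i in range(n)]


def product_ion_scan(precursor_mz, q3_start, q3_stop, step=1.0):
    """Q1 is fixed on the precursor; Q3 is scanned."""
    return [(precursor_mz, q3) for q3 in _steps(q3_start, q3_stop, step)]


def precursor_ion_scan(product_mz, q1_start, q1_stop, step=1.0):
    """Q1 is scanned; Q3 is fixed on the diagnostic product ion."""
    return [(q1, product_mz) for q1 in _steps(q1_start, q1_stop, step)]


def neutral_loss_scan(neutral_mass, q1_start, q1_stop, step=1.0, charge=1):
    """Q1 and Q3 are scanned together; Q3 trails Q1 by a constant offset.
    A negative neutral_mass gives a neutral gain scan."""
    offset = neutral_mass / charge
    return [(q1, q1 - offset) for q1 in _steps(q1_start, q1_stop, step)]


# Precursor ion scan for the m/z 106 S-pyridylethyl marker ion named in the text.
pairs_106 = precursor_ion_scan(106.0, 400.0, 402.0)
# Neutral loss scan; the 98-Da loss and 2+ charge are assumed example values.
pairs_nl = neutral_loss_scan(98.0, 500.0, 501.0, charge=2)
print(pairs_106)
print(pairs_nl)
```

In the precursor ion and neutral loss modes, a detector signal is plotted against the Q1 value of the pair that was active when it was recorded, which is the coupling of Q1 data to Q3 described above.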


Time of flight 


This mass analyzer measures the m/z values of analytes by pulsing molecular ions from the 
ionization source into a flight tube. The m/z values are calculated from the time it takes ions to
travel a set distance and strike a detector (i.e., their time of flight). A method of pep-
tide sequencing by MALDI-TOF-MS was described by Spengler et al. (1991, 1992). The
method is based on metastable decay of laser-desorbed ions, which occurs in the first field- 
free drift region of a reflectron time-of-flight (TOF) mass spectrometer. Also referred to as 
post-source decay (PSD), this method can be used to obtain structural information in pep- 
tides (see Figure 8.13). 
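
The relationship between flight time and m/z follows from equating the kinetic energy gained on acceleration (zeV) with (1/2)mv². The sketch below assumes an idealized linear TOF analyzer in which ions start at rest, gain all of their energy before entering the field-free tube, and spend negligible time in the acceleration region. The accelerating voltage and tube length in the example are assumed values, not taken from the text.

```python
import math

E_CHARGE = 1.602176634e-19   # elementary charge (C)
DALTON = 1.66053906660e-27   # unified atomic mass unit (kg)


def flight_time_s(mz, accel_voltage_v, tube_length_m):
    """Flight time (s) through a field-free tube for an ion of the given m/z."""
    mass_per_charge = mz * DALTON / E_CHARGE            # kg per coulomb
    velocity = math.sqrt(2.0 * accel_voltage_v / mass_per_charge)
    return tube_length_m / velocity


def mz_from_flight_time(t_s, accel_voltage_v, tube_length_m):
    """Invert the relationship: m/z from a measured flight time."""
    return 2.0 * accel_voltage_v * (t_s / tube_length_m) ** 2 * E_CHARGE / DALTON


# Assumed example: m/z 1000 ion, 20-kV acceleration, 1-m flight tube.
t = flight_time_s(1000.0, 20000.0, 1.0)
print(f"flight time: {t * 1e6:.2f} microseconds")
print(f"recovered m/z: {mz_from_flight_time(t, 20000.0, 1.0):.2f}")
```

Flight time scales with the square root of m/z, so an ion of four times the m/z arrives in twice the time. In practice, instruments are calibrated against standards of known m/z rather than relying on the ideal equation.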


Quadrupole time of flight 


Historically, TOF mass analyzers had been exclusively coupled with MALDI, due to the com-
plementary pulsed nature of both the ion source and mass analyzer. However, the develop-
ment of pulsed orthogonal ion injection into TOF mass analyzers has now allowed ESI
sources to be coupled with TOF for high-resolution MS and MS/MS experiments (Morris et
al. 1996). The operation of this type of tandem mass spectrometer is essentially the same as
that for a triple-quadrupole mass spectrometer, except that product ions exiting Q2 follow-
ing CID are orthogonally pulsed into a TOF mass analyzer to acquire the mass spectrum (see
Figure 8.14).


FIGURE 8.13. Schematic representation of post-source decay (PSD) MS/MS via reflectron time-of-flight
mass spectrometry.

FIGURE 8.14. Schematic representation of CID MS/MS in an
orthogonal injection quadrupole time-of-flight (QTOF) MS/MS.

Both ESI and MALDI sources have been successfully coupled to quadrupole TOF (qTOF) 
instruments (Loboda et al. 2000; Shevchenko et al. 2000, 2001; Wattenberg et al. 2002; 
Baldwin et al. 2001). The qTOF mass spectrometer was developed in order to take advantage 
of the high-performance resolution capabilities of the TOF mass analyzer for product ion 
assignment, together with the ability to obtain efficient precursor ion selection and dissocia- 
tion using the quadrupole mass filter. This instrument therefore overcomes many of the lim- 
itations of triple-quadrupole (poor resolution and low sensitivity due to poor duty cycles) 
and TOF-based (poor product ion formation via post-source decay) MS/MS methods. Recent
applications of this type of instrumentation have included studies in phosphopeptide map-
ping (Bateman et al. 2002) and quantitative proteomics (Griffin et al. 2001b).

FIGURE 8.15. Schematic representation of CID-MS/MS by TOF/TOF-MS. The MALDI ion source comprises a sample plate, laser
source, and a set of focusing lenses that direct the ions to the TOF mass analyzer.

A MALDI-TOF/TOF high-resolution tandem mass spectrometer has recently been intro-
duced for high-throughput, high-sensitivity peptide sequencing applications (Medzihradszky
et al. 2000; Bienvenut et al. 2002). Ions desorbed from the MALDI target are accelerated
to 3 keV kinetic energies through the first TOF region. Precursor ion selection is then per- 
formed by a timed ion gate prior to CID in a gas-filled collision cell. The high kinetic energy 
of the ions entering the collision cell allows high-energy CID processes to be observed. Thus, 
a wider range of product ion types can be observed compared to those formed under low- 
energy CID conditions, including d-, v-, and w-type ions resulting from side-chain cleavages, 
thereby allowing the isobaric amino acids isoleucine and leucine to be differentiated. Mass 
analysis is performed in the second TOF mass analyzer as indicated in Figure 8.15. 


Tandem-in-Time Mass Spectrometers 
Quadrupole ion trap 


The quadrupole ion trap, originally developed by Paul (March 1997, 1998; Schwartz and
Jardine 2000), is essentially a three-dimensional quadrupole, consisting of a hyperbolic ring
electrode and two hyperbolic end-cap electrodes. Small holes in the end-cap electrodes allow
the passage of ions into and out of the trap. A high-voltage RF potential is applied to the ring
electrode while the end caps are held at ground. The oscillating potential difference estab-
lished between the ring and end-cap electrodes forms a substantially quadrupolar field
(Figure 8.16). Ions injected into the trap above a certain m/z value, determined by the ampli-
tude of the ring electrode RF, have stable trajectories. Collisions of the injected ions with heli-
um (present at ~10⁻³ torr) cause "damping" of their trajectories so that they can be trapped
by the RF field into the center of the trap. To obtain a mass spectrum, the ring electrode RF 
voltage amplitude is increased while applying a supplementary RF voltage to the end caps, 
sequentially bringing ions of increasing m/z into resonance with the applied frequency, 
whereupon they become unstable and their velocity and trajectory increase in the axial direc- 
tion until they are ejected from the trap and detected. 
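
The dependence of the lowest stable m/z on the ring electrode RF amplitude can be estimated from the Mathieu stability parameter for an ion trap with grounded end caps, q_z = 8zeV / [m(r0² + 2z0²)Ω²], with trajectories becoming unstable near q_z = 0.908. The sketch below solves this for the low-mass cutoff. The trap dimensions, drive frequency, and RF amplitude are assumed illustrative values, not specifications of any instrument described in the text.

```python
import math

E_CHARGE = 1.602176634e-19   # elementary charge (C)
DALTON = 1.66053906660e-27   # unified atomic mass unit (kg)


def low_mass_cutoff(rf_amplitude_v, rf_freq_hz, r0_m, z0_m, q_eject=0.908):
    """Lowest m/z with a stable trajectory at a given RF amplitude
    (zero-to-peak volts), from the Mathieu q_z stability boundary."""
    omega = 2.0 * math.pi * rf_freq_hz                   # angular drive frequency
    geometry = r0_m ** 2 + 2.0 * z0_m ** 2               # trap size term (m^2)
    mass_per_charge = 8.0 * rf_amplitude_v / (q_eject * geometry * omega ** 2)
    return mass_per_charge * E_CHARGE / DALTON           # convert kg/C to m/z


# Assumed geometry: r0 = 1.0 cm, z0 = 0.707 cm, 1.1-MHz drive frequency.
cutoff_1kv = low_mass_cutoff(1000.0, 1.1e6, 0.010, 0.00707)
cutoff_2kv = low_mass_cutoff(2000.0, 1.1e6, 0.010, 0.00707)
print(f"cutoff at 1000 V: m/z {cutoff_1kv:.1f}")
print(f"cutoff at 2000 V: m/z {cutoff_2kv:.1f}")
```

The cutoff rises linearly with RF amplitude, which is why ramping the ring electrode RF ejects ions in order of increasing m/z.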

FIGURE 8.16. Schematic representation of CID MS/MS in a quadrupole ion-trap tandem mass spectrom-
eter. (a) Precursor ion accumulation: Ions injected from either ESI or MALDI external ion sources collide
with a background gas of helium and are trapped in the center of the ion trap. (b) Precursor ion isolation:
Ions at the m/z of interest may be isolated by sequentially ejecting all other ions higher and lower in m/z by
increasing the amplitude of the ring electrode RF while applying a supplementary RF voltage to the end
caps. (c) Collision-induced dissociation (CID): CID is achieved by increasing the motion of the isolated ion
in the axial direction via the application of an RF resonance excitation voltage to the end-cap electrodes to
enable energetic collisions with the background helium. (d) Collisional cooling: Product ions are cooled and
trapped by collisions with the background helium gas. (e) Mass analysis: The resultant product ions are
sequentially ejected from the trap in order of increasing m/z to acquire a product ion spectrum. Note that
by cycling back to Step b from Step d, subsequent product ion isolation and further CID reaction events can
be performed to enable further interrogation of first-generation product ions (i.e., an MSⁿ experiment).


For MS/MS analysis, precursor ions of a selected m/z may be isolated by increasing the
amplitude of the ring electrode RF to the point that all ions below the mass of the selected
ion are ejected. During this "RF ramp process," ions above the precursor mass are also eject-
ed by applying a supplementary RF voltage to the end caps, whose frequency and amplitude
are set to correspond to ions just above the mass of the precursor. Thus, as the ring electrode
RF amplitude is increased, these ions of higher mass are successively brought into resonance
with the supplementary voltage and ejected until only the precursor ion remains. CID of the
selected precursor ion is generally accomplished by increasing the motion of the ion in the
axial direction by applying an RF resonance excitation voltage to the end-cap electrodes, at a
frequency commensurate with the ion of interest, to enable energetic collisions with the back-
ground helium. The resultant product ion population is sequentially scanned out of the trap
in order of increasing m/z to acquire a product ion spectrum. MSⁿ experiments can easily be
performed by adding subsequent ion isolation and reaction event steps to the experimental
sequence following the initial MS/MS reaction. Note that whereas only product ion scans can
be performed in an ion-trap mass spectrometer, precursor and neutral loss scan spectra can
be constructed by post-acquisition data processing following the acquisition of full-product
ion scan mode spectra. Both ESI and MALDI ion sources are now available on commercial
ion-trap instruments.
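
The post-acquisition construction of precursor and neutral loss scan data from stored product ion spectra can be sketched as a simple filter over the acquired spectra. The data layout (a list of dictionaries), the function names, the m/z tolerance, and the toy spectra below are all assumptions made for illustration; they do not represent any vendor's file format or software.

```python
def has_peak(peaks, target_mz, tol):
    """True if any (m/z, intensity) peak lies within tol of target_mz."""
    return any(abs(mz - target_mz) <= tol for mz, _ in peaks)


def reconstruct_precursor_scan(spectra, product_mz, tol=0.5):
    """Precursor m/z values whose product ion spectra contain product_mz."""
    return [s["precursor_mz"] for s in spectra
            if has_peak(s["peaks"], product_mz, tol)]


def reconstruct_neutral_loss_scan(spectra, neutral_mass, tol=0.5):
    """Precursor m/z values showing a product ion at precursor - loss/charge."""
    hits = []
    for s in spectra:
        target = s["precursor_mz"] - neutral_mass / s["charge"]
        if has_peak(s["peaks"], target, tol):
            hits.append(s["precursor_mz"])
    return hits


# Toy product ion spectra (invented values for demonstration only).
spectra = [
    {"precursor_mz": 650.3, "charge": 2,
     "peaks": [(106.1, 40.0), (601.3, 100.0), (720.4, 15.0)]},
    {"precursor_mz": 480.2, "charge": 1,
     "peaks": [(175.1, 80.0), (382.2, 60.0)]},
    {"precursor_mz": 522.8, "charge": 2,
     "peaks": [(106.0, 55.0), (300.2, 20.0)]},
]

marker_hits = reconstruct_precursor_scan(spectra, 106.1)
loss_hits = reconstruct_neutral_loss_scan(spectra, 98.0)
print("precursors yielding m/z 106:", marker_hits)
print("precursors losing 98 Da:", loss_hits)
```

Unlike a true precursor ion scan on a triple quadrupole, this approach can only report on precursors that were actually selected for MS/MS during the run.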
FIGURE 8.17. Schematic representation of a Fourier transform-ion cyclotron resonance (FT-ICR) mass
spectrometer.


Fourier transform-ion cyclotron resonance 


MS/MS using FT-ICR (Marshall and Verdun 1990; Amster 1996) is similar to quadrupole 
ion-trap MS/MS because in both cases, ions are trapped in three-dimensional space. 
However, reactions in the FT-ICR are carried out in a cell bound by electrodes (known as 
trapping, excite, and detect plates) located in a magnetic field, where the m/z value of an ion 
is directly related to its cyclotron frequency (see Figure 8.17). For MS/MS experiments, a "tai- 
lored" stored waveform inverse Fourier transform excitation pulse (or SWIFT, for short) is 
applied to the excite plates of the cell (Marshall et al. 1985). This causes ions other than the 
desired precursor ion to be excited and ejected. Following isolation of the precursor ion, CID 
is effected by accelerating the precursor ion in the presence of a pulsed collision gas to high-
er kinetic energies (but not ejecting it from the cell) by the application of a sustained off-res-
onance irradiation (SORI) frequency shifted away from the cyclotron resonance frequency of
the trapped ion (Gauthier et al. 1991). To acquire a product ion mass spectrum, ions are
excited by a broadband RF excitation pulse applied to the excite plates, and an image current
(the current induced in the walls of the cell due to the proximity of charged particles) is
detected by the detect plates as a function of time. This time domain signal is transformed
into a frequency domain signal and the masses are determined. Note that MSⁿ experiments
may be performed by simply adding additional isolation and reaction event steps to the
sequence in a manner similar to that for the quadrupole ion trap. For further description of 
FT-ICR, see the information panel on “TOP-DOWN” PROTEIN SEQUENCE ANALYSIS USING MS/MS. 
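
The direct relationship between m/z and cyclotron frequency noted above is f = zeB / (2πm) for an ion in a uniform magnetic field. The sketch below evaluates this unperturbed cyclotron frequency and its inverse. It neglects the small frequency shifts caused by the trapping electric field and by space charge, and the 7-tesla field strength is an assumed example value that does not come from the text.

```python
import math

E_CHARGE = 1.602176634e-19   # elementary charge (C)
DALTON = 1.66053906660e-27   # unified atomic mass unit (kg)


def cyclotron_frequency_hz(mz, b_tesla):
    """Unperturbed cyclotron frequency (Hz) of an ion of the given m/z."""
    return E_CHARGE * b_tesla / (2.0 * math.pi * mz * DALTON)


def mz_from_frequency(f_hz, b_tesla):
    """Invert the relationship: m/z from a measured cyclotron frequency."""
    return E_CHARGE * b_tesla / (2.0 * math.pi * f_hz * DALTON)


# Assumed example: a 7-T magnet.
for mz in (500.0, 1000.0, 2000.0):
    f = cyclotron_frequency_hz(mz, 7.0)
    print(f"m/z {mz:6.0f} -> {f / 1e3:8.2f} kHz")

f_1000 = cyclotron_frequency_hz(1000.0, 7.0)
```

Frequency is inversely proportional to m/z, so the Fourier-transformed image current yields all m/z values in the cell at once, and the precision with which frequency can be measured underlies the high mass accuracy of the technique.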


COMPLEX MIXTURES OF PROTEINS CAN BE SEPARATED AND ANALYZED BY COUPLING 
MS WITH PROTEIN SEPARATION METHODS 


A number of technologies have been successfully coupled with MS for the purpose of separat- 
ing complex mixtures of macromolecules and analyzing the components of the mixture. New 
technologies continue to be developed as the demand for efficient, high-throughput methods 
increases. The following sections explain some of these strategies for separating and analyzing
protein mixtures. For details on carrying out the methods, see the protocols in this chapter and
Appendix 2, and for related information and protocols, see Chapters 1, 2, 4, 5, and 9-11.


* MS can be preceded by 1D or 2D liquid chromatography (LC) to separate complex mix-
tures and/or remove contaminants that interfere with MS analysis.

* Proteins can be separated using 1D or 2D PAGE, followed by MS analysis of selected pro-
teins taken from the gel.

* Gel-separated proteins can be transferred to membranes prior to analysis by MS.

* Affinity chromatography can enrich for specific classes of proteins prior to MS analysis.


Coupling MS/MS to 1D Liquid Chromatography 


Separation and concentration of peptides from mixtures prior to sequence analysis by 
MS/MS are almost universally accomplished by on-line reversed-phase high-performance liq- 
uid chromatography (RP-HPLC). A protein digest can be loaded onto an HPLC system, sep- 
arated on-line by RP-HPLC, and the eluate analyzed by MS/MS. With the mass spectrometer's 
ability to select peptide ions based on their m/z values, it is well suited as a detector for analy-
sis of mixtures and provides more specific information (e.g., amino acid sequence and car-
bohydrate structure) than that of a peptide mass map. As ion signals increase above a chosen
threshold, MS/MS is performed on the ion until it drops back below the threshold value. It is
possible to collect MS/MS data on several of the most intense ions per unit time and to incor-
porate an exclusion list of ions that have already been selected for CID. (In practice, the mass
spectrometer can be programmed to perform one scan to determine the peptide masses and
then to sequence the three to eight most abundant peptides [a method known as data-depen-
dent acquisition].) With an LC system coupled to the tandem mass spectrometer through an
ESI source and the ability to perform data-dependent scanning, it is now possible to distin-
guish proteins in complex mixtures containing more than 50 components without first puri-
fying each protein to homogeneity (McCormack et al. 1994, 1997).
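
The selection logic of data-dependent acquisition (a survey scan, an intensity threshold, the most abundant ions first, and an exclusion list) can be sketched as follows. This is a simplified illustration rather than any instrument's actual firmware logic: the function name, tolerance, and peak lists are hypothetical, and entries never expire from the exclusion list, whereas real instruments typically remove them after a set time.

```python
def select_precursors(survey_peaks, exclusion, top_n=3, threshold=0.0, tol=0.5):
    """Choose up to top_n precursor m/z values from one survey scan.

    survey_peaks: list of (m/z, intensity) from the MS-only scan.
    exclusion:    list of m/z values already fragmented; updated in place.
    """
    # Keep peaks at or above the threshold, most intense first.
    candidates = sorted((p for p in survey_peaks if p[1] >= threshold),
                        key=lambda p: p[1], reverse=True)
    selected = []
    for mz, _intensity in candidates:
        if len(selected) == top_n:
            break
        # Skip ions that were already selected for CID.
        if any(abs(mz - excluded) <= tol for excluded in exclusion):
            continue
        selected.append(mz)
        exclusion.append(mz)
    return selected


# Two consecutive survey scans (invented values for demonstration only).
scan1 = [(500.3, 900.0), (622.8, 450.0), (710.4, 1200.0), (845.9, 80.0), (433.2, 300.0)]
scan2 = [(500.3, 950.0), (710.4, 1100.0), (433.2, 350.0), (845.9, 120.0)]

exclusion_list = []
first = select_precursors(scan1, exclusion_list, top_n=3, threshold=100.0)
second = select_precursors(scan2, exclusion_list, top_n=3, threshold=100.0)
print("scan 1 precursors:", first)
print("scan 2 precursors:", second)
```

In the second scan the two most intense ions are skipped because they were fragmented already, which lets less abundant peptides be sequenced as they elute.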

In MS/MS, a peptide ion is selected and fragmented, making it theoretically irrelevant as 
to how complex the original protein digest is unless there are isobaric peptide masses. 
Therefore, the coupling of LC to data-dependent MS/MS and automated database searching 
is a very powerful technique for "shotgun" identification of proteins in very complex mix- 
tures. This approach now allows biologists to explore the cell with unprecedented speed, 
determining protein identities, sites of posttranslational modifications, and protein-protein 
interactions, along with measuring the relative quantities of proteins in a sample (Gygi et al. 
1999; Griffin et al. 2001a; Natsume et al. 2002). 

The use of capillary columns for LC increases the sensitivity of the MS/MS analysis. LC- 
MS/MS can be made exquisitely sensitive by using capillary columns and capillary needle 
columns instead of traditional large-bore HPLC columns (Griffin et al. 1991; Emmett and 
Caprioli 1994; Davis et al. 1995; Gatlin et al. 1998; Moritz and Simpson 1992a,b, 1993; 
Natsume et al. 2002) (see also Chapter 5). Referred to as nano-LC or nLC, these narrow-bore
columns provide two advantages:


* Flow rates through these very narrow columns are 50-400 nl/min.
* The capillary column serves as the ionization source.

These features improve the sensitivity by reducing the volume of solvent per unit time
that is sprayed into the mass spectrometer, which in effect concentrates the sample as it enters
the ion source of the instrument. Protocols 4 and 5 detail methods for preparing and using
microcapillary columns for proteomic studies.
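
The concentrating effect of low flow rates can be illustrated with simple arithmetic: the same amount of peptide eluting in a peak of the same duration is dissolved in a volume proportional to the flow rate. The sketch below computes the average concentration across a peak. The peptide amount, peak width, and the conventional-column flow rate are assumed values chosen only for comparison.

```python
def peak_concentration_um(amount_fmol, flow_nl_per_min, peak_width_s):
    """Average analyte concentration (micromolar) across an eluting peak.

    1 fmol per nl equals 1 micromolar, so no further unit conversion is needed.
    """
    peak_volume_nl = flow_nl_per_min * peak_width_s / 60.0
    return amount_fmol / peak_volume_nl


# Assumed example: 100 fmol of peptide eluting in a 30-s peak.
nano = peak_concentration_um(100.0, 200.0, 30.0)             # 200 nl/min capillary
conventional = peak_concentration_um(100.0, 200000.0, 30.0)  # 200 ul/min (assumed)
print(f"nano-LC:      {nano:.3f} uM")
print(f"conventional: {conventional:.6f} uM")
print(f"concentration gain: {nano / conventional:.0f}-fold")
```

Because ESI response depends largely on analyte concentration rather than on total amount, the gain in concentration translates into improved sensitivity.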
MS SENSITIVITY: WHAT IS THE TYPICAL SENSITIVITY EXPECTED?


For most nano-LC (50-400 nl/min) systems coupled to triple-quadrupole mass spectrome-
ters, 1-10-fmole levels of sensitivity are achievable (Cox et al. 1994). Further improvements
in sensitivity (and increased resolution) can be gained using the newer quadrupole TOF
instruments. Although 1-5-fmole sensitivity is achievable using ion-trap mass spectrometers
(full-scan MS mode), this mass analyzer may be limited by the introduction of chemical noise
accumulating in the trap. With the introduction of variable-flow LC (5-200 nl/min), also
referred to as peak parking (Davis and Lee 1997), which is possible with the advent of fully
automated capillary LC systems, 5-10-attomole levels of sensitivity have been reported for
MS/MS analysis in ion-trap instruments (Martin et al. 2000). These detection levels are as
sensitive as those obtained with silver staining of proteins on SDS-PAGE gels (Link et al.
1999). However, it should be stressed that the inherent sensitivities of the MS configurations
mentioned above were determined on material directly applied to a nano-LC column and not
necessarily the material visualized in a gel band. (Sample losses are frequently incurred dur-
ing electrophoretic separation and subsequent sample handling steps prior to MS analysis.)


Coupling MS/MS to 2D Liquid Chromatography 


Although tandem mass spectrometers inherently possess the ability to sequence peptides 
directly from complex mixtures, highly complex peptide mixtures (e.g., proteome-scale) are
best analyzed first by 2D chromatography techniques. This is most commonly achieved by 
the combination of strong cation exchange (SCX) (Mant and Hodges 1985; Alpert and 
Andrews 1988) and RP-HPLC (Link et al. 1999; Gygi et al. 2000a,b; Washburn and Yates 2000; 
Washburn et al. 2001). By exploiting two of the physical properties of peptides (i.e., charge 
and hydrophobicity), complex peptide mixtures can be effectively resolved and concentrated 
prior to sequence analysis by MS. SCX chromatography, with its greater loading capacity, is 
generally used first. The eluent from the SCX chromatography is loaded onto an RP-HPLC 
column, which removes salts from the solution while performing a second separation of the 
peptides based on their hydrophobicity. The RP-HPLC column eluent is applied directly to 
the mass spectrometer using ESI. 

2D chromatography can be accomplished by either an on-line or an off-line approach. 
For the on-line approach, an acidified complex peptide mixture is applied to an SCX column, 
and discrete fractions of the adsorbed peptides are sequentially displaced directly onto the 
RP-HPLC column using a salt step gradient. Peptides are then analyzed by LC-MS/MS. This 
approach can utilize as many as 15 (or more) stepwise increases in salt concentration
("bumps") to fractionate a peptide mixture (Link et al. 1999; Washburn et al. 2001). The
on-line approach is amenable to being almost completely automated. Essentially, the sample is
loaded onto the SCX column, and 24 hours later, the analysis is finished (see Protocol 6). 
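
The overall length of an on-line run follows directly from the number of salt steps and the time needed for each reversed-phase separation. The sketch below lays out such a schedule. The equal salt increments, the 500 mM maximum, and the 96-minute cycle are hypothetical planning values chosen so that 15 steps total 24 hours; they are not taken from Protocol 6.

```python
def salt_steps(n_steps, max_salt_mm):
    """Equal-increment salt 'bump' concentrations (mM), ending at max_salt_mm."""
    return [max_salt_mm * (i + 1) / n_steps for i in range(n_steps)]


def total_run_hours(n_steps, rp_cycle_min):
    """Total analysis time if each salt step is followed by one RP-HPLC cycle."""
    return n_steps * rp_cycle_min / 60.0


# Hypothetical plan: 15 salt steps, each followed by a 96-min RP-HPLC cycle.
steps = salt_steps(15, 500.0)
hours = total_run_hours(15, 96.0)
for number, conc in enumerate(steps, start=1):
    print(f"step {number:2d}: {conc:6.1f} mM salt")
print(f"total run time: {hours:.1f} h")
```

Actual step concentrations are usually chosen empirically so that each bump releases a comparable number of peptides, rather than being spaced evenly as in this sketch.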

The off-line approach is performed by applying the acidified complex peptide mixture to 
the SCX chromatography column followed by a binary gradient to high salt to elute the pep- 
tides. Fractions are typically collected every minute into a 96-well plate and reduced in vol- 
ume, and then each fraction is loaded onto a reversed-phase chromatographic column auto- 
matically via an autosampler and finally analyzed by LC-MS/MS. The off-line technique has 
several advantages over the on-line approach: 


* Peptide separation is superior using true chromatography instead of salt steps.

* SCX chromatography is ideally performed with >20% acetonitrile in the buffers to lin-
earize peptides (Burke et al. 1989), yet the on-line approach can tolerate only 5-10% ace-
tonitrile.

* A greater number of peptide fractions can be collected with the off-line approach (e.g., 96 vs.
15).

* The investigator has the discretion to choose which fractions to analyze, and interesting
fractions can be analyzed more than once.


Protocol 7 provides the details of performing the off-line approach of 2D-LC. The on- 
line technique is discussed in great detail in Protocol 6 (see also Appendix 2). 


Microscale LC Cleanup of Protein Samples Improves the Value of Mass Spectra 


A major problem in MS analysis is the presence of nonvolatile low-molecular-mass contam-
inants that increase ion suppression effects and chemical noise (see Figure 8.18). In MALDI-
MS, the traditional dried-droplet sample preparation method (Hillenkamp et al. 1991) is per-
formed by mixing the analyte and matrix solutions directly on the MALDI target (see
Protocol 1). Alternatively, the matrix and analyte solutions can be mixed in a tube prior to
application onto the target. The tolerance of this method toward nonvolatile contaminants is
limited by the efficiency of the crystallization process and by sample washing following matrix
crystallization (see Table 8.4). For reviews on sample preparation techniques for peptides and
proteins analyzed by MALDI-MS, see Bornsen (2000) and Kussmann and Roepstorff (2000).
It has also been reported that matrix solution conditions (especially the solvent) have a large
influence on the quality of the MALDI-MS analysis (Cohen and Chait 1996).


FIGURE 8.18. (a) 2D gel electrophoresis pattern of enriched Mr 25,000-45,000 MDA-MB231 proteins. Shown is a MALDI-TOF-MS
analysis of a total tryptic digest of protein #25 from MDA-MB231 breast carcinoma cells. Proteins from a total cellular lysate of ~6 x
10⁷ MDA-MB231 cells were separated on a 10% SDS-PAGE slab gel. The Mr 25,000-45,000 region was excised and proteins were pas-
sively eluted from the gel slice (H. Ji et al. 1993), concentrated, and further resolved by 2D electrophoresis using an 18-cm linear pH
3-10 IPG (Amersham Biosciences) in the first electrophoretic dimension and 10% SDS-PAGE in the second dimension. The pH gra-
dient profile was calibrated using a carbonic anhydrase carbamylate calibration kit (Amersham Biosciences), according to the manu-
facturer's instructions. Proteins were visualized by staining with CBR-250, and spot #25 was selected for in-gel proteolysis (see Chapter
7, Protocol 13) and subsequent MS analysis. (b) The total peptide mixture was analyzed by MALDI-TOF-MS either directly (A) (i.e.,
no concentration/desalting step) by loading 0.5 µl of the peptide mixture (total volume ~100 µl, ~500 pmoles) or indirectly (B) fol-
lowing a simple capillary (0.32 mm I.D.) column RP-HPLC concentration/desalting step (ON/OFF), where the total peptide mixture
(total volume ~100 µl, ~500 pmoles) was applied to the column, the column washed extensively with Solvent A (aqueous 0.1% TFA),
and the total peptide mixture was then recovered in ~25 µl by developing the column with 100% Solvent B (60% acetonitrile in aque-
ous 0.1% TFA) in a batchwise manner. MALDI-TOF-MS sample load: 0.5 µl. (Reproduced, with permission, from Reid et al. 1998.)


TABLE 8.4. Excipients compatible with MALDI-MS

Excipient                  Maximum recommended concentration
Urea                       0.5 M
Guanidine-HCl              0.5 M
Glycerol                   1%
Alkali metal salts         0.1 M
Tris buffer                0.05 M
NH₄HCO₃                    0.05 M
Phosphate buffer           0.01 Mᵃ
Detergents                 n.c.ᵇ
SDS                        0.005%
Sodium chloride            10 mMᶜ
Hexafluoroisopropanol      up to 40%

Useful Web Sites for sample preparation: University of Illinois: www.biotech.uruc.edu/mass. guide htm; www.srsmaldi.cor?
Maldi/Step/Desalt.htmlStanford Research; www.srstmaldi.com; www.astbury.leeds ac uk/ Facil/M^tut/mstutorial.h tm.

ᵃPhosphate buffers should be avoided, because they discriminate salts that are hygroscopic (e.g., sodium acetate) because of
the resultant wet surface.

ᵇn.c. indicates not compatible. Some samples require the addition of a detergent for solubilization (especially if the sample
has a hydrophobic character). In these situations, Triton X-150 or Tween-80 can be added at a concentration of 0.1-0.0%
(Bornsen 2000).

ᶜAddition of 2-propanol improves signal intensities of high-salt containing samples.

An increase in sensitivity and resolution over the dried-droplet technique was obtained 
by the introduction of the fast evaporation or thin-layer method (Vorm et al. 1994) for 
MALDI-MS matrix preparations. The matrix is dissolved in a highly volatile organic solvent 
and applied onto the target to create a thin homogeneous layer of matrix crystals. The ana- 
lyte solution is then placed on top of the film and dried at ambient temperature. A higher 
number of analyte molecules are presumably desorbed upon laser irradiation, as they are 
embedded exclusively in the outermost layer of crystals. In addition, increased signal intensi- 
ties and sensitivity, together with a higher tolerance toward contaminants, have been 
observed. The fast evaporation method was further improved with the inclusion of nitrocel- 
lulose in the matrix (Jensen et al. 1996a,b; Kussmann et al. 1997). The presence of nitrocel- 
lulose reduced the intensity of the alkali metal ion adducts observed in the mass spectra, pre- 
sumably because it prevented the alkali metal ions from being desorbed into the gas phase. 

Nonvolatile contaminants interfere even more readily with ESI-MS than with MALDI- 
MS. The exact mechanism by which the nonvolatile contaminants suppress the ionization of 
the analyte has not yet been determined. Recently, it has been postulated that the suppression 
may be partly the result of attractive forces holding drops tightly together, thus inhibiting 
smaller droplets from forming as they enter the mass spectrometer. In addition, contaminat- 
ing salts cause the analyte to precipitate, further hampering the analysis of the sample (King 
et al. 2000). Consequently, a chromatographic step must be included to remove nonvolatile 
contaminants from biological samples prior to their analysis by ESI-MS. 

A significant improvement in sample preparation for both MALDI-MS and ESI-MS 
came with the introduction of custom-made disposable microcolumns as a fast cleanup step 
prior to MS. This approach was first proposed by Wilm and Mann (1996), who used a thin 
capillary needle packed with a small volume of reversed-phase chromatographic material and 
eluted the analyte molecules directly into a nano-ESI capillary needle by centrifugation. In 
this way, small amounts of sample could be rapidly concentrated and desalted prior to MS 
analysis. Low-molecular-mass contaminants are easily removed from the column, resulting 
in an increased signal-to-noise ratio in the mass spectrum and thus improved sensitivity. In 
addition, significantly higher amino acid sequence coverage from peptide mass maps is 
observed, which is important for the complete characterization of a protein. More recently, 
this technique has been further simplified by using either GELoader tips (Eppendorf) packed 
with chromatographic material or prepacked microcolumns (e.g., ZipTips, Millipore) 
(Gobom et al. 1997; Kussmann et al. 1997; Erdjument-Bromage et al. 1998; Gobom et al. 
1999; Shaw et al. 1999; John et al. 2000; P.M. Larsen et al. 2001; M.R. Larsen et al. 2001b,c). 


Customized Microcolumns and Strategies for MALDI-MS Sample Cleanup 


Reversed-phase microcolumns are an efficient and fast way to clean up and concentrate pro- 
tein samples prior to MALDI-MS analysis (Kussmann et al. 1997; Gobom et al. 1999). In 
addition to reversed-phase resins, a variety of other chromatographic resins can be used in 
microcolumns for sample cleanup prior to MS analysis. Protocols 2 and 3 detail the prepara- 
tion of microcolumns containing either reversed-phase or immobilized enzyme resin, respec- 
tively. Some popular resins include: 


* Poros R1 and R2 and Oligo R3 (Applied Biosystems) are reversed-phase packings used to 
desalt and concentrate proteins and, especially, peptides. Poros R1 is designed for very 
hydrophobic proteins and peptides. The binding strength is similar to low carbon-loading 
C4 supports. Poros R2 is designed for general separation of proteins, peptides, and nucleic 
acids. The binding strength is similar to low carbon-loading C8 or C18 supports. The Oligo 
R3 medium is designed for hydrophilic peptides and nucleic acids and is similar to high 
carbon-loading C18 supports. For other reversed-phase packings and capillary column fab- 
rication, see Chapter 5. 


* Immobilized metal affinity chromatography (IMAC) material is used for purification of 
His-tagged proteins and characterization of phosphorylated proteins and peptides (see 
Chapters 9 and 10 and Protocol 1 in Chapter 9). 


* Graphite powder can be used as a peptide cleanup medium, especially for small hydro- 
philic peptides or phosphopeptides (Chin and Papac 1999). However, it also works very 
well for all other peptides if they are eluted with the matrix, α-cyano-4-hydroxycinnamic 
acid (HCCA) (Larsen et al. 2002a). Graphite powder can also be used to desalt and con- 
centrate carbohydrates prior to MS analysis (Larsen et al. 2002b). 


* Immobilized proteolytic enzymes can be used for rapid and efficient protein/peptide diges- 
tion (Gobom et al. 1997; M.R. Larsen et al. 2001b). 


* Immobilized RNA or DNA can be used to affinity-purify very small amounts of nucleic- 
acid-binding proteins prior to SDS-PAGE. This has been used to isolate proteins binding to 
the human insulin-like growth factor II leader 2 mRNA (Pedersen et al. 2002). Only a lim- 
ited amount of cell material is necessary with this approach (M.R. Larsen et al. 2001a). 


In a significant number of studies, the use of microcolumns for sample cleanup prior to 
MS has improved sensitivity and enhanced the identification of low-abundance proteins sep- 
arated by gel electrophoresis (e.g., see Shaw et al. 1999; John et al. 2000; P.M. Larsen et al. 2001; 
M.R. Larsen et al. 2001b,c; Natsume et al. 2002). The next six sections provide examples of 
ways in which microcolumns can be used to facilitate sample cleanup prior to analysis by MS. 

Desalting and concentration of proteins 


Microcolumns have been used to desalt and concentrate trypsinogen prior to MALDI-MS 
analysis. Trypsinogen was diluted from a stock solution in phosphate-buffered saline (PBS) 
buffer to a final concentration of 1 pmole/µl; 1 µl of this solution was mixed with the matrix 
solution, 2,5-dihydroxybenzoic acid (DHB) in 50% acetonitrile/1% trifluoroacetic acid 
(TFA) on the MALDI target. The resulting MALDI-MS spectrum from this preparation is 
shown in Figure 8.19a. The signal-to-noise ratio is low, probably due to the high-salt content 
in the PBS buffer, resulting in very low signals from the protein. In addition, the salt mole- 
cules contribute to a decreased resolution of signal due to overlapping peaks from sodium 
and potassium adducts. If the same amount of sample is desalted and concentrated on a 
Poros R1 microcolumn and eluted with DHB, a pronounced increase in the signal-to-noise 
ratio is evident (Figure 8.19b) leading to improved signal intensity and resolution. 
Additionally, some fragments originating from the protein could also be detected. 
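
The adduct overlap described above can be illustrated with a little arithmetic. The following minimal Python sketch is not part of the original protocol. It computes the expected m/z of the singly charged protonated, sodiated, and potassiated forms of an analyte from its neutral mass. The function names and the example mass of 24,000 Da (a round number chosen only for illustration, not a measured value for trypsinogen) are assumptions; the cation masses are standard monoisotopic values with the electron mass subtracted.

```python
# Monoisotopic masses (Da) of singly charged cations (electron mass subtracted).
CATION_MASS = {
    "[M+H]+": 1.007276,
    "[M+Na]+": 22.989221,
    "[M+K]+": 38.963158,
}


def adduct_mz(neutral_mass):
    """Return expected m/z values of the singly charged adducts of one analyte."""
    return {name: neutral_mass + mass for name, mass in CATION_MASS.items()}


def resolving_power_needed(neutral_mass, a="[M+H]+", b="[M+Na]+"):
    """Rough m/dm needed to separate two adduct peaks (illustrative estimate only)."""
    mz = adduct_mz(neutral_mass)
    return mz[a] / abs(mz[b] - mz[a])


if __name__ == "__main__":
    example_mass = 24000.0  # hypothetical round number, not a measured value
    for name, mz in adduct_mz(example_mass).items():
        print(f"{name}: {mz:.2f}")
    print(f"m/dm to split H+/Na+ peaks: {resolving_power_needed(example_mass):.0f}")
```

For a protein of this size, the sodium adduct lies only about 22 daltons above the protonated peak, so at modest resolving power the adduct peaks tend to merge into one broad signal. This is consistent with the loss of resolution described for the sample that was not desalted.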


Desalting and concentration of peptides 


Another reversed-phase resin, Poros R2, can be used in microcolumns for desalting and con- 
centrating samples prior to MALDI-MS analysis. The advantages of using this resin are illus- 
trated by the analysis of a tryptic digest of a yeast protein separated by 2D gel electrophore- 
sis. Figure 8.20 compares the performance of the traditional MALDI-MS sample preparation 

FIGURE 8.19. Comparison of the dried-droplet and micropurification sample preparation using trypsino- 
gen dissolved in PBS buffer. (a) MALDI spectrum of 1 µl of a stock solution of trypsinogen (1 pmole/µl) 
using DHB dissolved in 50% acetonitrile/1% TFA as matrix. (b) MALDI spectrum of the same amount of 
trypsinogen desalted and concentrated on a Poros R1 microcolumn prior to MS. The protein was eluted 
from the microcolumn with DHB dissolved in 50% acetonitrile/0.1% TFA. (Courtesy of Martin R. Larsen, 
Macquarie University.) 

FIGURE 8.20. Comparison of the traditional matrix and the micropurification methods using the peptide 
mixture from an in-gel tryptic digest of a weak [35S]methionine-labeled yeast protein. (a) Autoradiogram 
illustrating the "intensity" of the excised protein spot. The total protein load on the gel equaled 200 µg. (b) 
MALDI-MS of an aliquot (1 µl) of the tryptic digest supernatant using the traditional matrix preparation 
method. (c) MALDI-MS of 1 µl of the tryptic digest supernatant desalted and concentrated on a Poros R2 
microcolumn. (d) MALDI-MS as in c but with an aliquot of 10 µl. (Courtesy of Martin R. Larsen, 
Macquarie University.) 

method (see Protocol 1) with the micropurification method described in Protocol 2. The 
excised spot is shown on the magnified image of a 2D gel containing 200 µg of [35S]methio- 
nine-labeled yeast proteins (Figure 8.20a) (Nawrocki et al. 1998). The protein spot was 
excised and subjected to in-gel digestion with trypsin (see Protocol 13 in Chapter 7). The 
MALDI-MS peptide mass map obtained using the traditional matrix preparation method 
with an aliquot of the digest supernatant (1 µl of 20 µl) is shown in Figure 8.20b. In this 
example, a large proportion of low-molecular-mass contaminants (e.g., salts) yielded a poor 
signal-to-noise ratio in the low-mass area, and consequently, only a few peptide signals were 
detected. The protein could not be identified from this peptide mass map. When the same 
amount of sample was desalted and concentrated on a Poros R2 microcolumn, the signal-to- 
noise ratio and the overall signal intensities increased, resulting in the detection of a sufficient 
number of peptides to unambiguously identify the protein (Figure 8.20c). The peptides in the 
latter peptide mass map covered 18% of the amino acid sequence. If a larger aliquot (10 µl) 
of the sample was micropurified, a further improvement in the signal-to-noise ratio and sig- 
nal intensities was observed (Figure 8.20d), resulting in detection of more peptides (45% 
sequence coverage). 


Sequential elution of peptides 


Even though the use of microcolumns prior to MS increases the signal-to-noise ratio and the 
overall signal intensities, resulting in higher sequence coverage, some peptides may still be 
lost during the analysis. One of the major reasons is the pronounced suppression effect gen- 
erally observed in mass spectrometric analysis (Figure 8.21). The initial competition for pro- 
tons in the ionization processes results in some components giving a much lower signal than 
if single-component analysis were performed. By using sequential extraction of peptides 
from the microcolumn with an increasing concentration of acetonitrile, a complex peptide 
mass map can be divided into several simpler peptide mass maps with an increased number 
of peptides detected. In addition to a greater sequence coverage, this approach is especially 
useful for reducing the number of components in the sample prior to de novo tandem mass 
spectrometric sequencing and post-source decay (PSD) analysis. 

The sequential extraction concept is illustrated by the analysis of a previously unknown 
protein originating from a pig sperm membrane protein preparation separated by 2D gel 
electrophoresis. The protein spot was excised and subjected to in-gel digestion with trypsin 
(see Chapter 7, Protocol 13). A small aliquot of the digest supernatant (1%) was applied onto 
a Poros R2 microcolumn. After washing the column, the peptides were sequentially eluted 
with increasing acetonitrile concentrations. The resulting peptide mass maps are shown in 
Figure 8.22. When compared with the “normal” elution using HCCA in 70% acetonitrile and 
0.1% TFA, illustrated by the lowest peptide mass map, significantly more peptide signals are 
detected with the sequential elution (nine peptides). In addition, the signal-to-noise ratio and 
the overall quality of the spectra are improved. Although it is very useful for protein charac- 
terization, sequential elution is rather time-consuming. Recently, it has been shown that a 
two-step elution using 16% and 30% acetonitrile is sufficient for a significant increase in the 
number of detected peptide signals (Erdjument-Bromage et al. 1998). 


Use of increasingly hydrophobic microcolumns improves 
sequence coverage 


Small peptides that are very hydrophilic or have been posttranslationally modified are diffi- 
cult to detect by MS, which may be due to either peptide loss during purification or sup- 
pression by matrix signals. However, by trying different chromatographic materials in the 

FIGURE 8.21. MALDI-TOF-MS analysis of a total tryptic digest of protein #25 from MDA-MB231 breast 
cancer carcinoma cells following stepped elution capillary column (0.32 mm I.D.) RP-HPLC. The total pep- 
tide mixture (total volume ~100 µl, ~500 pmoles) was applied to the column at 16 µl/min, and the column 
was developed in a stepwise manner with 40% Solvent B (a), 70% Solvent B (b), and 100% Solvent B (c), 
where Solvent A was aqueous 0.1% TFA, and Solvent B was 60% acetonitrile in aqueous 0.1% TFA. The vol- 
ume of collected fractions was ~25 µl. MALDI-TOF-MS sample load: 0.5 µl. Note in panel c that the ion- 
ization of tryptic peptide T1 (m/z 2402.2) is completely suppressed in the presence of other peptides in the 
mixture (cf. panels a and b). (Reproduced, with permission, from Reid et al. 1998.) 

FIGURE 8.22. MALDI-MS peptide mass maps of peptides eluted from a Poros R2 column with increasing 
concentrations of acetonitrile; 1% of a peptide mixture originating from digestion of a gel-separated pro- 
tein was applied to a Poros R2 microcolumn. After washing, the peptides were eluted stepwise using increas- 
ing concentrations of acetonitrile. The MALDI-MS peptide mass map in the bottom of the figure illustrates 
results from elution using HCCA in 70% acetonitrile/0.1% TFA. (Courtesy of Martin R. Larsen, Macquarie 
University.) 


microcolumn, some of these peptides can be recovered and analyzed. Two resins, Oligo R3 
and graphite powder, which are more hydrophobic than Poros R2, can be used efficiently for 
this purpose. 

The peptide mass maps of in-gel-generated tryptic peptides from in-vitro-phosphorylat- 
ed F-STOP protein illustrate the advantages of using Poros R2 together with Oligo R3 (Figure 
8.23). After digestion, an aliquot of the digest supernatant was applied onto a Poros R2 
microcolumn. The peptides were desalted and concentrated and analyzed by MALDI-MS 
(Figure 8.23a). The flowthrough from the R2 column was passed through an Oligo R3 micro- 
column. The MALDI-MS peptide mass map obtained from the R3 column eluant (Figure 
8.23b) contained five peptide signals that were not detected in the R2 map. The amino acid 
sequence of these signals revealed a high number of hydrophilic amino acids. In addition to 
an increase in sequence coverage of almost 10%, a phosphorylated peptide could also be eas- 
ily detected in the R3 peptide map (M.R. Larsen et al. 2001c). The usefulness of using Oligo 
R3 for enrichment of phosphorylated peptides prior to MS has also been shown by Neubauer 
and Mann (1999) and Larsen and Roepstorff (2000). 

Graphite columns have been used in HPLC for desalting hydrophilic peptides prior to 
MALDI-MS (Chin and Papac 1999) and can also be packed into microcolumns and used in a 
sample preparation step prior to MS (Larsen et al. 2002a,b). The advantages of graphite pow- 
der are its low cost and compatibility with both Poros R2 and Oligo R3, permitting recovery of 

FIGURE 8.23. MALDI-MS peptide mass maps of peptides originating from tryptic digestion of gel-separat- 
ed F-STOP protein. (a) MALDI-MS peptide mass map of a 5% aliquot of peptide mixture from the diges- 
tion of F-STOP protein, after desalting and concentration on a Poros R2 microcolumn. (b) MALDI-MS pep- 
tide mass map of the flowthrough after desalting and concentration on an Oligo R3 microcolumn. The pep- 
tide signals marked with a circle illustrate those not retained by the Poros R2 chromatographic material. 
(Courtesy of Martin R. Larsen, Macquarie University.) 


smaller peptides. Note, however, that the choice of matrix in MALDI-MS has a significant role 
in distinguishing the smaller peptides from the matrix background (Chin and Papac 1999). 

An example of the use of graphite microcolumns is given in Figure 8.24. Only the low- 
mass region from 500 to 1200 daltons is shown. A protein spot derived from a 2D gel of 
enriched pig sperm membrane proteins was submitted to in-gel tryptic digestion. A small 
aliquot of the peptides (2%) was desalted and concentrated on a Poros R2 column and ana- 
lyzed by MALDI-MS (Figure 8.24a). The protein was identified as voltage-dependent anion 
channel 2 protein with 53% sequence coverage. The flowthrough from the Poros R2 column 
was collected and concentrated on an Oligo R3 column. The MALDI-MS peptide mass map 
of the peptides bound to the Oligo R3 (Figure 8.24b) revealed only one peptide not detected 
in the Poros R2 map, increasing the sequence coverage by a further 3%. The flowthrough from 
the Oligo R3 column was collected and concentrated on a microcolumn packed with graphite 
powder. The MALDI-MS peptide mass map of the peptide eluted with HCCA in 70% ace- 
tonitrile and 0.1% TFA from the graphite column is shown in Figure 8.24c. Five peptides not 
detected in either the Poros R2 or the Oligo R3 maps were found in this map. The five pep- 
tides are relatively hydrophilic and are probably not bound to either of the previous columns 
or they are suppressed in the peptide mass maps. The five peptides cover 10% of the protein 
sequence and thereby increase the overall sequence coverage to 66%. 
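
The cumulative coverage figures quoted above (53% after Poros R2, a further 3% from Oligo R3, and a further 10% from graphite, giving 66%) amount to taking the union of the residues covered by each peptide map. A minimal Python sketch of that bookkeeping follows. The function name, the protein length, and the residue ranges are hypothetical and are not the actual peptides shown in Figure 8.24.

```python
def sequence_coverage(protein_length, peptide_ranges):
    """Percent of residues covered by at least one peptide.

    peptide_ranges: iterable of (start, end) residue positions, 1-based, inclusive.
    Overlapping peptides are counted only once.
    """
    covered = set()
    for start, end in peptide_ranges:
        covered.update(range(start, end + 1))
    return 100.0 * len(covered) / protein_length


if __name__ == "__main__":
    length = 100  # hypothetical protein length
    r2 = [(1, 30), (41, 63)]     # hypothetical peptides seen in the first map
    r3 = r2 + [(64, 66)]         # one extra hypothetical peptide from the second tier
    graphite = r3 + [(80, 89)]   # extra hypothetical peptides from the third tier
    for label, ranges in (("R2", r2), ("R2+R3", r3), ("R2+R3+graphite", graphite)):
        print(label, sequence_coverage(length, ranges))
```

Because the residues are collected in a set, a peptide that overlaps one already detected on an earlier column adds only its new residues, which is why the per-column gains can simply be summed when the newly detected peptides do not overlap the earlier ones.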

The use of a multi-tiered approach with increasingly hydrophobic chromato- 
graphic material packed in microcolumns for characterizing proteins is advantageous for 
detection of a larger number of components in the sample and thus improved sequence cov- 
erage. The analysis is fast and uses only a limited amount of the total sample. Graphite pow- 
der can also be used in a one-tiered approach for desalting and concentration of peptides 
prior to MALDI-MS if the peptides are eluted off of the column with HCCA in 70% ace- 
tonitrile and 0.1% TFA (Larsen et al. 2002a). 

FIGURE 8.24. MALDI-MS peptide mass maps of a micropurified peptide mixture originating from an in- 
gel digestion of voltage-dependent anion channel 2 protein. Only the low-mass region from 500 to 1200 dal- 
tons is shown. (a) Peptide mass map of the peptides eluted off the Poros R2 microcolumn. (b) Peptide mass 
map of the flowthrough from the Poros R2 desalted and concentrated on an Oligo R3 microcolumn. (c) 
Peptide mass map of the flowthrough from the Oligo R3 column desalted and concentrated on a graphite 
microcolumn. The sequences of the peptides not retained on either the Poros R2 or Oligo R3 are shown. 
(Courtesy of Martin R. Larsen, Macquarie University.) 


Comparison of Poros R2 microcolumns with prepacked µC18 ZipTips 


The effectiveness of homemade microcolumns (packed in GELoader tips) for desalting and 
concentration of samples prior to MS was compared with commercially available prepacked 
µC18 ZipTips (Millipore). The results are summarized in Figure 8.25. Peptides originating 
from an in-gel tryptic digest of a previously unknown protein were prepared with either a 
Poros R2 microcolumn or µC18 ZipTips. The MALDI-MS peptide mass maps of increasing 
amount of digestion supernatant (1-4 µl) using the GELoader tip approach are shown in 
Figure 8.25a. The number of peptides that could be detected with increasing amount of 
digestion supernatant was 10, 17, 23, and 25, respectively. For comparison, the same amount 
of peptides was desalted and concentrated using the µC18 ZipTips, according to the manu- 
facturer's protocol (Figure 8.25b), and the number of peptides that could be detected with 
increasing amounts of digestion supernatant was 0, 0, 3, and 9, respectively. 

The reasons for the different performances of the two columns are only speculative. The 
GELoader tip columns have a much lower bed volume, permitting the peptides to be effi- 
ciently eluted with a very low volume (typically 0.5 µl) of HCCA in 70% acetonitrile con- 
taining 0.1% TFA, whereas the ZipTip requires a higher elution volume, which dilutes the 
sample. In addition, in the GELoader tip approach, the elution volume can be divided into 

FIGURE 8.25. Comparison of the performance of GELoader tip microcolumns and the commercially avail- 
able µC18 ZipTip. The peptide mixture was generated from the protein marked on the gel picture. (a) 
GELoader tip microcolumns: MALDI-MS peptide mass maps of increasing volumes of a peptide mixture 
originating from a pig sperm membrane protein, 1-4 µl, respectively. (b) µC18 ZipTip: MALDI-MS peptide 
mass maps of increasing volumes of a peptide mixture originating from a pig sperm membrane protein, 1-4 
µl. (Courtesy of Martin R. Larsen, Macquarie University.) 


several spots on the MALDI target (up to ten), where the peptides are only found in the first 
and second spots. This cannot be done using the ZipTips as the peptides seem to be eluted 
randomly in all of the small fractions (data not shown). The low bed volume also increases 
the ratio of surface area to volume, resulting in increased loading capacity in the GELoader 
tip. The effect described above is most noticeable when characterizing low-abundance pro- 
teins. The efficiency of the ZipTips can be significantly improved if the buffers and protocol 
described for the GELoader tips above are used, i.e., loading the peptide solution from the top 
of the ZipTip (data not shown). 


A ZipTip pipette tip (from Millipore) is a microcolumn with the resin prepacked into the 
narrow end of a 10-µl pipette tip. The resins currently available include reversed-phase and 
immobilized metal ion resins. [...] columns are called for in any of the protocols in this 
chapter. [...] protocols at their Web Site (http://www.millipore.com/ziptip) [...] 

FIGURE 8.26. MALDI-MS peptide mass map of peptides originating from an on-column tryptic digestion 
of 2 pmoles of BSA. An aliquot (4 µl) of a solution of BSA (0.5 pmole/µl) was diluted in 20 µl of 50 mM 
NH4HCO3 (pH 7.6) and applied onto a microcolumn containing immobilized trypsin packed as described 
in Protocol 3. The peptides were desalted and concentrated on a Poros R2 microcolumn prior to MALDI- 
MS analysis. (Courtesy of Martin R. Larsen, Macquarie University.) 


On-column enzymatic digestion 


GELoader tips packed with immobilized enzyme material can also be used to rapidly digest 
proteins in solution with high efficiency (Gobom et al. 1997; M.R. Larsen et al. 2001b). For a 
procedure for preparing immobilized enzyme microcolumns, see Protocol 3. 

As an example, 2 pmoles of bovine serum albumin (BSA, Mr ~67 kD) was digested on a 
microcolumn containing immobilized trypsin. The flowthrough was concentrated and 
desalted on a Poros R2 microcolumn, and the peptides were eluted directly onto the MALDI 
target using HCCA in 70% acetonitrile and 0.1% TFA. The resulting MALDI-MS peptide 
mass map is shown in Figure 8.26 (only the mass range of 800-3500 daltons is shown). From 
the peptide mass map, peptides covering ~40% of the BSA sequence could be assigned read- 
ily, even though the protein had not previously been reduced and alkylated, thus leaving most 
of the disulfide bonds intact. In a similar experiment using carbamidomethylated BSA, the 
sequence coverage from the on-column digestion was 82.3% compared to only 72.7% from 
the in-solution digest (Gobom et al. 1997). This approach is fast and in many cases more effi- 
cient than traditional solution digestion. 


Global Protein Arrays: Identifying Proteins Immobilized on Membranes 


One consequence of the increased sensitivity of modern mass spectrometers is that only 
small quantities of protein are needed for identification. Hence, clinical samples such as tis- 
sue biopsies can be prepared for electrophoresis and many of the separated proteins can be 
identified. Another advantage of separating proteins by electrophoresis is that the proteins 
can be electrophoretically transferred to a membrane for archiving valuable clinical samples. 

Various groups have identified proteins directly from electroblotted proteins with 
MALDI-TOF-MS. Infrared MALDI-TOF-MS has been used to determine intact protein 
masses of 1D and 2D electrophoretically separated proteins electroblotted onto a variety of 
membranes (Eckerskorn et al. 1992). The sensitivity of analysis was comparable to silver- 
stained gels, and some posttranslational modifications were identified by comparison of the 
theoretical mass (predicted from the DNA sequence) and the apparent mass (Eckerskorn et 
al. 1997a,b). UV MALDI-TOF analysis has been used to measure intact protein masses of 1D- 
separated bands blotted onto polyvinylidene difluoride (PVDF) membranes (Strupat et al. 
1994; Vestling and Fenselau 1994; Patterson 1995; Blais et al. 1996) and derivatized PVDF 
membranes (Immobilon CD) (Patterson 1995). 

Another approach for the identification of proteins directly from the membrane surface 
is an endoproteinase digest of the protein followed by MS analysis of the resulting peptides 
(Schleuder et al. 1999). Immobilon CD, a cationic durapore membrane, showed the most 
reproducible results for infrared MALDI. It was thought that the hydrophilic surface was 
more conducive to an efficient tryptic digestion than the hydrophobic surface of underiva- 
tized PVDF (Schleuder et al. 1999). The results for underivatized PVDF demonstrated high- 
er signal intensities and better reproducibility, and larger numbers of tryptic fragment signals 
were observed with an increase of the pore size diameter (Schleuder et al. 1999). Enzyme- 
compatible matrices have also been investigated to acquire an intact protein mass by MALDI 
and then perform an enzymatic digestion on the protein (Schleuder et al. 1999). 


The Molecular Scanner 


One automated technology that has emerged is the molecular scanner (Bienvenut et al. 1999; 
Binz et al. 1999), a process where all proteins separated on the 2D polyacrylamide gel are 
digested simultaneously during the electroblotting process. Endoproteinase digestion occurs 
via a membrane that contains immobilized trypsin, which is intercalated between the gel and 
a PVDF capture membrane. The membrane is then scanned directly by MALDI-TOF-MS. 
Minimal sample manipulation occurs compared to “in-gel” digestion, and the gel image is 
created by direct matching of the MS scanning results with the corresponding robotic coor- 
dinate. The molecular scanner is only suitable for proteins below 60,000 daltons with a pI 
below 8.5 because electroblotting is performed at a pH that is suitable for trypsin activity. The 
technique has been demonstrated on 1D bands of standard proteins, small sections of 2D 
blots of human plasma, and Escherichia coli (Bienvenut et al. 1999; Binz et al. 1999). Protocol 
12 describes the use of the molecular scanner technology. A disadvantage of the molecular 
scanner is that it is a “shotgun” technique whereby all proteins are fully digested by the treat- 
ment, and it requires global scanning to find the proteins present. 


The Chemical Printer 


An electroblotted archive of 2D-PAGE-separated proteins (bound to a membrane) is a type 
of protein chip, albeit a macroarray of proteins. The membrane-derived macroarray is iden- 
tical to a chip-based protein microarray except that rather than the coordinates of each pro- 
tein being predetermined in robotic pixels as in the microarray, they are determined by the 
physical attributes of the protein's apparent molecular mass and isoelectric point. Once the 
coordinates of each protein within the protein macroarray have been identified by an image- 
capture device, each protein spot now has a defined X, Y position and can be manipulated by 
any state-of-the-art robotic platform. 

Protocol 13 describes the use of a technology that combines the advantages of both pro- 
tein chips and 2D electrophoresis, termed a "chemical printer" (Figure 8.27). The chemical 
printer uses a microdispensing device, such as drop-on-demand type ink-jet technology 

FIGURE 8.27. Prototype chemical printer at Proteome Systems. (a) Motion control XY stage, (b) MicroJet 
piezoelectric devices, (c) sample reservoirs, (d) membrane viewing camera, (e) elbow drop formation cam- 
era, (f) pneumatics control panel, (g) delay strobe for droplet viewing. The fluid dispensing, XY motion of 
the target platform, and Z motion of the piezoelectric devices are controlled by integrated software and con- 
troller boxes. As many as 4 piezoelectric devices can be mounted on the instrument and up to 4 MALDI tar- 
gets with membranes attached. (Courtesy of Andrew A. Gooley, Proteome Systems.) 


FIGURE 8.28. Enlarged view of the MicroJet piezoelectric device assembly. The assembly has three of four 
possible devices installed (a) connected to the reagent delivery system (b) with vacuum connection (c). In 
this view, device 4 is currently active with the vacuum toggle in the "on" position (d). A MALDI target with 
attached electroblot is held in position via a vacuum on the XY motion stage (e). (Courtesy of Andrew A. 
Gooley, Proteome Systems.) 

(Adams and Roy 1984; Bogy and Talke 1984; Dijksman 1984). Small volumes of liquid can be 
dispensed by application of a voltage pulse to a piezoelectric material (PZT, lead, zirconium, 
and titanium) that imparts an acoustic wave to a liquid sample. A motion-control stage 
enables the drop to be accurately dispensed on a predefined target location. Microdispensing 
of solutions onto a membrane of electroblotted proteins at defined coordinates permits local- 
ized endoproteinase digestion and subsequent MALDI-TOF-MS peptide mass fingerprinting 
directly from the membrane surface. This approach bypasses the multiple liquid-handling 
steps associated with "in-gel" digestion procedures. Importantly, the ability to analyze immo- 
bilized proteins allows both for archiving of samples pre/post analysis and for multiple chem- 
ical reactions, such as several endoproteinase reactions or reactions that uncover protein 
modifications (e.g., glycosylation), to be performed on the same protein spot. 

IN VIVO AND IN VITRO LABELING METHODS PERMIT MS TO BE USED FOR 
QUANTITATION AND EXPRESSION PROTEOMICS 


It is well acknowledged that the traditional 2D gel electrophoresis approach for differential 
protein profiling has significant, inherent limitations, especially when analyzing whole-cell 
lysates (Gygi et al. 2000b) (see also Chapter 1). The following are the major drawbacks: 


* Limited sample capacity. 
* Limited detection sensitivity (see Figure 1.8). 


* Difficult to resolve hydrophobic proteins (e.g., membrane-associated proteins). 


* A significant portion of 2D electrophoresis-separated gel spots contain more than one pro- 
tein due to coelectrophoresis and/or differentially modified (or processed) forms of the 
same protein. 


* 2D electrophoresis is a very labor-intensive, slow, and technically demanding method that 
is not readily amenable to high-throughput or automation. 


These limitations severely restrict the detection of low-abundance proteins, which make 
up a large proportion of a given proteome (Gygi et al. 2000b). Thus, 2D electrophoresis does 
not give a true representation of all expressed proteins. Rather, it reflects only a small slice of 
the proteome. 

An alternative proteomic approach that has the potential to identify trace-abundant pro- 
teins involves the affinity capture of cysteine-containing peptides (Spahr et al. 2000). 
Described in Protocol 8, this in vitro "labeling" method, which relies on an affinity tag for
cysteines, simplifies the mixture of peptides (generated by proteolytic digestion of a whole
lysate), such that the mixture contains approximately tenfold fewer peptides than the
original mixture. (This simplification can facilitate the MS analysis of the sample.) By
extending this approach to include an isotope-coded affinity tag (ICAT), which has three
functional moieties (a cysteine-reactive moiety, a linker that contains an isotope or mass
"tag," and a biotin moiety, i.e., the affinity tag), quantitation information can be obtained
(Gygi and Aebersold 1999, 2000; Gygi et al. 1999, 2000) (see Protocol 9).

It has been shown that global isotope labeling can also be achieved by intrinsically labeling
proteins in vivo while they are being expressed (Oda et al. 1999; Chen et al. 2000; Veenstra
et al. 2000; Peng and Gygi 2001). For example, isotopic labeling can be performed by growing
identical strains of a bacterium, such as E. coli, in normal and 15N-enriched media so that
proteins isolated from the two cultures are differentially labeled (Oda et al. 1999; Conrads et
al. 2001). Protocol 10 outlines a simple protocol for intrinsic 15N-labeling of protein in E. coli.

In vivo isotopic labeling can also be performed by growing bacteria in media depleted of
the naturally occurring heavy isotopes of carbon (13C) and nitrogen (15N), and using FT-ICR-MS
(Marshall et al. 1997; Pasa-Tolic et al. 1999). Bacteria grown in isotope-depleted versus
normal media yield proteins whose peaks are shifted toward a monoisotopic mass. For example,
a protein with 1000 carbon atoms (~22 kD) yields a protein with an average molecular mass
that is 11 daltons lighter than at natural abundance. Only FT-ICR-MS is capable of resolving
such small differences in molecular mass (Miranker 2000); these differences in mass are used
to measure differential protein expression. For further discussion on quantification as a tool
for measuring global protein changes, see Chapter 1.
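
The size of the mass shift quoted above can be checked with a short calculation. The following Python sketch is illustrative only: the function name is hypothetical, and the constants (a natural 13C abundance of about 1.07% and a 13C-12C mass difference of about 1.003355 Da) are assumed textbook values that do not come from this chapter. It considers carbon only and assumes complete depletion unless a residual abundance is supplied.

```python
# Sketch: average-mass shift when 13C is depleted from a protein.
# The constants below are assumed reference values, not taken from this manual.
C13_NATURAL_ABUNDANCE = 0.0107   # fraction of carbon atoms that are 13C
C13_MINUS_C12_DA = 1.003355      # mass difference between 13C and 12C (Da)


def average_mass_shift(n_carbons, abundance=C13_NATURAL_ABUNDANCE, residual=0.0):
    """Return how many daltons lighter the average mass becomes after 13C depletion.

    residual is the 13C fraction left in the depleted medium (0.0 = fully depleted).
    """
    return n_carbons * (abundance - residual) * C13_MINUS_C12_DA


if __name__ == "__main__":
    shift = average_mass_shift(1000)
    print(f"Protein with 1000 carbons: about {shift:.1f} Da lighter")
```

For 1000 carbon atoms the sketch gives a shift of roughly 10.7 Da, which rounds to the 11 daltons cited in the text.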

In many proteomic problems, it is necessary to define the carboxyl terminus of a protein 
(e.g., to determine the carboxy-terminal processing site). Although this can be accomplished 
by automated carboxy-terminal sequencing (see Chapter 6), the anhydrotrypsin method
outlined in Protocol 11 has the potential for high-throughput and enhanced sensitivity.




SEARCH ENGINES FOR IDENTIFYING PROTEINS USING MS DATA 


History 


This section was kindly contributed by Eugene A. Kapp and James E. Eddes from the Joint 
ProteomicS Laboratory (JPSL) of the Ludwig Institute for Cancer Research and the Walter 
and Eliza Hall Institute of Medical Research, Melbourne, Victoria, Australia. 


In 1984, Biemann et al. (Gibson and Biemann 1984) proposed a strategy for mass spectro- 
metric verification of three protein structures based on their DNA sequence. This strategy 
included digesting the proteins using the enzyme trypsin, separating the pool of peptides by 
HPLC, and then determining the masses of the peptides by fast atom bombardment (FAB-MS) 
(Morris et al. 1981). These values were then compared with molecular masses of tryptic pep- 
tides predicted from the DNA-deduced amino acid sequence using all three reading frames. 

The computer program (FRAGFIT) described by Henzel et al. (1993) identified proteins 
by matching two or more molecular masses of peptide fragments obtained from chemical or 
enzymatic cleavages with all the fragment masses in a protein sequence database. Similar
strategies were presented by C.J. March and T. Farrah (pers. comm.), using plasma desorp- 
tion MS, and J.R. Yates et al. (pers. comm.). In 1993, five groups outlined their strategies for 
protein identification using MS by relying on the fact that multiple observed peptide frag- 
ment masses could be correlated with the theoretical peptide fragment masses obtained from 
a protein sequence database. The groups of Pappin et al. (1993) and James et al. (1993) out- 
lined superior scoring schemes, taking into account the size of protein entries in the database. 
Searching of expressed sequence tag (EST) libraries and their six-frame translation was 
described by James et al. (1994). Probability scoring was introduced to take into account the 
large number of possible matches, as well as the concept of adding extra information to the 
search by means of dual digests or hydrogen-deuterium exchange experiments. Masses alone 
no longer afforded the discrimination required for a positive identification due to the size of 
the database being searched. During the last 10 years, the main focus on improving these 
strategies has been in providing reliable input to these search engines and developing 
improved scoring schemes to improve confidence in the search result. 

A software algorithm for interpreting MS/MS data and hence deducing the amino acid 
sequence of a protein was first published by Biemann et al. (1966). The use of amino acid 
sequence information, derived by Edman sequencing or MS/MS, as well as the mass of a pep- 
tide in a database search, was first described by Mann (1994). Rather than searching multiple 
peptide masses, it was suggested that one peptide mass plus a sequence tag was sufficiently 
discriminating in searching a sequence database. The characteristic fragment ion patterns of 
peptides, previously described by Biemann (1988) and modified by Roepstorff and Fohlman
(1984), were used to pinpoint the tag (usually two to four residues in length). In 1994, Eng et 
al. (1994) proposed searching protein sequence databases using uninterpreted MS/MS data, 
which required searching the mass of the peptide and correlating the observed MS/MS frag- 
ment ions against the theoretical fragment ion patterns for a peptide (SEQUEST program). 
This strategy has stood the test of time and has proven popular due to the relative ease and 
nonexpert nature of operation. 


Sequence Databases 


TABLE 8.5. Internet addresses for nonredundant protein and DNA sequence databases

Database                                    Internet address
NCBI                                        http://www.ncbi.nlm.nih.gov
EBI (EMBL)                                  http://www.ebi.ac.uk
Japan                                       http://www.ddbj.nig.ac.jp
SwissProt                                   http://www.expasy.ch
NCBInr from NCBI                            ftp://ftp.ncbi.nih.gov/blast/db (click nr.Z)
dbEST from NCBI                             ftp://ftp.ncbi.nih.gov/blast/db (click est_human.Z,
                                            est_mouse.Z, or est_others.Z)
GenPept (protein translation of GenBank)    ftp://ftp.ncbi.nih.gov/genbank (click genpept.fsa.gz)
OWL                                         ftp://ftp.ncbi.nih.gov/repository/OWL (click owl.fasta.Z)
PIR                                         http://pir.georgetown.edu/pirwww/search/pirnref.shtml


The most sensitive comparisons between sequences are made at the protein level. Detection
of distantly related sequences is easier in protein translation, because the redundancy of the
genetic code of 64 codons is reduced to 20 distinct amino acids. However, the loss of
degeneracy at this level is accompanied by a loss of information that relates more directly to the
evolutionary process, because proteins are a functional abstraction of genetic events that
occur in DNA (Attwood and Parry-Smith 1999). DNA sequence databases contain genomic
sequence data, including information at the level of the untranslated sequence, introns
and exons, mRNA, cDNA, and translations. ESTs are generally 200-400 bases in length,
obtained from cDNA libraries, and represent the expressed genome of a specific cell type. A
search of a DNA or EST database therefore requires conceptual translation in six reading
frames because it is not known whether the first base marks the start of the coding sequence.
There are three forward frames, which are achieved by beginning to translate at the first,
second, and third bases, respectively; the three reverse frames are determined by taking the
reverse complement of the DNA sequence and again beginning on the first, second, and third
bases. ESTs by their very nature are incomplete and to a certain degree inaccurate, with an
error frequency of 2-5%. Error rates of 0.01-0.05% are reported for genomic sequence data.
Currently, with more than 6 million EST entries in the public dbEST database, it is prudent to
first search a nonredundant protein sequence database before a six-frame DNA translation is
attempted.
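
The six-frame conceptual translation described in this section can be sketched in a few lines of Python. This is a minimal illustration, not the procedure used by any particular search engine. It assumes the standard genetic code, reads the three reverse frames from the reverse complement of the input strand, and marks stop codons with "*" and unrecognized codons with "X". The function names and the example sequence are hypothetical.

```python
# Sketch: conceptual translation of a DNA sequence in all six reading frames.
# Standard genetic code, built in TCAG order (an assumption of this example).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna):
    """Return the reverse complement of a DNA string."""
    return dna.translate(COMPLEMENT)[::-1]


def translate(dna, offset):
    """Translate complete codons starting at the given offset (0, 1, or 2)."""
    codons = (dna[i:i + 3] for i in range(offset, len(dna) - 2, 3))
    return "".join(CODON_TABLE.get(codon, "X") for codon in codons)


def six_frame_translation(dna):
    """Return a dict mapping frame labels (+1..+3, -1..-3) to protein strings."""
    dna = dna.upper()
    rc = reverse_complement(dna)
    frames = {}
    for offset in range(3):
        frames[f"+{offset + 1}"] = translate(dna, offset)
        frames[f"-{offset + 1}"] = translate(rc, offset)
    return frames


if __name__ == "__main__":
    for label, protein in sorted(six_frame_translation("ATGGCCAAATAA").items()):
        print(label, protein)
```

Each of the six translated strings would then be digested and searched in the same way as an entry from a protein database, which is why a six-frame search costs roughly six times as much as a search of the corresponding protein set.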

The major available public domain sequence databases are maintained at NCBI, EBI 
(EMBL outstation), and Japan. Although extensive interchange of information occurs 
between these groups, there is no rigorous system to ensure that any one database is truly 
nonredundant and contains a complete set of known protein sequences. Table 8.5 lists 
Internet addresses of commonly used protein and DNA sequence databases. 


Peptide Mass Fingerprinting: Protein Identification by Peptide
Mass Mapping


MALDI-TOF-MS is sensitive, capable of high-throughput, and currently the first choice for 
protein identification from 2D electrophoresis gels. A sample protein is digested with a spe- 
cific protease, such as trypsin, and the peptide fragment masses are determined by MS. The 
peptide fragment masses provide a fingerprint of the protein of interest and are not unilat- 
erally affected by the presence of posttranslational modifications. The masses of the mea- 
sured proteolytic peptides are compared to predicted proteolytic peptides from sequence 
databases (Figure 8.29). In detail, a protein sequence entry from the database is theoretically 
digested according to the enzyme cleavage rules, taking into account user-specified
parameters such as number of missed cleavage sites, monoisotopic or average masses, and known
modifications (e.g., gel-induced). A theoretical mass spectrum is therefore constructed for 
each protein entry from the sequence database and compared to the mass-measured spec- 
trum. The comparison or scoring function can be simple (i.e., number of matching ions) or 
more complex, taking into account a multitude of variables such as the molecular mass of the
protein, pI, digestion conditions, and posttranslational modifications. The difference in the
score between the top two ranked proteins often suggests protein identity. The success of
peptide mass fingerprinting (or mass mapping) depends on the detection of a representative set
of peptide masses derived from a protein and on the protein in question being known (i.e., it
exists in a protein sequence database).

FIGURE 8.29. Schematic of peptide mass fingerprinting (PMF) versus MS/MS-based protein identification
strategies. PMF algorithms compare the measured masses of intact peptide ions with the calculated masses
of peptides derived from in silico proteolysis of proteins from a database. Each protein is scored primarily
by counting the number of matching peptide masses within a user-prescribed mass tolerance. PMF protein
identification strategies are well suited to analysis of relatively pure samples containing few proteins.
MS/MS-based algorithms attempt to correlate uninterpreted CID spectra to theoretical spectra of peptides
derived from in silico proteolysis of proteins from a database. As with PMF, CID spectra are initially
matched to peptide sequences with an intact peptide ion mass similar to that of the observed precursor ion.
The calculated peptide fragment ion spectrum of the peptide sequence is then compared to the observed
CID spectrum. As CID spectra yield sequence information, unambiguous protein identifications can be
made with significantly fewer matching peptides than required by PMF approaches. MS/MS-based protein
identification strategies therefore allow confident analysis of samples comprising complex mixtures of
proteins.
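
The simplest scoring scheme described above (in silico digestion followed by counting matching masses) can be sketched as follows. This is a minimal illustration and not the algorithm of any search engine listed in this chapter. The cleavage rule (carboxy-terminal to K or R, but not before P), the monoisotopic residue masses, the toy database entries, the observed masses, and all function names are assumptions made for the example.

```python
# Sketch: peptide mass fingerprinting by in silico tryptic digestion and mass counting.
# Monoisotopic residue masses (Da); assumed standard values.
MONO = {
    "G": 57.02146, "A": 71.03711, "S": 87.03203, "P": 97.05276, "V": 99.06841,
    "T": 101.04768, "C": 103.00919, "L": 113.08406, "I": 113.08406, "N": 114.04293,
    "D": 115.02694, "Q": 128.05858, "K": 128.09496, "E": 129.04259, "M": 131.04049,
    "H": 137.05891, "F": 147.06841, "R": 156.10111, "Y": 163.06333, "W": 186.07931,
}
WATER = 18.010565
PROTON = 1.007276


def tryptic_digest(sequence, missed_cleavages=0):
    """Cleave after K or R (not before P); allow up to missed_cleavages skipped sites."""
    sites = [i + 1 for i, aa in enumerate(sequence[:-1])
             if aa in "KR" and sequence[i + 1] != "P"]
    bounds = [0] + sites + [len(sequence)]
    peptides = []
    for i in range(len(bounds) - 1):
        for j in range(i + 1, min(i + 2 + missed_cleavages, len(bounds))):
            peptides.append(sequence[bounds[i]:bounds[j]])
    return peptides


def mh_plus(peptide):
    """Monoisotopic mass of the singly protonated peptide, [M+H]+."""
    return sum(MONO[aa] for aa in peptide) + WATER + PROTON


def pmf_score(observed_masses, sequence, tolerance_da=0.2, missed_cleavages=1):
    """Count observed masses that match any theoretical peptide within the tolerance."""
    theoretical = [mh_plus(p) for p in tryptic_digest(sequence, missed_cleavages)]
    return sum(
        any(abs(obs - theo) <= tolerance_da for theo in theoretical)
        for obs in observed_masses
    )


# Hypothetical two-entry database and hypothetical observed [M+H]+ values.
database = {"toy_protein_1": "MKEITALAPSTMKLSVDIATR", "toy_protein_2": "MAGWSHPQFEKGGR"}
observed = [1161.62, 874.50, 1500.00]
ranking = sorted(database, key=lambda name: pmf_score(observed, database[name]),
                 reverse=True)

if __name__ == "__main__":
    for name in ranking:
        print(name, pmf_score(observed, database[name]))
```

A plain match count like this favors large proteins, which yield more theoretical peptides. That bias is one reason the more elaborate scoring schemes mentioned in the History section take the size of each database entry into account.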

Figure 8.30a shows an example MALDI spectrum from a tryptic digest of a 2D gel spot. 
Two proteins were positively identified using a hierarchical-based search, in which the second 
protein was identified by searching unmatched masses from the top scoring protein. Figure 
8.30b shows a MALDI spectrum that resulted in a false-positive hit (i.e., an incorrect
identification). MS/MS was used to identify the protein successfully. Table 8.6 displays a
comprehensive list of PMF search engines.


Peptide Fragment Ion Searches: Protein Identification by MS/MS of Peptides


The set of fragment ions generated by MS/MS acts as a fingerprint for an individual peptide.
Two or more peptides identified in this way are usually sufficient to unambiguously identify
a protein. This approach allows searching of databases that contain incomplete gene
information (e.g., ESTs) and also allows the identification of proteins from complex mixtures. In
the CID process, peptides fragment in a semipredictable manner; thus, sequences from the
database can be used to predict an expected fragmentation pattern and match the expected
pattern to that observed in the acquired data. The matching of observed against theoretical
(expected) fragment ion masses (as depicted in Figure 8.29) involves counting and summing
the matching ions. A thorough understanding of the various fragmentation pathways or ions
resulting from an MS/MS experiment under particular ionization conditions is required for
successful correlation (see the information panel on FRAGMENTATION MECHANISMS OF
PROTONATED PEPTIDES IN THE GAS PHASE). At present, the mass spectrometric idiosyncrasies of
individual amino acids present in a peptide, as well as the rather subjective assessment of the
importance (abundance) of an ion, remain unresolved and require further work.
Notwithstanding this issue, peptide fragment ion searches are generally more reliable and
successful at identifying peptides and therefore proteins, especially if the proteins are of
mammalian origin. Example ESI spectra are shown in Figure 8.31 with the rankings of the
correct peptide identifications based on two MS/MS search algorithms. Both search engines
correctly identified the peptide shown in Figure 8.31a, whereas the identifications from
Figure 8.31b,c were ambiguous, with both search engines failing due to selective enhanced
cleavages at specific amino acid residues. Table 8.7 shows a comprehensive list of MS/MS
search engines.
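
The counting of matching fragment ions described above can be illustrated with a short Python sketch. It is a simplified example under stated assumptions and not the scoring function of SEQUEST or of any other search engine. Only singly charged b- and y-type ions are predicted, ion abundance is ignored, the monoisotopic residue masses are assumed standard values, and the "observed" m/z values and function names are hypothetical.

```python
# Sketch: predict singly charged b- and y-ions and count matches to an MS/MS peak list.
# Monoisotopic residue masses (Da); assumed standard values.
MONO = {
    "G": 57.02146, "A": 71.03711, "S": 87.03203, "P": 97.05276, "V": 99.06841,
    "T": 101.04768, "C": 103.00919, "L": 113.08406, "I": 113.08406, "N": 114.04293,
    "D": 115.02694, "Q": 128.05858, "K": 128.09496, "E": 129.04259, "M": 131.04049,
    "H": 137.05891, "F": 147.06841, "R": 156.10111, "Y": 163.06333, "W": 186.07931,
}
WATER = 18.010565
PROTON = 1.007276


def fragment_ions(peptide):
    """Return (b_ions, y_ions); index 0 holds b1 and y1, respectively."""
    residues = [MONO[aa] for aa in peptide]
    b_ions, y_ions = [], []
    for i in range(1, len(peptide)):
        b_ions.append(sum(residues[:i]) + PROTON)           # amino-terminal fragment
        y_ions.append(sum(residues[-i:]) + WATER + PROTON)  # carboxy-terminal fragment
    return b_ions, y_ions


def count_matches(observed_mz, peptide, tolerance_da=0.5):
    """Count observed peaks lying within the tolerance of any predicted b- or y-ion."""
    b_ions, y_ions = fragment_ions(peptide)
    theoretical = b_ions + y_ions
    return sum(
        any(abs(mz - theo) <= tolerance_da for theo in theoretical)
        for mz in observed_mz
    )


# Hypothetical peak list: y1, y5, b1 of EITALAPSTMK plus one unrelated peak.
observed = [147.11, 563.29, 130.05, 999.90]

if __name__ == "__main__":
    for candidate in ("EITALAPSTMK", "KITALAPSTMK"):
        print(candidate, count_matches(observed, candidate))
```

The two candidate sequences, taken from Figure 8.31, differ only in their first residue. They therefore share every y-ion and differ only in the b-ion series, so a spectrum dominated by y-type ions cannot easily discriminate between them.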
FIGURE 8.30. (a) MALDI-MS spectrum of an in situ digest of a 2D electrophoresis gel spot. A PMF
algorithm was used to compare the observed peptide ion masses with those in a nonredundant protein database.
Peaks identified with a gray triangle belong to the top scoring protein match, heterogeneous nuclear
ribonucleoproteins A2/B1 (accession number: P22626). Peaks labeled with a blue circle are from the second place
protein match, malate dehydrogenase (accession number: P40926). (b) MALDI-MS spectrum of an in situ
digest of a 2D electrophoresis gel spot. Interrogation of the database using the observed peptide ion masses
yields a false-positive identification.


FIGURE 8.31. Ranked peptide matches returned by the two search algorithms for each spectrum
(spectra not reproduced; y axis: relative abundance). See the legend below.

(a) Algorithm A: 1. LVQDVANNTNEEAGDGTTTATVLAR
    Algorithm B: 1. LVQDVANNTNEEAGDGTTTATVLAR
(b) Algorithm A: 1. EITALAPSTMK
    Algorithm B: 1. KITALAPSTMK; 2. EITALAPSTMK
(c) Algorithm A: 1. LSVDIATRER; 2. EITALAPSTM*K
    Algorithm B: 1. KITALAPSTM*K; 2. EITALAPSTM*K
SUMMARY


Highly reliable and fully automated identification procedures are required in the modern 
proteomics laboratory. Balancing the effort to identify as many proteins as possible against
the effort to avoid misidentifying a protein requires integration among sample handling
(chemistry), experimental design and goals, and bioinformatics workflow. A technique
or methodology, which was previously used in isolation and which had little or no impact on 
downstream processes, might now require a radical re-think when integrated into the mod- 
ern proteomics workflow. For example, particular chemistries that result in a nonspecific 
modification will certainly hinder fast database searches. The impact of each technique or 
methodology on downstream techniques therefore requires careful and strategic planning on 
the basis of the overall objective of the study. Currently, valuable time and resources are being 
spent on verifying database search results and on manually inferring MS/MS data that do not 
give reliable search results, due in many cases to unknown modifications. Our lack of under- 
standing of the fundamental gas-phase ion chemistry of biomolecules and the mechanisms 
that yield observed fragment ions also hinders the development of robust software tools for 
the required high-throughput proteomics arena. 

Many improvements to database search engines have been made in the last few years, 
particularly with respect to scoring schemes for peptide mass fingerprint searches but also the 
development of faster algorithms, particularly multithreaded versions. The success of these 
algorithms relies on the completeness of databases and the availability of a good scoring 
mechanism. Correct usage and a better understanding of these search engines will also lead 
to higher-quality search results and fewer false-positive identifications. With increased
automation and less interactive assessment of results comes the need for quality control,
especially in developing methods to assess the reliability of protein identification. Hybrid
search strategies, particularly PMF combined with MS/MS of selected peptides, are
expected to have a major role in proteomics applications. Labeling experiments, whereby
fragmentation mechanisms of peptides could be better controlled or directed, and the simul- 
taneous measurement of the amount of protein in question are expected to be further refined 
for the high-throughput environment. Protein/peptide information obtained by these MS- 
based methodologies can be further analyzed for its biological significance (such as structure, 
functional context, and interacting partners) using the bioinformatics tools described in 
Chapter 11. 


FIGURE 8.31. (a) The ESI-IT CID spectrum shown was searched against a nonredundant protein database using two separate
algorithms. Both algorithms return the peptide sequence LVQDVANNTNEEAGDGTTTATVLAR as the best match. Fragment ions
corresponding to b- and y-type ions are labeled and shown in pale blue and dark blue, respectively. (b) The ESI-IT CID spectrum
shown was searched against a nonredundant protein database using two separate algorithms. Algorithm A returns the correct
peptide sequence, EITALAPSTMK, as the top match. Algorithm B returns the correct sequence as its second match. Although both
sequences returned by Algorithm B are confirmed by the dominance of the y5 ion, corresponding to preferential cleavage of the
peptide backbone amino-terminal to the proline residue at position seven, the identification of EITALAPSTMK as the correct
peptide is inferred from the frequency of identified peptides from the same protein in the analysis. (c) The ESI-IT CID spectrum
shown was searched against a nonredundant protein database using two separate algorithms. Both Algorithm A and Algorithm B
return the correct peptide sequence, EITALAPSTMK, as the second match (asterisk denotes oxidized methionine). The identification
can be confirmed by the characteristic loss of CH3SOH from the precursor ion associated with the oxidized methionine residue at
position ten.


PROTOCOL 1


General Method for MALDI-MS Analysis of 
Proteins and Peptides 


MATERIALS 


For MALDI-TOF-MS ANALYSIS OF PROTEINS AND PEPTIDES, samples are cocrystallized with an
excess of organic matrix that absorbs at a specific wavelength (usually in the UV range). Typically,
sinapinic acid (SA) is the matrix of choice for large proteins, whereas α-cyano-4-hydroxycinnamic
acid (HCCA) is the preferred matrix for peptide mapping (for matrix selection, see
Table 8.8). Following a short laser pulse, analytes are protonated and desorbed into the gas
phase, and their m/z values are determined in a TOF mass analyzer. Mass accuracy
determinations vary from ±0.01% to ±0.1% depending on the sample preparation technique and the
method used for mass calibration. This protocol was contributed by David Frecklington
(Joint Proteomics Laboratory [JPSL] of the Ludwig Institute for Cancer Research and the
Walter and Eliza Hall Institute of Medical Research, Melbourne, Australia).


» Reagents 
CAUTION: See Appendix 3 for appropriate handling of materials marked with <!>.


Calibration standards (1-10 pmoles/µl in 60% acetonitrile <!>, aqueous 0.1% v/v TFA <!>)
To obtain the best interpretation of MALDI-TOF-MS data, standards of known molecular mass
close to the molecular mass of the unknown sample are required to ensure a linear calibration
curve. These must be prepared and chosen carefully. Typically, angiotensin I or angiotensin II,
whose accurate molecular masses are known, can be used for peptide mapping studies. Tables 8.9
and 8.10 contain a range of peptides and proteins that can be used routinely as standards in
MALDI-MS analysis. All of these standards are commercially available, and therefore, stock
solutions can be accurately prepared. Stock solutions of peptide calibrants are prepared at a
concentration of 100 pmoles/µl and stored as aliquots (frozen stocks can be kept for several years).
For working solutions, these aliquots are diluted to 1-10 pmoles/µl with aqueous 0.1% (v/v) TFA.
Working solutions of calibrant can be used for 1 week when kept at 4°C.

The concentration of the calibration standards must be similar to that of the unknown sample
(range 1-10 pmoles/µl) to ensure accurate results.
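
The dilution of a 100 pmoles/µl stock to a 1-10 pmoles/µl working solution follows the usual relationship C1 x V1 = C2 x V2. The Python sketch below is only a convenience illustration; the function name and the 50-µl final volume are hypothetical choices and are not prescribed by this protocol.

```python
# Sketch: volumes needed to dilute a calibrant stock (C1 * V1 = C2 * V2).
def dilution_volumes(stock_conc, working_conc, final_volume):
    """Return (stock_volume, diluent_volume) in the same units as final_volume.

    Concentrations must share one unit (e.g., pmoles/µl).
    """
    if not 0 < working_conc <= stock_conc:
        raise ValueError("working concentration must be positive and not exceed the stock")
    stock_volume = working_conc * final_volume / stock_conc
    return stock_volume, final_volume - stock_volume


if __name__ == "__main__":
    # Hypothetical example: 50 µl of a 10 pmoles/µl working solution from a 100 pmoles/µl stock.
    stock_ul, diluent_ul = dilution_volumes(100.0, 10.0, 50.0)
    print(f"{stock_ul} µl stock + {diluent_ul} µl aqueous 0.1% (v/v) TFA")
```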

Matrix solution (20 mg/ml matrix in 60% acetonitrile, 0.1% TFA) 

Under these conditions, the matrices are saturated solutions and must be centrifuged prior to use. 
For uniform sample/matrix formation, avoid using undissolved matrix crystals. HCCA <!>, DHB
<!>, and SA <!> can be obtained from Sigma.


Methanol <!>
Matrix(a)    Peptide      Small      Large      Glyco-     Glyco-     References
             mapping(b)   proteins   proteins   peptides   proteins
HCCA         1            3          -          2          -          Beavis et al. (1992)
SA           -            1          1          3          2          Beavis and Chait (1989b)
DHB          2            2          1          1          1          Strupat et al. (1991); Karas et al. (1993)
THAP         2            -          -          -          -          Kussmann and Roepstorff (2000)
DHAP/DAHC    2            -          -          -          -          Gorman et al. (1996)


Portion adapted, with permission, from Kussmann and Roepstorff (2000).

The choice of matrix must be adapted to the physical properties of the analyte. First choice is denoted by 1, second choice by 2, and third choice by 3.

(a) (HCCA) α-cyano-4-hydroxy-cinnamic acid; (SA) sinapinic acid; (DHB) 2,5-dihydroxybenzoic acid; (THAP) 2,4,6-trihydroxyacetophenone; (DHAP)
2,6-dihydroxyacetophenone; (DAHC) diammonium hydrogen citrate.

(b) The choice of matrix can considerably influence the sequence coverage in peptide mapping. For example, peptide mapping data obtained with the
matrix THAP can complement those obtained with DHB and HCCA (Kussmann and Roepstorff 2000).


TABLE 8.9. Peptide mass calibration standards

Name                                        Source      Monoisotopic  Average      Sigma        SwissProt
                                                        mass [M+H]+   mass [M+H]+  product no.  reference
Leu enkephalin                              Human       556.277       556.6        L 9333       PENK_HUMAN
Met enkephalin                              Human       574.234       574.7        M 6638       PENK_HUMAN
[D-Ala-2]-Deltorphin II                                 782.396       782.9                     DAD? PHYBI
Angiotensin II                              Human       1046.542      1047.2       A 9535       ANGT_HUMAN
Bradykinin                                  Human       1060.5692     1061.23      B 3259       KNH_HUMAN
Luteinizing hormone-releasing hormone       Human       1182.581      1183.3       L 7134       GON1_HUMAN
Ala-Pro-Gly-[Ile-5, Val-5] Angiotensin II   Human       1271.654      1272.5       A 0289       ANGT_HUMAN
Angiotensin I                               Human       1296.6853     1297.50      A 9650       ANGT_HUMAN
Substance P                                             1347.7360     1348.7       S 6883       TKNB_HUMAN
α-endorphin                                 Human       1745.842      1747.0       E 6136       COLI_HUMAN
Peptide sequencing standard                             1657.841      1658.8       P 2046
Neurotensin                                             1690.928      1692.0       N 6383       NEUT_HUMAN
Renin substrate                             Porcine     1758.933      1760.1       R 8129       ANGT_PIG
Renin substrate                             Human       1759.940      1761.0       R 5880       ANGT_HUMAN
Dynorphin                                               2147.1988     2148.52
Adrenocorticotropic hormone fragment 18-39  Human       2465.199      2466.7       A 0673       COLI_HUMAN
Insulin A-chain, oxidized                   Bovine      2530.921      2532.7       I 1633       INS_BOVIN
Melittin                                    Bee venom   2845.762      2847.5       M 1407       MEL]. APIME
β-endorphin                                             3463.8296     3466.04
Glucagon                                    Bovine      3481.624      3483.8       G 7774       GLUC_BOVIN
Insulin B-chain, oxidized                   Bovine      3494.651      3496.9       I 6383       INS_BOVIN
Adrenocorticotropic hormone fragment 7-38               3657.929      3660.2       A 1527       COLI_HUMAN
Insulin                                     Sheep       5700.598      5704.5       I 9254       INS_SHEEP
Insulin                                     Bovine      5730.609      5734.6       I 5500       INS_BOVIN
Insulin                                     Equine      5744.624      5748.6                    INS_HORSE
Insulin                                     Porcine     5774.635      5778.6                    INS_PIG
Insulin                                     Human       5804.646      5808.6       I 0259       INS_HUMAN
                                            (recombinant in E. coli)
Ubiquitin                                   Bovine      8560.625      8565.8       U 6253       UBIQ_BOVIN


Reproduced from http://www.srsmaldi.com/Maldi/Res/Pep_Stds.html.
TABLE 8.10. Protein mass calibration standards

Name                                Source        Average mass  Sigma        SwissProt
                                                  [M+H]+        product no.  reference
Cytochrome c                        Tuna          12043.6       C 2011       CYC_KATPE
Cytochrome c                        Bovine        12232.0       C 3131       CYC_BOVIN
Cytochrome c                        Equine        12361.2       C 7752       CYC_HORSE
Ribonuclease A                      Bovine        13683.4       R 5500       RNP_BOVIN
α-lactalbumin                       Human         14071.1       L 7269       LCA_HUMAN
α-lactalbumin                       Bovine        14179.1       L 6010       LCA_BOVIN
Lysozyme c                          Chicken egg   14306.1       L 6876       LYC_CHICK
Hemoglobin α-chain                  Human         15127.3       G 5890       HBA_HUMAN
Hemoglobin β-chain                  Human         15868.7       G 5890       HBB_HUMAN
Apomyoglobin                        Equine        16952.5       A 8673       MYG_HORSE
β-lactoglobulin B                   Bovine        18278.2       L 8005       LACB_BOVIN
β-lactoglobulin A                   Bovine        18364.3       L 7880       LACB_BOVIN
Trypsin inhibitor, @-chain          Soybean       20037.6       T 9003       ITRB_SOYBN
Trypsin inhibitor, B-chain          Soybean       20091.7       T 9003       ITRA_SOYBN
Trypsin inhibitor, X-chain          Soybean       20163.8       T 9003       ITRA_SOYBN
Growth hormone                      Human         22126.2       S 4776       SOMA_HUMAN
Trypsinogen                         Bovine        23982.0       T 1143       TRYP_BOVIN
Chymotrypsinogen A                  Bovine        25657.        C 4879       CTRA_BOVIN
Aldolase                            Rabbit        39204.7       A 7145       ALFA_RABIT
Pepsinogen                          Porcine       39607.9       P 4656       PEPA_PIG
Alcohol dehydrogenase, S            Equine        39712.4       A 6128       ADHS_HORSE
Alcohol dehydrogenase, E            Equine        39833.4       A 6128       ADHE_HORSE
Ovalbumin                           Chicken       44401.09      A 2512       OVAL_CHICK
Glucose-6-phosphate dehydrogenase   Yeast         57433.5       G 4134       G6PD_YEAST
Albumin                             Bovine        66431.0       A 0281       ALBU_BOVIN
Albumin                             Human         66439.2       A 3782       ALBU_HUMAN
Urease                              Jack bean     90762.9       U 0376       UREA_CANEN
Phosphorylase B                     Rabbit        97219.5       P 6635       PHS2_RABIT
Albumin dimer                       Bovine        132859.0      A 9030       ALBU_BOVIN


Reproduced from http://www.srsmaldi.com/Maldi/Res/Prot_Stds.html.


» Equipment


MALDI mass spectrometer
See the note to Step 6.


MALDI plate


» Biological Sample 


Lyophilized samples (dissolved in 60% acetonitrile, 0.1% v/v TFA at a concentration of 1-10
pmoles/µl) or RP-HPLC-purified fractions, which can be used directly

Samples containing excipients such as buffers, salts, detergents, and denaturants must be desalted 
prior to analysis (Rabilloud 1990; Burlet et al. 1992; Papayannopoulos 1995; Patterson 1997) (see 
Protocol 2). Even minor quantities of sodium (m/z 23) and potassium (m/z 39) ions, readily
generated via laser ionization, can cause significant ion suppression.

The detergent n-octyl glucoside (0.1%) can be added at different stages of sample preparation (e.g.,
digestion or solubilization of digested peptides) to prevent adsorption of peptides on the sample
tube wall and/or pipette tip, thereby yielding increased peptide peak number and improved 
sequence coverage (Katayama et al. 2001). 
METHOD


1. Pipette 0.5 µl of the matrix solution (using a 2-µl pipette for accuracy) onto the sample
wells of the metal sample plates used for MALDI-MS analysis.


2. Immediately add 0.5 µl of the standard or sample to the matrix before it dries.


3. Allow the solvent to evaporate and the samples to dry for ~5 minutes either at room
temperature or in a 40°C oven.


4. Transfer the metal sample plate to the vacuum chamber of the mass spectrometer.


5. Acquire an initial mass spectrum at a laser power well above the ionization threshold to
"warm up" the calibration standard.


6. Decrease the laser power until a good spectrum is obtained.


Steps 5 and 6 apply to MALDI-TOF instruments that are equipped with a high-vacuum
ionization source (e.g., Kompact MALDI IV™, Kratos Analytical/Shimadzu, or equivalent).
For instruments equipped with a CCD camera for viewing the cocrystallized sample (e.g.,
o-MALDI QSTAR™ Pulsar I, Applied Biosystems, or equivalent), select a large crystal for
analysis (e.g., large crystals form when using the matrix DHB; this is not so important when using
HCCA or SA, which usually yield uniformly small crystals). HCCA is preferred as a MALDI
matrix for peptide mapping because it yields the highest sensitivity and forms a uniform
matrix layer on a MALDI sample plate, which makes it amenable for automated analysis
(Beavis et al. 1992). For optimal results, run the unknown samples at approximately the same
laser power as the calibration standard. Although a high laser power may give a good signal,
resolution may be compromised due to peak broadening. The use of a low laser power at the
ionization threshold may give high resolution, but result in poor signal-to-noise ratios. Typically,
the thresholds of ionization for HCCA and DHB are 20 and 30 Joules, respectively. Thus,
HCCA is considered to be a "hotter" matrix than DHB, which gives rise to increased metastable
ion formation and concomitant PSD. A consequence of the latter is broader peak formation
and reduced resolution. Although a sample may appear to be uniform, it is recommended that
different regions of the spot be examined to find "sweet spots," i.e., regions of the sample spot
that give superior signal-to-noise ratios.


7. Obtain a linear external two-point calibration using an appropriate calibrant (refer to
Tables 8.9 and 8.10).


When using HCCA, the matrix-derived ion (e.g., [M+H]+ - OH, m/z 173.2) and the singly
charged ion of the appropriate calibration standard can also be used as calibrants.
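
The idea behind a two-point linear calibration can be sketched in Python as follows. This is a simplified illustration and does not describe the software of any instrument: it assumes that the correction is linear in m/z, whereas instrument software typically calibrates against flight time. The reference masses are the monoisotopic [M+H]+ values for angiotensin II and oxidized insulin B-chain from Table 8.9. The "observed" values and the function name are hypothetical.

```python
# Sketch: two-point linear mass calibration (assumes the correction is linear in m/z).
def two_point_calibration(observed, reference):
    """Return a function mapping measured m/z to calibrated m/z.

    observed and reference are (low, high) pairs for the two calibrant peaks.
    """
    (obs_low, obs_high), (ref_low, ref_high) = observed, reference
    slope = (ref_high - ref_low) / (obs_high - obs_low)
    intercept = ref_low - slope * obs_low

    def calibrate(mz):
        return slope * mz + intercept

    return calibrate


# Reference [M+H]+ values from Table 8.9; the observed values are hypothetical.
calibrate = two_point_calibration(observed=(1046.90, 3495.80),
                                  reference=(1046.542, 3494.651))

if __name__ == "__main__":
    print(f"Measured 1297.20 -> calibrated {calibrate(1297.20):.3f}")
```

Because the correction is interpolated between the two calibrants, choosing standards that bracket the mass of the unknown (as recommended in the Reagents section) keeps the correction within the calibrated range.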


8. Repeat Steps 5 and 6 for unknown samples and compare with the calibration values to
obtain accurate masses.


9. After use, clean the MALDI plate by rinsing thoroughly with H2O to remove any visible
crystals and then with methanol; wipe with lint-free tissues (e.g., Kimwipes).


PROTOCOL 2 


Preparation of Reversed-phase Microcolumns 


MATERIALS 


A POWERFUL PROTEOMIC TOOL FOR IDENTIFYING AND CHARACTERIZING PROTEINS and peptides is 
MALDI-TOF-MS. It is well recognized that sample preparation is a crucial step in MALDI- 
MS analysis of proteins and peptides and that a single component or mixture must be free of 
excipients such as buffers, salts, detergents, or denaturants. One versatile strategy for sample 
cleanup prior to MALDI-MS analysis uses microscale columns designed for direct sample 
elution onto the MALDI target plate. This protocol describes the fabrication of a reversed- 
phase microcolumn designed for this purpose. 

The microcolumns are prepared from GELoader tips essentially as described previously 
(Gobom et al. 1997; Kussmann et al. 1997). This protocol has been optimized for sample 
cleanup prior to MALDI-MS. However, with slight modifications, it works equally well with 
samples destined for ESI-MS. This protocol was provided by Martin R. Larsen (Macquarie 
University, Sydney, Australia). 


CAUTION: See Appendix 3 for appropriate handling of materials marked with <!>. 
IMPORTANT: All reagents used in this protocol must be sequence-grade. 


» Reagents 

Acetonitrile <!> 
Matrix solutions (see note to Step 8) 
Protein matrix solutions: SA <!> or DHB <!> dissolved in 50% acetonitrile containing 0.1% 
TFA <!> 
Peptide matrix solution: HCCA <!> in 70% acetonitrile containing 0.1% TFA 
Methanol <!> 
Trifluoroacetic acid (TFA) (0.1%) or 1% formic acid <!> 
See note to Step 4. 


» Equipment 

Forceps, blunt tip 
GELoader pipette tips (Eppendorf) 
MALDI-MS target 
Pipette tip (disposable, 20-200 µl size) 
Poros R1, R2, or Oligo R3 chromatography resins (Applied Biosystems) <!> 
Syringe (1 ml) 


» Biological Sample 

Protein or peptide sample to be analyzed by MS 


METHOD 


1. Partially constrict a GELoader pipette tip by squeezing the narrow end. The two most 
common ways to do this are illustrated in Figure 8.32 and listed below: 

• Place the narrow end of a GELoader tip flat on a hard surface. Roll a 1.5-ml microfuge 
tube over the final 1 mm of the tip. 

• Squeeze the narrow end of a GELoader tip using blunt forceps. Turn the tip once 
while holding it with the forceps, to close the end. 

[Figure 8.32 panel labels. (a) Microfuge tube or another round device; forceps. (b) Loading 
resin: 20 µl of acetonitrile and 0.5 µl of resin slurry; apply gentle air pressure using a 1-ml 
syringe; wash with 10 of 20 µl of 0.1% TFA. Loading sample, washing, and elution: apply the 
sample on top of the remaining 10 µl of 0.1% TFA and flush through; wash with 20 µl of 0.1% TFA. 
MALDI: elute in very small droplets directly on the target. ESI: elute directly into the 
capillary needle.] 

FIGURE 8.32. Methodology for preparing and using GELoader tip microcolumns. (a) Preparation of a constricted GELoader tip. 
(b) Generation of the column, application of the analyte sample, and elution of the analyte molecules. (Courtesy of Martin R. 
Larsen, Macquarie University.) 


2. Prepare a slurry of 100-200 µl of chromatographic resin (Poros R1, R2, or Oligo R3) in 
70% acetonitrile (use ~1.5 mg of resin/100 µl of acetonitrile). 


Steps 3-8 are illustrated in Figure 8.32b. 


3. Load 20 µl of 70% acetonitrile in the top of the constricted GELoader tip, and add 0.5 µl 
of the resin slurry on top of the acetonitrile. Use a 1-ml syringe fitted to the GELoader tip 
with a disposable pipette tip to gently press the liquid down to create a small column at 
the end of the constricted microcolumn. Dry the column by letting all of the liquid escape 
from the bottom of the column before performing the next step. 


The yellow pipette tip must be cut twice to fit both the syringe and the GELoader tip. The 
amount of resin slurry used to create the column should be varied with the approximate 
concentration of the sample. In general, the column height should be 1-6 mm (approximate bed 
volume is 10-60 nl) when working with peptides generated from low-abundance, gel-separated 
proteins. 


4. Apply 20 µl of 0.1% TFA to the top of the column. Equilibrate the column by gently 
pushing 10 µl of 0.1% TFA through it, using gentle air pressure generated by the syringe. 
The remaining 10 µl of 0.1% TFA should remain on top of the column bed. 


When the microcolumn is used as a cleanup step prior to nano-ESI-MS, 1% formic acid should 
be used instead of TFA, which is incompatible with ESI-MS. 


5. Apply the protein/peptide sample on top of the remaining 10 µl of 0.1% TFA. 


6. Press the liquid gently through the column by applying air pressure with the syringe. Do 
not allow the column to dry out; leave ~2 µl of solution on top of the column bed. 


7. Wash the column with 20 µl of 0.1% TFA, and allow the column to run dry. 


8. Elute the analytes, using 0.5 µl of matrix solution, directly onto the MALDI-MS target. 
The 0.5 µl of matrix solution should be spotted as several droplets (5-10) on the target. 
Alternatively, if the analytes are to be analyzed by ESI-MS, elute the peptides from the 
column using methanol/formic acid/H2O (50:1:49) either directly into the capillary needle 
or into a microfuge tube. 


The preferred MALDI matrices used for eluting protein from the column are either SA or DHB 
dissolved in 50% acetonitrile/0.1% TFA. For eluting peptides from the column, HCCA in 70% 
acetonitrile containing 0.1% TFA is preferentially used. Alternatively, the analytes can be 
eluted directly into a microfuge tube for storage or further analysis, using any percentage of 
organic solvent. When eluted in several small spots, only the first two or three spots will 
contain the analytes, resulting in a further concentration of the sample. 


The column can be reused after washing it extensively with 100% acetonitrile. Depending on 
the size of the column and the abundance/concentration of the analyte molecules that have 
been loaded onto it, the column can be reused two to ten times without observing any memory 
effects. 
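
The quantities given in Steps 2 and 3 lend themselves to a quick calculation when planning a batch of microcolumns. The sketch below is only an illustration: the 1.5 mg per 100 µl figure comes from Step 2, and the factor of 10 nl of bed volume per millimeter of column height is inferred from the note to Step 3 (1-6 mm corresponding to 10-60 nl). The function names are hypothetical.

```python
def resin_mass_mg(slurry_volume_ul, mg_per_100_ul=1.5):
    """Resin to weigh out for a given slurry volume (Step 2: ~1.5 mg per 100 µl)."""
    return slurry_volume_ul * mg_per_100_ul / 100.0

def bed_volume_nl(column_height_mm, nl_per_mm=10.0):
    """Approximate bed volume for a given column height.

    nl_per_mm is inferred from the note to Step 3 (1-6 mm ~ 10-60 nl);
    a differently constricted tip would give a different factor.
    """
    return column_height_mm * nl_per_mm

mass_for_200_ul = resin_mass_mg(200)   # resin needed for the largest slurry volume in Step 2
volume_for_3_mm = bed_volume_nl(3)     # bed volume of a mid-sized column
```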


PROTOCOL 3 


Preparation of Immobilized Enzyme Microcolumns 


PEPTIDE MAPPING BY MALDI-TOF-MS has emerged as a powerful proteomics tool for identifying 
and characterizing proteins. One of the key steps in this method involves the proteolytic 
cleavage of proteins, followed by MS analysis of the generated peptides. A limitation of the 
method, especially for high-throughput proteomics, is the speed and efficiency of proteolytic 
cleavage. 

In this protocol, a method for rapid tryptic digestion using immobilized enzyme 
microcolumns is described. This protocol was provided by Martin R. Larsen (Macquarie University, 
Sydney, Australia). 


MATERIALS 


CAUTION: See Appendix 3 for appropriate handling of materials marked with <!>. 


» Reagents 
NH4HCO3 (50 mM, pH 7.8, sequence grade) <!> 
NH4HCO3 (50 mM, pH 7.8, sequence grade) containing 10% (v/v) acetonitrile <!> 


» Equipment 
Forceps (blunt tip) 
GELoader pipette tips (Eppendorf) 


Immobilized trypsin chromatography resin 
Immobilized enzymes are available from several different manufacturers. However, they are also 
easily prepared using, for example, cyanogen-bromide-activated Sepharose 4B fast-flow media 
(Amersham Biosciences). 


Pipette tip (yellow disposable) 
Syringe (1 ml) 


» Biological Sample 
Protein or peptide sample to be analyzed by MS 


» Additional Reagents 
Step 6 of this protocol may require the reagents listed in Protocol 2 of this chapter. 


METHOD 


1. Partially constrict a GELoader pipette tip by squeezing the narrow end. The two most 
common ways to do this are illustrated in Figure 8.32a and listed below: 

• Place the narrow end of a GELoader tip flat on a hard surface. Roll a 1.5-ml microfuge 
tube over the final 1 mm of the tip. 

• Squeeze the narrow end of a GELoader tip using blunt forceps. Turn the tip once and 
squeeze it with the forceps again to close the end. 


2. Prepare a slurry of the immobilized trypsin resin using ~3 mg of resin per 100 µl of 50 
mM NH4HCO3 (pH 7.8). 


The concentration of the immobilized enzyme resin should be varied with the amount of 
enzyme immobilized onto the resin. 


3. Apply the slurry to the empty GELoader tip (from Step 1). Use a 1-ml syringe fitted to 
the GELoader tip with a yellow pipette tip to gently push the slurry down to create a 
column at the end of the constricted tip. Do not allow the column to dry out. 


The yellow pipette tip must be cut twice to fit both the syringe and the GELoader tip. 
NH4HCO3 (50 mM, pH 7.8) can be used with the majority of enzymes having a pH optimum 
of ~7.5. For enzymes active at other pH optima or buffer conditions, alternative buffers should 
be considered. 


4. Wash the column with several 30-µl aliquots of 50 mM NH4HCO3 (pH 7.8). Do not allow 
the column to dry out during the wash steps. 


Alternatively, the column can be washed with 50 mM NH4HCO3 (pH 7.8) containing 10% (v/v) 
acetonitrile. 


5. Apply the protein/peptide sample to be digested onto the column in 50 mM NH4HCO3 
(pH 7.8). 


6. Use a syringe to slowly and gently press the liquid through the column bed. Collect the 
flowthrough in a microfuge tube for further analysis. Optionally, for very small amounts 
of protein or peptides or very dilute samples, collect the liquid in a micro-reversed-phase 
column for desalting and concentration (see Protocol 2). 


EXPERIMENTAL TIP: The flowthrough from the enzyme column can be re-applied to the col- 
umn several times to ensure a more complete digestion. 


7. If the amount of protein or peptides in the flowthrough is low, or for the sake of 
completeness, wash the column with 5 µl of 50 mM NH4HCO3 (pH 7.8) containing 10% (v/v) 
acetonitrile. The wash can be collected together with the flowthrough. 


8. If picomole amounts of protein (0.02-0.05 µg) or peptides (0.001-0.005 µg) have been 
recovered, mix a small amount of the flowthrough with 2% formic acid and a suitable 
matrix on the MALDI target. If the amount of material is too low, first desalt and 
concentrate the material recovered in Steps 6 and 7 using a reversed-phase microcolumn, as 
described in Protocol 2. 
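
The picomole and microgram figures quoted in the final step are related through molecular weight. As a rough sketch, and assuming (this is not stated in the protocol) that the quoted ranges correspond to about 1 pmole of a 20-50-kD protein or a 1-5-kD peptide, the conversion can be written as follows. The function name is hypothetical.

```python
def micrograms_from_picomoles(pmol, molecular_weight_da):
    """Mass in µg for a given amount in pmol.

    pmol x (g/mol) gives picograms; 1e6 pg = 1 µg.
    """
    return pmol * molecular_weight_da / 1e6

# Assumed molecular weights, chosen to reproduce the ranges quoted in the protocol.
protein_low = micrograms_from_picomoles(1, 20_000)   # 1 pmole of a 20-kD protein
protein_high = micrograms_from_picomoles(1, 50_000)  # 1 pmole of a 50-kD protein
peptide_low = micrograms_from_picomoles(1, 1_000)    # 1 pmole of a 1-kD peptide
```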


PROTOCOL 4 


Preparation and Use of an Integrated Microcapillary HPLC 
Column and ESI Device for Proteomic Analysis 


MASS SPECTROMETRY, AND IN PARTICULAR THE APPLICATION OF ESI coupled on-line with 
high-performance separation techniques such as capillary electrophoresis (CE) and microcapillary 
HPLC (µLC), has had a dramatic effect on the sensitivity and the speed with which the 
primary structure of proteins and peptides can be determined (Aebersold and Goodlett 2001). 
Advances in separation techniques, particularly their implementation in miniaturized for- 
mats on-line with high-performance mass spectrometers (Hunt et al. 1991; Kennedy and 
Jorgenson 1991; Moseley et al. 1991; Wahl et al. 1992, 1993; Emmett and Caprioli 1994) and 
the development of miniaturized sprayers as ESI ion sources (Wahl et al. 1994; Wilm and 
Mann 1994; Emmett and Caprioli 1994; Wilm and Mann 1996) have reduced the amount of 
peptide required for complete and routine sequence characterization from several picomoles 
of peptide (Hunt et al. 1986; Matsudaira et al. 1987) in the mid 1980s to a few femtomoles 
and below by the mid 1990s (Davis and Lee 1997; McCormack et al. 1997; Figeys et al. 1998; 
Lazar et al. 1999; Martin et al. 2000). 

Arguably, much of this gain in sensitivity of the mass spectrometer is due to the combi- 
nation of concentration-dependent (Goodlett et al. 1993) type ionization devices such as ESI 
and on-line capillary separation devices of very small internal diameter (I.D.). The increase 
in sensitivity with small capillary internal diameter can be primarily attributed to a reduced 
mass flow rate of solvents and other background constituents into the ESI source, which 
allows for greater sample ionization efficiency (Wahl et al. 1992, 1993). For reasons of 
robustness, most µLC work is done with 75- or 100-µm I.D. capillary columns that clog less 
frequently than 50-µm I.D. capillary columns. The following protocol describes the preparation 
of an integrated C18-packed capillary column-ESI microspray device. This type of integrated 
device was first reported in 1994 (Emmett and Caprioli 1994) and has recently gained 
popularity from work by Yates and co-workers (Gatlin et al. 1998), most notably as a clever 
biphasic capillary column (Link et al. 1999). One of the key steps in preparing packed µLC columns 
is devising a means to hold the C18-derivatized particles in place. In this protocol, a polyimide 
capillary tip tapered to ~5 µm is used to hold the C18-derivatized particles in place (Emmett 
and Caprioli 1994; Gatlin et al. 1998; Link et al. 1999). An alternative method uses 
underivatized silica particles tamped into place and then sintered in a flame to form a porous frit 
(Hunt et al. 1991; Kennedy and Jorgenson 1991; Moseley et al. 1991). 

This protocol describes the preparation of an integrated microcapillary 
column-microelectrospray ionization device, system suitability testing, and some typical results that 
investigators could achieve with practice. The protocol was provided by David R. Goodlett and 
Eugene C. Yi (Institute for Systems Biology, Seattle, Washington) and Philippe Mottay 
(Brechbuhler, Inc., Spring, Texas). 


MATERIALS 


CAUTION: See Appendix 3 for appropriate handling of materials marked with <!>. 


» Reagents 

Acetic acid <!> 
Acetonitrile <!> 
Ethanol (70%) <!> 
Methanol (50%) <!> 


» Equipment 

Autosampler (e.g., DIONEX FAMOS, Sunnyvale, California) 
C18 trap cartridge (Michrome Bioresources, Auburn, California) 
Capillary cutter (Scientific Instrument Services, Ringoes, New Jersey) 

Derivatized silica resin (5 µm, C18) <!> 
Derivatized silica resin is available from numerous suppliers, including Michrome Bioresources 
(Auburn, California), Western Analytical (Murrieta, California), and Appelera (Framingham, 
Massachusetts). 

Divert valve (six-port) 
Forceps 
Helium gas (pressurized) <!> 

Hex driver 
Used to seal the pressure cell. 

HPLC system (e.g., the Agilent 1100 HPLC) 
Mass spectrometer (e.g., ThermoFinnigan ITMS, San Jose, California) 

MicroCross with 0.006-inch through-holes (Upchurch Scientific, Oak Harbor, Washington) 
A micro-electrospray ionization device available from Brechbuhler, Inc. (Spring, Texas) is used to 
hold the MicroCross in place. 

Microscope (low-resolution) 
pH paper 
Polyimide-coated capillary (75 µm I.D. x 360 µm O.D.) (Polymicro Technologies, Phoenix, 
Arizona) 

Pressure cell (Brechbuhler, Inc., Spring, Texas) 
The pressure cell is used for capillary packing (see Figure 8.33). 

Propane torch 
Regulator (two-stage, high-pressure) 
Sonicator 
Teflon ferrules (Chromatography Research Supplies, Louisville, Kentucky) 


» Biological Sample 

Protein sample of interest 


Microcapillary Column Construction 


A microcapillary column inside a polyimide capillary has one end manually tapered to 5 µm 
to serve as a frit. The tapered end also serves as the point from which ESI emanates. A 
low-resolution microscope will be needed to monitor capillary packing and the tapering process. 


1. Cut a 30-cm long piece of 75 µm I.D. x 360 µm O.D. polyimide-coated capillary. Hold one 
end of the capillary with forceps and remove 2-3 cm of polyimide by passing it through a 
flame from a propane torch. Pull the capillary slowly to make a straight tapered tip. 


2. Place the tapered end on a clean index finger (rinse it with 70% ethanol) and carefully trim 
the end with a capillary cutter. Rinse the tapered tip with 70% ethanol. To ensure good ESI 
performance, inspect the tip under a low-resolution (5-10x magnification) microscope to 
ensure that the tapered tip is blunt (i.e., square and not jagged) and ~5 µm across. 


There is an art to manually tapering the tip so be patient. Alternatives to manually tapering the 
tip are to purchase the tips from New Objectives Inc. (Woburn, Massachusetts) or to purchase 
a laser puller from Sutter Instruments Inc. (Novato, California). 


3. Use a spatula to place ~50-100 µl (dry volume) of 5-µm C18-derivatized silica resin in 
a 1.7-ml microfuge tube. 

EXPERIMENTAL TIP: Due to the low back pressure of POROS (Appelera) beads, it may be 
preferable for the novice to begin with these. 

4. Add 500 µl of 70% ethanol to the C18 resin, vortex to mix, and sonicate it briefly (i.e., no 
more than 1 minute). 

EXPERIMENTAL TIP: Due to the occurrence of apparently irreversible changes in the C18 
beads that adversely affect separation, prepare the slurry daily. 

5. Place the tube containing the slurry in the pressure cell (Figure 8.33). 


Forceps are useful for handling the slurry-filled tube. The chamber is drilled to accept tubes of 
various lengths so it may be necessary to place some Kimwipes in the bottom to bring the tube 
containing the slurry to a height inside the chamber that is easy to use. 


The viscosity of the solvent can be modified (in Step 5) to affect the packing rate. 


FIGURE 8.33. Pressure cell. The cell is shown with a polyimide-coated 
capillary coming out the top and the high-pressure valve for turning gas 
in the cell on/off in the front. 


6. Thread the blunt untapered end of the capillary into the top of the pressure cell; i.e., 
through the nut, the Teflon ferrule, and the top of the pressure cell (Figure 8.33). The end 
of the capillary inside the pressure cell should sit about halfway down into the 500 µl of 
slurry. Looking across the top of the pressure cell chamber, it is possible to observe when 
the capillary begins to bend and thus determine that it is at the bottom of the vial. Pull it 
up ~0.5 cm. Close the top of the cell, making sure that the O-ring is in place, and then 
tighten all four bolts with the hex driver. Now secure the capillary in place by tightening 
the nut on top of the cell. 


WARNING: Always secure all four bolts before turning on the helium gas. If a leak occurs, turn 
off the gas and re-seat the O-ring or replace it with a new one and try again. 


7. Set the two-stage, high-pressure regulator to 1000 psi. The capillary column is now ready 
to be packed. Slowly turn on the pressure in the cell by turning the valve 180°. Pack the 
capillary to 10 cm and stop, or pack the entire length of the capillary. Trim the length of 
the capillary column as desired. 


If the beads do not appear to flow, try one or all of the following: 

• Turn the pressure inside the cell on/off several times to dislodge an air pocket in the 
slurry. 

• Vortex the slurry again; the slurry may have settled to the bottom of the tube. 
Alternatively, add a micro-stir bar and perform the packing on a magnetic stir plate. This 
keeps the slurry suspended, but will not be necessary with experience. Use of the stir bar 
can fragment the C18 beads so it is best to prepare the slurry fresh daily. 

• Cut off a 0.5-cm length of capillary inside the cell and reposition; it may have become 
clogged. 

• Hold the capillary taut between the top of the cell and the microscope stage and 
vigorously flick the capillary so that it vibrates like a violin string between a finger and the top 
of the pressure cell. 


8. When the packing has come to within 0.5-1.0 cm of the desired length, turn off the 
pressure inside the cell. Remove the slurry and replace it with a vial of 0.1 M acetic acid. Seal 
the cell, turn the pressure on, and let the capillary rinse at least until the effluent turns 
acidic (check with pH paper). This wash will finish the packing because there remains 
some unpacked silica in the back of the capillary. 


9. The column can be stored indefinitely by placing the end inside a 15-ml polypropylene 
tube filled with 50% methanol and sealing the top. 
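
The capillary dimensions used in Steps 1 and 7 correspond to very small internal volumes, which helps explain why so little slurry is consumed and why flow rates are expressed in nl/min. The sketch below computes the empty-tube volume of a cylindrical capillary; it is a geometric estimate only, because the liquid volume of a packed bed is smaller than the empty-tube volume. The function names are hypothetical.

```python
import math

def capillary_volume_nl(inner_diameter_um, length_cm):
    """Empty-tube volume of a cylindrical capillary, in nanoliters."""
    radius_cm = inner_diameter_um * 1e-4 / 2.0        # 1 µm = 1e-4 cm
    volume_cm3 = math.pi * radius_cm ** 2 * length_cm
    return volume_cm3 * 1e6                            # 1 cm3 = 1 ml = 1e6 nl

def time_to_flush_min(volume_nl, flow_nl_per_min):
    """Minutes needed to pass one tube volume at a given flow rate."""
    return volume_nl / flow_nl_per_min

packed_section = capillary_volume_nl(75, 10)   # 75 µm I.D., packed to 10 cm (Step 7)
full_length = capillary_volume_nl(75, 30)      # the 30-cm piece cut in Step 1
```

Because the volume scales with the square of the internal diameter, a 100-µm capillary of the same length holds nearly twice the volume of a 75-µm capillary.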


The following are some general comments on the selection of derivatized silica and 
capillary dimensions. In general, the larger the I.D. of the capillary, the longer the column can 
be packed without exceeding the HPLC operating back pressure. The complete novice 
should start out with Appelera’s POROS IIR packing, which is very easy to work with 
because it generates lower back pressure than other particles, due to a large pore size. 
Investigators who branch out to more specific applications will find that there are many 
derivatized beads from which to choose. In addition, be aware that particle size directly 
affects back pressure and chromatographic fidelity. In general, 5-µm particles provide 
good results. Many of these final caveats can be readily explained by seeking out an expert 
on chromatography. 


MICROSPRAYER SETUP 


* The capillary column is ready for testing and use if it passes the system suitability test. 
Figure 8.34 shows a µESI source with an integrated capillary column-ESI sprayer such 
as the one just packed, with the blunt end of the packed capillary column inside the 
Upchurch all-PEEK four-way union at Port 1. The tapered tip should be placed to within 
1-2 cm of the MS orifice and will be the point from which ESI emanates due to the 
high voltage applied via a platinum lead at Port 2 in Figure 8.34, which for this particular 
setup will be ~2 kV. 


The sample is loaded and washed while the six-port divert valve is open so that flow 
enters the four-way union at Port 3 and exits at Port 4. After suitable washing, the divert 
valve closes and flow is redirected to the C18 capillary column in Port 1. This type of 
setup using a trap cartridge provides extended life for the separation column. The device 
shown in Figure 8.34 can be used without the trap cartridge in Port 3 if desired, but 
remember to keep the six-port divert valve closed if not using the trap. 


* Begin by running fast linear gradients, i.e., 0-60% acetonitrile in 15 minutes. Most 
investigators, after becoming comfortable with the loading and elution, find that the 
capillary columns work well with all sorts of gradients. 


Flow rate through the column will vary depending on internal diameter and packed 
length. For a 75-µm capillary column packed to 7 cm, the flow rate can be as low as 150 
nl/min when optimized. However, investigators should begin with higher flow rates (e.g., 
500 nl/min) until they are comfortable with the system and how flow rate affects ESI 
stability/sensitivity. To compensate for the decrease in flow that occurs with a decrease in 
viscosity (i.e., higher-percentage acetonitrile has lower viscosity relative to H2O) when 
using a restrictive flow splitter of constant length, the investigator may program 
increasing flow rate at the HPLC pump over the duration of the linear acetonitrile gradient. 


EXPERIMENTAL TIP: The same setup can be used with an empty tapered tip and a syringe 
pump for tuning the mass spectrometer. For optimal MS performance, the MS should be tuned 
under conditions as nearly identical to operation conditions as possible. 
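
The gradient and flow figures in this section can be turned into two small calculations: the solvent composition at a given time in a linear gradient, and the split ratio implied by a pump flow and a column flow. The sketch below uses the 0-60% acetonitrile in 15 minutes gradient suggested above, and the 100 µl/min pump flow and 300 nl/min column flow reported for the yeast analysis later in this protocol. The function names are hypothetical, and the split ratio that is actually achieved must still be set empirically.

```python
def percent_b(time_min, start_b=0.0, end_b=60.0, duration_min=15.0):
    """%B of a linear gradient at a given time, clamped to the gradient window."""
    t = min(max(time_min, 0.0), duration_min)
    return start_b + (end_b - start_b) * t / duration_min

def split_ratio(pump_flow_ul_per_min, column_flow_nl_per_min):
    """Ratio of pump flow to column flow, with both expressed in nl/min."""
    return pump_flow_ul_per_min * 1000.0 / column_flow_nl_per_min

halfway = percent_b(7.5)            # composition halfway through the fast gradient
ratio = split_ratio(100.0, 300.0)   # pump at 100 µl/min, 300 nl/min reaching the column
```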


System Suitability Test 


Once the LC-MS system is set up, it must be tested routinely to determine if all of the 
components are functioning properly before any precious sample is analyzed. If the system is 
routinely tested using the same conditions and it fails, then logical systematic procedures can be 
followed to correct the problem. Thus, before using the LC-MS setup, it is recommended that 
a system suitability check be carried out. This will mean analysis of a sample that is 
well-characterized. Often, the easiest sample is a purified peptide, but it could be any sample that is 
well-characterized. For LC-MS systems, there are four simple questions to ask to assess the system 
performance. These follow along with some idea of what system parameters to monitor: 


1. Is the mass spectrometer calibrated? Routinely compare observed mass to theoretical 
mass (Figure 8.35a). 


2. Are the HPLC and µLC column functioning properly? Compare measured to expected 
retention times, monitor back pressure at the HPLC, and calculate peak width of a single 
analyte by plotting the single ion current trace (Figure 8.35b). 


FIGURE 8.34. Diagram of an automated µTrap-µLC-ESI-MS/MS system. A tapered ESI emitter packed 
with C18 (Port 1), a high-voltage lead (Port 2), a miniature C18-packed peptide µTrap cartridge (Port 3), and 
a six-port divert valve (Port 4) were connected to a PEEK four-way union. Peptides were loaded for 
desalting on the C18 µTrap cartridge with the six-port divert valve open. When closed, the flow is redirected to the 
ESI emitter/C18 µLC column. 


3. Are the solvents (and sample) clean? Measure signal to noise (Figure 8.35c) at a point 
where an analyte elutes by summing mass spectra across the point of elution. An analyte 
may "drag" along uncharacterized components, by electrostatic and hydrophobic 
interactions, that decrease the ratio of signal to noise; aging solvents in general increase the noise 
floor. 


4. Is the mass spectrometer tuned properly? Measure the ratio of signal to noise at the point 
when an analyte elutes. Because mass spectrometric data are not absolute, monitoring the 
signal-to-noise ratio will be more meaningful than signal response alone. 


All of these questions can be answered using data from a single LC-MS analysis per- 
formed prior to analyzing real samples. To provide a basis for comparing operating perfor- 
mance of multiple instruments within and between laboratories, it is best to designate a sin- 
gle system suitability test. It should be an analyte in the same class as that being analyzed and 
be well-characterized. For example, the data shown in Figure 8.35 are from injection of 500 
fmoles of neurotensin onto a µLC column as described above and analyzed using an ion-trap 
mass spectrometer. The results obtained are analyzed in light of the above four questions. 


• If the HPLC system has functioned properly to deliver solvents as programmed, then the 
elution time for the peptide will be reproducible. 


• If elution time is outside of the expected range, then either the HPLC system is not 
functioning as programmed or perhaps the column is beginning to fail. 


• If the peak shape is not symmetrical or too broad (it should be ~10-30 seconds for a 
0.5-1.0-pmole injection of peptide standard) (Figure 8.35), then there may be a problem 
with the packing material, or dead volume has been introduced somewhere in the µLC 
column. 


• If the HPLC-MS system is working properly, then the signal-to-noise ratio of the analyte 
should be above an empirically determined level. An example of how to calculate this is 
shown in Figure 8.35c, where the ordinate axis is expanded for demonstration purposes to 
be 200x of normal. The ratio of signal to noise is estimated by expanding the ordinate axis 
to the point at which individual charge states are obscured by the noise. If the signal-to-noise 
ratio of the analyte is below an acceptable value, then perhaps the solvents should be 
changed, or the mass spectrometer requires re-tuning, or the lenses involved in focusing ions 
must be cleaned. When a lens becomes coated with even a thin invisible film (e.g., from 
repeated analysis of samples), then the applied voltage is no longer equal to the field voltage 
experienced by the ions during transmission, and a decrease in performance (e.g., signal to 
noise) is observed. 


• If the calculated and observed molecular weights for the analyte do not match within the 
tolerance of the mass spectrometer, the instrument may simply need to be cleaned and 
tuned, or it may need to be tuned and calibrated after cleaning. 


[Figure 8.35 panels: chromatographic trace, relative abundance versus time (min), with the 
peak marked as 20 sec wide at 1/2 height; mass spectra, relative abundance versus m/z 
(500-1700), with the ion at m/z 837 (+2) labeled.] 

FIGURE 8.35. Example of multiple-charged peptide mass spectrum. The three most abundant charge states 
of neurotensin (a), the chromatographic total ion current trace at the point where neurotensin eluted 
showing an average peak width at half-height (b), and the same mass spectrum as in a but with the ordinate axis 
expanded 200x normal to demonstrate how to check for signal/noise (c). 


Following these simple steps will help the novice operator troubleshoot the complex 
LC-MS setup. As a challenge, the best sensitivity for a peptide standard will be ~5 attomoles 
loaded on a column. 
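
Several of the suitability checks above reduce to simple arithmetic on the test data. The sketch below shows one way to compute the expected m/z of each charge state, the mass error in parts per million, and a numerical signal-to-noise estimate. It rests on assumptions that are not part of the protocol: the neutral mass used for neurotensin (~1672.9, an approximate average mass) is supplied here only for illustration, the baseline-averaging noise estimate is a simple substitute for the visual axis-expansion method described above, and all function names are hypothetical.

```python
PROTON_MASS = 1.007276  # Da

def mz_for_charge(neutral_mass, z):
    """Expected m/z of the [M + zH]z+ ion."""
    return (neutral_mass + z * PROTON_MASS) / z

def mass_error_ppm(observed, theoretical):
    """Signed difference between observed and theoretical values, in ppm."""
    return (observed - theoretical) / theoretical * 1e6

def signal_to_noise(signal_height, baseline_heights):
    """Signal height divided by the mean absolute height of baseline points."""
    noise = sum(abs(h) for h in baseline_heights) / len(baseline_heights)
    return signal_height / noise

# Assumed approximate average mass of neurotensin (not given in the protocol).
NEUROTENSIN_MASS = 1672.9
expected = {z: mz_for_charge(NEUROTENSIN_MASS, z) for z in (1, 2, 3)}
```

Under this assumed mass, the doubly charged ion falls near the m/z 837 peak labeled in Figure 8.35.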


Analysis of a Complex Yeast Lysate 


Proteins from a yeast whole-cell lysate were subjected to proteolysis with trypsin and pre- 
pared for MS analysis as follows: 


• Protein-containing pellets from total yeast cell lysates were resuspended in 0.5 ml of 50 mM 
NH4HCO3 (pH 8.3) and 0.5% SDS and solubilized by incubating them for 30 minutes at 
60°C with occasional vortexing. The resulting mixture was diluted to 0.05% SDS with 
50 mM NH4HCO3 (pH 8.3). Modified trypsin was added at an enzyme:substrate ratio of 
100:1, and the samples were incubated overnight at 37°C. 


• Prior to LC-MS analysis, the resulting peptides were purified by an OASIS MCX cartridge 
according to the manufacturer's protocol. 


• Acidified peptide mixtures were injected onto a C18 trap cartridge connected to Port 3 (see 
Figure 8.34) from a DIONEX FAMOS autosampler, and the flow was directed through the 
six-port divert valve (Port 4) to waste. 


• After a 5-minute washing period at 100% A, the divert valve closed, redirecting flow 
through the 10 cm x 75 µm I.D. µLC column (Port 1) that also served as the ESI emitter. 
Simultaneous to the redirection of flow to the capillary column, the linear gradient flow of 
acetonitrile (Solvent B) from an Agilent 1100 HPLC began. To decrease chemical noise in 
the mass spectrometer, no acid was included in Solvent B. The linear gradient was formed 
over 60 minutes from 2% B to 45% B. The flow rate at the pump was initially 100 µl/min. 


• After restrictive flow splitting using an empty piece of polyimide-coated capillary (the 
dimensions of which must be set empirically) placed between the autosampler and the spray 
source, the flow at the device shown in Figure 8.34 was 300 nl/min. The Upchurch Scientific 
MicroCross with 0.006-inch through-holes shown in Figure 8.34 was held in place using a 
device (from Brechbuhler, Inc., Spring, Texas) that mounted on a ThermoFinnigan ITMS. 


• ESI was initiated by delivering high voltage to Port 2. Ion selection for CID was via an 
automated routine that used top-down data-dependent ion selection of the three most intense 
ions in a survey scan, including a 3-minute dynamic exclusion period to prevent 
reselection of previously selected ions, and was repeated continuously throughout the 
µLC-ESI-MS/MS analysis. 


• A total of 1 µg of the subsequent complex peptide mixture, inferred from measured 
protein concentration, was separated using the µLC column described above, and CID of the 
eluting peptides was conducted in series using an ion-trap mass spectrometer. The 
analytical setup was identical to that shown in Figure 8.34. 


• Peptide ions for CID were selected from a broad m/z range of 400-2000 by a 
data-dependent routine that included dynamic exclusion for 3 minutes to prevent immediate 
reselection of the most intense ions already subjected to CID. 


• Yeast proteins were identified from the tandem mass spectra of peptides by a database 
search using SEQUEST (Eng et al. 1994). Figure 8.36 shows an example of a typical base 
peak trace from such an experiment. 
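
The data-dependent ion selection described in the list above (the three most intense ions per survey scan, an m/z window of 400-2000, and a 3-minute dynamic exclusion) can be sketched as a small selection routine. This is an illustration of the general logic only, not the instrument vendor's implementation: the function name, the data layout, and the m/z matching tolerance are all assumptions.

```python
def select_precursors(survey_scan, scan_time_min, excluded, top_n=3,
                      exclusion_min=3.0, mz_range=(400.0, 2000.0), mz_tol=0.5):
    """Pick up to top_n precursor m/z values for CID from one survey scan.

    survey_scan: list of (mz, intensity) pairs.
    excluded: dict mapping previously selected m/z to the time it was selected;
              it is updated in place.
    mz_tol is a hypothetical matching tolerance for the exclusion list.
    """
    # Release ions whose exclusion period has elapsed.
    for mz in [m for m, t in excluded.items() if scan_time_min - t >= exclusion_min]:
        del excluded[mz]
    # Keep ions inside the m/z window that are not currently excluded.
    candidates = [
        (mz, intensity) for mz, intensity in survey_scan
        if mz_range[0] <= mz <= mz_range[1]
        and not any(abs(mz - ex) <= mz_tol for ex in excluded)
    ]
    # Most intense ions first.
    candidates.sort(key=lambda pair: pair[1], reverse=True)
    chosen = [mz for mz, _ in candidates[:top_n]]
    for mz in chosen:
        excluded[mz] = scan_time_min
    return chosen

# Hypothetical survey scan: (m/z, intensity) pairs.
scan = [(500.0, 10), (600.0, 50), (700.0, 30), (800.0, 40), (2500.0, 99)]
exclusion_list = {}
first = select_precursors(scan, 0.0, exclusion_list)
second = select_precursors(scan, 1.0, exclusion_list)
```

In the second call, the three ions chosen in the first scan are still excluded, so the routine moves on to the less intense ion that would otherwise never be sampled.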


Triplicate µLC-ESI-MS/MS analysis of this sample identified 401 proteins by requiring all 
correctly matched peptide sequences to have tryptic amino and carboxyl termini and using a 
standard set of scores known to give correct identifications (Keller 2002). The peptide tan- 
dem mass spectra used to identify two different proteins are shown in Figure 8.36 along with 
their SEQUEST Xcorr and dcorr scores. Currently, data from a SEQUEST database search 
must be manually confirmed by an expert analyst even after applying the above specified 
scoring criteria because the top-scoring SEQUEST score is not always the correct peptide 
sequence match. However, in the future, analysts using SEQUEST and other "expert" routines 
will have access to programs that screen tandem mass spectral quality prior to a database 
search (Moore et al. 2000) and provide probabilities of the correctness of a database match 
such as QSCORE (Moore et al. 2002). These types of tools will greatly advance the expert's 
ability to process large amounts of data. 
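
The requirement that matched peptides have tryptic amino and carboxyl termini can be illustrated with a short sketch. This is only a simplified illustration, not part of SEQUEST: the function name and the example sequence are hypothetical, and the rule used (cleavage after K or R, with the protein termini also accepted) ignores refinements such as suppressed cleavage before proline.

```python
def is_fully_tryptic(protein: str, peptide: str) -> bool:
    """Return True if `peptide` occurs in `protein` with tryptic termini at both ends.

    Simplified rule (an assumption for illustration): trypsin cleaves after K or R,
    and the protein's own amino and carboxyl termini also count as valid ends.
    """
    if not peptide:
        return False
    start = protein.find(peptide)
    while start != -1:
        end = start + len(peptide)
        # Amino terminus: the peptide starts the protein or follows K/R.
        n_ok = start == 0 or protein[start - 1] in "KR"
        # Carboxyl terminus: the peptide ends in K/R or ends the protein.
        c_ok = end == len(protein) or peptide[-1] in "KR"
        if n_ok and c_ok:
            return True
        # Look for a later occurrence of the same peptide.
        start = protein.find(peptide, start + 1)
    return False


# Hypothetical sequence, used only to exercise the function.
PROTEIN = "MKTAYIAKQRQISFVK"
print(is_fully_tryptic(PROTEIN, "TAYIAK"))  # both termini tryptic
print(is_fully_tryptic(PROTEIN, "AYIAK"))   # amino terminus is not tryptic
```

A filter of this kind would be applied to database search output before the score thresholds, so that only fully tryptic candidates are considered further.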
FIGURE 8.36. Yeast peptides analyzed by µLC-ESI-IT-MS. All soluble proteins from yeast were digested with trypsin, and 1 µg
total of the peptide mixture was analyzed by the µTrap-µLC-ESI-IT-MS/MS setup shown in Figure 8.34 (a). (b,c) Examples of peptide
tandem mass spectra and the corresponding peptide sequences identified by a sequence database search using SEQUEST,
along with the Xcorr and ΔCn scores used by experts to judge the correctness of the sequence match.


PROTOCOL 5 


Analysis of Complex Protein Mixtures Using Nano-LC 
Coupled to MS/MS 


NANO-LC COUPLED TO TANDEM MASS SPECTROMETRY (nano-LC-MS/MS) permits the
rapid and sensitive determination of protein-protein interactions (see Chapter 10). By using 
a specific purification technique such as coimmunoprecipitation or affinity purification in 
conjunction with nano-LC-MS/MS not only are proteins identified, but specific protein-pro- 
tein interactions are elucidated as well. This was first demonstrated in Saccharomyces cerevisi- 
ae by using a macromolecular complex as “bait” to enrich for proteins that interacted with the 
complex (McCormack et al. 1997). The macromolecular complex was generated by polymer- 
izing bovine tubulin and then incubating it with a yeast cell lysate. The lysate was removed, 
and the microtubule complex was treated with two different concentrations of salt to elute dif- 
ferent sets of interacting proteins. These mixtures of proteins were enzymatically digested and 
subjected to nano-LC-MS/MS and database searching, which identified the proteins. In a sec- 
ond example, the yeast mitochondrial nucleoid complex, which is responsible for the packag- 
ing and the maintenance of mitochondrial DNA (Meeusen et al. 1999), was analyzed by 


* fractionating a mitochondrial lysate using differential sucrose centrifugation,
* treating the enriched nucleoids to free the proteins from the DNA,
* subjecting them to a sucrose cushion,
* digesting the protein mixture with trypsin, and
* analyzing the fractions by nano-LC-MS/MS.


Thirty-four proteins were identified (J. Nunnari, unpubl.) including one, mgm101, 
which was determined to be responsible for the repair of oxidatively damaged mitochondri- 
al DNA (Meeusen et al. 1999). This protocol was provided by David M. Schieltz, Michael P. 
Washburn, and Lara G. Hays (The Torrey Mesa Research Institute, San Diego, California).


MATERIALS 


CAUTION: See Appendix 3 for appropriate handling of materials marked with <!>. 


Reagents
Acetonitrile (HPLC grade) <!> 
Acetonitrile (5%)/5% formic acid <!> 


HPLC solvents 
Solvent A: 5% acetonitrile and 0.02% heptafluorobutyric acid (HFBA) <!> 


Solvent B: 80% acetonitrile and 0.02% HFBA 
Methanol (HPLC grade) <!>

Equipment

Alcohol burner 

Binary or quaternary pump (e.g., series 1100, Agilent Technologies) 

C18 reversed-phase packing material (5 µm) (e.g., Zorbax XDB, Agilent Technologies)
CAUTION: Do not inhale; use in a chemical fume hood.

C18 solid-phase extraction pipette tips (e.g., SPEC Plus PT C18, ANSYS Technologies, Lake
Forest, California)
These C18 solid-phase disk pipette tips have a 0.4-µg sorbent capacity and a loading volume of up to 800 µl.

Ceramic scribe 


Fused-silica capillary (100 µm I.D. x 365 µm O.D.)
The capillary can be obtained from Agilent Technologies or Polymicro Technologies. 


Gold wire (0.025 diameter) (Scientific Instrument Services, Inc., Ringoes, New Jersey) 
Graduated glass capillaries 

Helium tank with regulator (at least 1000 psi pressure) <!> 

High-performance liquid chromatography equipment 

Laser puller (e.g., P-2000, Sutter Instruments) 

Nano-LC ion sources (ThermoFinnigan, The Scripps Research Institute, and Cytopea, Inc.) 
PEEK MicroCross, Microtight tubing sleeves (Upchurch Scientific) 

Stainless steel pressurization bomb (The Scripps Research Institute and Cytopea, Inc.) 


Tandem mass spectrometer 
A variety of tandem mass spectrometers are suitable from a number of manufacturers (e.g.,
ThermoFinnigan and Micromass, Inc.).


Biological Sample
Protein fractions 


METHOD 


Preparation of a Nano-LC Column 


1. Make a window in the center of an ~12-15-inch, 100 x 365-µm fused-silica capillary by
holding it over an alcohol flame until the polyimide coating has been charred (see Figure
8.37). Remove the charred material by wiping the capillary with a tissue soaked in
methanol (see Figure 8.38).


Unlike an alcohol burner, a Bunsen burner flame is too hot and will seal the inside of the capillary. 


2. To pull a needle, place the window portion of the capillary into the P-2000 laser puller
(Figure 8.39). Position the window in the mirrored chamber of the puller, where the laser
will concavely focus and melt the fused silica.

Arms on each side of the mirror have grooves and small vises, which properly align the fused
silica and hold it in place. Table 8.11 includes typical parameters for pulling ~3-µm tips from
a 100 x 365-µm capillary.


TABLE 8.11. Typical parameters for pulling ~3-µm needle tips from 100 x 365-µm fused-silica capillary


Cycle   Heat   Filament   Velocity   Delay   Pull
1       320    0          40         200     0
2       310    0          30         200     0
3       300    0          25         200     0
4       290    0          20         200     0


These values will differ slightly from instrument to instrument. For a very useful section on the construction of capillary nee- 
dles, see the Sutter Instrument P-2000 manual. 
FIGURE 8.37. Preparation of a window in the
fused-silica capillary. The capillary is held over 
an alcohol flame to char the polyimide coating. 
The length of the charred portion is ~1-3 cm.


FIGURE 8.38. Charred portion of the coating
on the capillary is removed by wiping it away
with a tissue soaked in methanol. All of the
burned polyimide coating must be removed to
prevent deposition on the laser puller's retro-mirror
when the laser is focused onto the newly
made window.


FIGURE 8.39. The fused-silica capillary is 
placed into the laser puller to produce two 
pulled needle capillaries. The windowed area of 
the capillary is placed within the “shroud” con- 
taining the retro-mirror. The ends of the capil- 
lary are fastened within the vises to hold the 
capillary in position. 
3. Pack the pulled needle capillary with C18 reversed-phase packing material using a
stainless steel pressurization bomb.


WARNING: Always wear safety glasses during this step. In the event that the capillary has not 
been seated properly, it will be forced out of the bomb at great velocity due to the pressure. 


This packing step is similar to the method described by McCormack et al. (1994).


a. Place ~5 mg of C18 reversed-phase packing material into a 1.7-ml microfuge tube and
add ~1 ml of methanol. Shake the tube to suspend the particles and place it into the
bomb. Secure the bomb lid by tightening the five bolts.


It is very important to secure all five bolts. Both the lid of the bomb and the bomb itself
have grooves that fit an O-ring; this provides an air-tight seal when the lid is seated 
against the bomb. 


WARNING: High-pressure gas will escape violently if the lid is not secured tightly. 


b. The lid has a Swagelok fitting containing a Teflon ferrule. Feed the fused-silica, pulled 
needle capillary down through the ferrule until the end of the capillary reaches the 
bottom of the microfuge tube. Tighten the ferrule to secure the pulled needle capil- 
lary. Figure 8.40 shows the pressurization bomb and the microcapillary to be packed 
with reversed-phase material. 


c. Apply pressure to the bomb by first setting the regulator on the helium gas cylinder 
to ~400-800 psi and then opening a valve on the bomb to pressurize it. 


The packing material will begin filling the pulled needle capillary. This now becomes the 
capillary microcolumn. To achieve good chromatographic separation, pack the capillary 
with 10-15 cm of material. 


4. Attach the needle to the HPLC system through an Upchurch PEEK MicroCross. For a layout
of the connections for the Upchurch MicroCross, see Figure 8.41.


* The first connection point of the cross contains the transfer line from the HPLC
pump. This consists of a 50 x 365-µm fused-silica capillary, where the length is sufficient
to reach from the HPLC pump to the mass spectrometer.


FIGURE 8.40. Pressurization device or “bomb” with 
high-pressure line and valve. The inner portion of the 
bomb contains an open area to allow a microfuge tube 
to stand upright with the cap open. The bomb and the 
lid have a groove, which holds a viton O-ring to ensure 
a high-pressure seal when the lid is tightened down. 
The lid contains five holes for bolts and in the center, a 
Swagelok fitting. Within the fitting sits a Teflon ferrule 
that allows the capillary to be inserted down into the 
bomb and into the microfuge tube. The ferrule is tight- 
ened to hold the capillary in place and provide a high- 
pressure seal. 
FIGURE 8.41. Layout of the Upchurch
MicroCross. The first connection at the 
bottom is for a transfer line to bring the 
solvent flow from the HPLC pump to the 
MicroCross. Moving clockwise to the sec- 
ond connection is the split line, which is 
used to control the final flow rate of the sol- 
vent through the microcolumn. The next 
connection is to hold a small section of 
gold wire, which makes electrical contact 
with the solvent. The final connection is 
where the microcolumn is attached. 


* The second connection point contains a length of fused-silica capillary that is used as
a split line. This split line allows a majority of the flow to exit through the split; therefore,
very low flow rates can be achieved through the packed capillary microcolumn.
The size and length of this section of capillary depend on the flow rate from the pump
and the length of the microcolumn. A good starting point is to use a 2-foot section of
100 x 365-µm capillary for the split line (but see Step 6).


* The third connection point contains a section of gold wire, which will raise the voltage
of the solvent entering the needle from zero to ~1800 V, thus allowing electrospray
to occur.


* The fourth connection point is for the packed capillary microcolumn.
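
The effect of the split line can be made concrete with a little arithmetic. The sketch below is only illustrative: the function names are hypothetical, the flow values are the ones quoted in Steps 5 and 6 of this protocol, and the Hagen-Poiseuille scaling (resistance proportional to length divided by the fourth power of inner diameter) is an assumption that applies to the open split-line capillary, not to the packed microcolumn.

```python
def split_ratio(pump_flow_ul_min: float, tip_flow_nl_min: float) -> float:
    """Ratio of flow leaving through the split line to flow through the microcolumn."""
    tip_flow_ul_min = tip_flow_nl_min / 1000.0
    return (pump_flow_ul_min - tip_flow_ul_min) / tip_flow_ul_min


def relative_resistance(length: float, inner_diameter: float) -> float:
    """Relative flow resistance of an open capillary (arbitrary units).

    Assumes laminar flow in an open tube (Hagen-Poiseuille), so the
    resistance scales as length / inner_diameter**4.
    """
    return length / inner_diameter ** 4


# Pump at 150 ul/min with 200 nl/min measured at the tip.
print(round(split_ratio(150.0, 200.0)))

# Halving the split-line length halves its resistance, so more flow exits
# through the split and less passes through the microcolumn.
print(relative_resistance(1.0, 100.0) / relative_resistance(2.0, 100.0))

# A 50-um split line resists flow far more than a 100-um line of equal length,
# which forces more flow through the microcolumn.
print(relative_resistance(1.0, 50.0) / relative_resistance(1.0, 100.0))
```

Because the resistance depends on the fourth power of the inner diameter, changing the bore of the split line is a much coarser adjustment than trimming its length.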


5. Before loading the sample, equilibrate the column by pushing the methanol out with
100% Solvent A for 5 minutes at a flow rate of 150 µl/min at the pump.


6. After 5 minutes, measure the flow from the tip of the capillary microcolumn, using graduated
glass capillaries. The target flow rate at the tip should be ~100-300 nl/min. If the
flow rate is above this value, use a ceramic scribe to cut off a portion of the split line capillary.
This will allow more of the flow to exit out of the split and less flow through the
microcolumn. If the flow is less than 100 nl/min, use a longer piece of capillary or a section
with a smaller inner diameter to force more flow through the microcolumn.


Measuring the flow rate and adjusting the split line may have to be repeated a number of times
until the target flow rate is reached.


Concentration of Sample


Peptide samples can be solubilized in any number of reagents, including Tris, ammonium
bicarbonate, acetic acid, formic acid, and urea. However, peptide samples are typically the
product of a digested protein or protein mixture, in which a variety of reagents may be present,
including some that will interfere with the performance of the reversed-phase column
and the mass spectrometer (see the Experimental Tip below Step 11).

Peptide sample volumes range from several microliters to 1 ml or more. The bed volume
of the microcolumn is ~1.5 µl, which allows samples up to 50 µl to be loaded directly onto
the column. For sample volumes greater than 50 µl, concentration of the sample (as detailed
in Steps 7-11) is necessary.


7. Wet the SPEC Plus PT C18 solid-phase extraction pipette tip with 1 ml of methanol as follows.
Push approximately half of the methanol through the disk, and then wait for 15-30
seconds to allow the disk to activate. Push the remainder of the methanol through, but
do not push air through the disk.


8. Equilibrate the disk with 1 ml of Solvent A by pushing it through the disk.


9. Pull the peptide solution into the pipette tip and then push the solution back out.


This can be repeated two to three times. Peptides remain in the tip and are concentrated onto
the disk. The flowthrough can be discarded.


10. Elute the peptides with ~100 µl of 90% acetonitrile/0.5% acetic acid. Push the eluting solvent
through the disk, pull the solution back up through the disk, and then push it
through one final time, to give three passes across the disk.


11. Use a vacuum concentrator to remove the acetonitrile until the peptides are nearly dry.
Resuspend the peptides in ~10-15 µl of 5% acetonitrile/5% formic acid. The sample is
ready to load onto the reversed-phase capillary microcolumn.


EXPERIMENTAL TIP: Avoid detergents whenever possible or remove them prior to loading
the sample onto the reversed-phase column. Once a detergent enters the column, it will bind
to the reversed phase and "leak off" in the elution gradients, contaminating subsequent analyses.
Detergents ionize more readily than peptides and therefore will mask any peptide ions. If
the sample contains solubilization or fractionation agents such as urea, glycerol, or sucrose, 
loading may be somewhat difficult due to the viscosity of the solution. In such cases, once the 
sample is loaded, wash the column extensively with 100% Solvent A to remove these chemicals 
before starting the gradient. 


Loading the Sample onto the C18 Column


12. Centrifuge the samples in a microfuge at 14,000 rpm for 10 minutes to pellet any solid
material.


13. If any pellet is visible, transfer the supernatant (peptide sample) to a fresh microfuge
tube. Even minute amounts of solid material will plug the microcolumn. Place the tube
containing the peptide sample into the pressurization bomb, and tighten the lid to the
bomb (for instructions on tightening the bomb lid, see Step 3a).


14. Feed the reversed-phase capillary microcolumn down through a Teflon ferrule until the
end of the capillary reaches the bottom of the microfuge tube. Tighten the ferrule to
secure the capillary microcolumn.


15. Set the regulator on the gas cylinder to 400-800 psi, and then pressurize the bomb by
opening the valve on the bomb that connects to the gas cylinder. The peptide sample will
begin to flow into the column.


16. Measure the loaded volume at the tip of the needle using a graduated glass capillary.


17. When 8-10 µl are loaded, release the pressure, remove the column from the bomb, and
place it back into the Upchurch MicroCross.
Ion-source Setup


18. Place the Upchurch MicroCross with the connections into a stage, which is designed in 
this case for the ThermoFinnigan LCQ series mass spectrometer. This stage performs a 
threefold purpose: 


* Supports the MicroCross and holds it in place along with the connections.


* Insulates the MicroCross from electrical contact with its surroundings when it is held 
at high-voltage potential. 


* Allows for fine position adjustment of the microcolumn with respect to the entrance 
of the mass spectrometer (heated capillary) by using an XYZ manipulator. 


Figure 8.42 shows the stage comprising the Plexiglas support, MicroCross, high-voltage connec- 
tion, microcolumn, and entrance to the mass spectrometer. Plastic tabs with small Teflon screws 
are used to hold the connections in place. The potential needed to raise the solvent in the cross
and the microcolumn to 1800 V is delivered through an insulated cable that attaches to an alu- 
minum block seated in the Plexiglas support. Electrical contact is made when the plastic tab is 
tightened down onto the gold wire, which presses down onto the aluminum. This allows only 
the MicroCross to be energized and not the XYZ manipulator or metal support plate. 


19. Position the microcolumn using the XYZ manipulator so that the needle tip is ~2-5 mm 
from the orifice of the mass spectrometer's heated capillary, and set the voltage at 1.5-1.8 
kV. 


FIGURE 8.42. Movable Plexiglas stage containing nano-LC electrospray ion source. The Upchurch
MicroCross with HPLC connections is held in position in the Plexiglas stage with plastic tabs. The solvent 
enters the MicroCross from the transfer line of the HPLC pump. A majority of the flow leaves the cross 
through the split line, but a small fraction moves through the microcolumn toward the opening of the mass 
spectrometer. An insulated cable supplies the high voltage that is connected to the aluminum portion of the 
stage. The aluminum makes contact with the gold wire, energizing the solvent flowing through the cross. 
This provides a large voltage potential between the tip of the microcolumn and the opening of the mass 
spectrometer, allowing electrospray ionization to occur. An XYZ manipulator is used to provide fine
positioning of the microcolumn with respect to the entrance of the mass spectrometer.
HPLC Programming


20. Program the HPLC system as follows: 


* Start the gradient with 100% Solvent A (0% Solvent B) and a flow rate of 150 µl/min.


* Over a period of time (call it X minutes), ramp up the concentration of Solvent B to
60% along with an increase in the flow rate to 250 µl/min.


* Set a ramp-down period of 5 minutes, during which the concentration of Solvent B
returns to 0% and the flow rate slows to 150 µl/min.


As a guide, if the sample contains ~15-20 proteins, (X) should be a 30-minute gradient; if the
sample contains ~40-50 proteins, (X) would be either a 60- or 90-minute gradient. The gradient
profile can be programmed through the ThermoFinnigan Xcalibur software or the
Micromass Inc. MassLynx software, depending on the mass spectrometer being used.
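
The gradient in Step 20 can be written down as a short table of breakpoints. The following is a minimal sketch, not Xcalibur or MassLynx syntax: the function names are hypothetical, and it assumes that Solvent B changes linearly between breakpoints, which the protocol does not state explicitly.

```python
def gradient_program(ramp_min: float):
    """Return (time_min, percent_B, pump_flow_ul_min) breakpoints for Step 20.

    `ramp_min` is the ramp time X chosen from the guide above (e.g., 30, 60, or 90).
    """
    if ramp_min <= 0:
        raise ValueError("ramp time must be positive")
    return [
        (0.0, 0.0, 150.0),             # start: 100% Solvent A
        (ramp_min, 60.0, 250.0),       # end of ramp: 60% Solvent B
        (ramp_min + 5.0, 0.0, 150.0),  # end of the 5-minute ramp-down
    ]


def percent_b_at(program, t_min: float) -> float:
    """Percent Solvent B at time t_min, assuming linear ramps between breakpoints."""
    for (t0, b0, _), (t1, b1, _) in zip(program, program[1:]):
        if t0 <= t_min <= t1:
            return b0 + (b1 - b0) * (t_min - t0) / (t1 - t0)
    raise ValueError("time outside the programmed gradient")


program = gradient_program(30.0)
for row in program:
    print(row)
print(percent_b_at(program, 15.0))  # halfway up the 30-minute ramp
```

The total run time returned here (X + 5 minutes) is also a lower bound for the MS acquisition time entered in Step 21c, which should additionally allow for dead volume in the lines.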


Tandem Mass Spectrometry Analysis 


Data-dependent acquisition of tandem mass spectra during the HPLC gradient is also programmed
through the LCQ Xcalibur or the QTOF2 MassLynx software. Guidance is provided
here for setting the parameters for data-dependent acquisition using the ThermoFinnigan
LCQ system. For experimental design of data-dependent tandem mass spectra acquisition for
the QTOF2, refer to the MassLynx manual (version 3.5).

The settings in Step 21 are for a typical data-dependent MS/MS acquisition analysis. The
method consists of a continual cycle beginning with one scan of MS (scan one), which
records all of the m/z values of the ions present at that moment in the gradient, followed by
two rounds of MS/MS. The initial MS/MS scan is of the first most-intense ion recorded from
the MS scan. The second MS/MS scan is of the second most-intense ion recorded from scan
one. Dynamic exclusion is activated to improve the protein identification capacity during the
analysis (described in Step 21i).


21. Use the settings below to create a typical data-dependent MS/MS method. 


a. In the main Xcalibur software page, select Instrumental Setup.


b. In the next window, select the button labeled Data Dependent MS/MS.


c. The next window will contain the design parameters for the acquisition setup (see
Figure 8.43). At the top, input the time of the acquisition.


Typically, the time of the acquisition is the same as the duration of the HPLC gradient
plus time to allow for dead volume in the lines.


d. The Segments setting is generally set to 1 and the Start Delay setting for the acquisition
is set to 0.


e. The Scan Event is set to 3. The first scan event is a full scan of MS, and the next two
scan events are MS/MS, to allow the mass spectrometer to perform one round of MS
followed by two rounds of MS/MS, of the first and second most-intense peaks. These
parameters will be set in the following sections.


f. Highlight the bar showing Scan Event 1 Settings. Below this, check Normal Mass
Range as Normal, Scan Mode as MS, and Scan Type as full. Set the Mass Range to
400-1400, the Polarity to Positive, and the Data Type to Centroid. This range generally
gives good coverage for peptides from a tryptic digest.


g. The Tune Method box specifies the path for a file containing the parameters for the
electrostatic lenses and ion trap. These parameters are established through the "LCQ
Tune" as described in the ThermoFinnigan LCQ Getting Started Manual.


h. Highlight the bar labeled as Scan Event 2, check the box next to Dependent Scan, and
then click the Settings button. A window will come up where the parameters can be
set for all of the MS/MS scan events (Global and/or Segment) and for individual
MS/MS scan events.


i. At the side of the window, press the Global button. These are parameters that are
common among all segments and scan events. Several tabs will be visible.


* The Global tab sets the values for both the MS and the MSn data-dependent masses;
these are left at the default settings.


* In the next tab (Mass Widths), for high-throughput protein identification, the
values are all left at the default settings.


* The Dynamic Exclusion tab contains several parameters that are important for
effective data-dependent tandem mass analysis. Dynamic exclusion prevents an
abundant peptide ion, with a broad elution profile, from being continually selected
for MS/MS. The intense ions prevent other lower-abundance peptides from
being selected. Ions that are selected for MS/MS are placed onto a list. While on this
list, they will be excluded for a period of time so that other ions may be selected.

The Enable box is checked and typical settings include a Repeat Count of 2, a
Repeat Duration of 0.5 minute, and an Exclusion Duration of 3 minutes.


The Exclusion Mass Width is set for "by mass" and a width of 3 daltons is set
around the peak, 0.8 dalton on the low-mass side, and 2.2 daltons on the high-mass
side. This ensures that the entire isotopic distribution of the peak is put
into the exclusion list.


These parameters would be appropriate for a 30-minute gradient. If a longer
gradient were used, such as a 90-minute gradient, then the Exclusion Duration
could be increased to 10 minutes.


j. The Analog tab and the Isotopic Data Dependence tab are not enabled and are not 
used in the analysis. 


k. Again, on the left side, press the Segment button to bring up the parameters for the 
current segment (in this case, there is only one segment containing three scan events). 


* Under the Current Segment tab, leave blank the Parent Masses and the Reject
Masses and therefore do not check the box next to Most Intense If No Parent Mass
Found.


* The Normalized Collision Energy is set at 35%. This is the amount of energy
delivered to the peptide to cause fragmentation.


* The Activation Q and Activation Time are left at their default settings of 0.250
and 30, respectively.


* The Default Charge State is set to 2, which makes the assumption that all peptides
entering the mass spectrometer are doubly charged.


* The Min MS Signal (10° counts) is set to 10, which is the signal intensity needed
for a peak recorded in an MS scan to trigger the mass spectrometer to select this
peak for MS/MS. The Min MSn Signal (10* counts) is set to 0.5; this is the threshold
to trigger the mass spectrometer to perform higher-order MS/MS. In this type
of analysis, no higher-order MS/MS is programmed into the segments so this
value does not get utilized.


* The last value is the Isolation Width, which is the window around the ion in the
MS scan that is selected for MS/MS. This is typically set to 2.5 daltons. The
Add/Sub tab is not enabled.


l. Press the Scan Event button to access the last set of values, which defines parameters
for this specific scan event: Scan Event 2. The user can then select the ranking of ion
abundances per MS scan (Scan Event 1) to be selected for MS/MS. In this case, check
the Nth Most Intense Ion and set to 1 (this is the first most-intense ion). After pressing
the okay button, repeat the process by going back to the "method page" and pressing
the Scan Event 3 bar, checking Dependent Scan and pressing the Settings button.
The Scan Event button is then selected. Use Previous List of Ions is checked so the
selection for MS/MS is from Scan 1 and is set to select the second most-intense ion.
Press okay to accept.


During the gradient, the mass spectrometer will continually cycle through these three
scans collecting tandem mass spectra in a data-dependent fashion. 
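
The interplay between the three-scan cycle and dynamic exclusion can be sketched in a few lines of code. This is only a simplified model of the behavior described in Step 21, not the instrument's actual logic: the class and function names are hypothetical, the defaults are the typical settings quoted above (Repeat Count 2, Repeat Duration 0.5 minute, Exclusion Duration 3 minutes, exclusion window 0.8 dalton below and 2.2 daltons above the peak), and peaks in a survey scan are represented as plain (m/z, intensity) pairs.

```python
from dataclasses import dataclass, field


@dataclass
class DynamicExclusion:
    """Simplified dynamic exclusion list (times in minutes, masses in daltons)."""
    repeat_count: int = 2
    repeat_duration: float = 0.5
    exclusion_duration: float = 3.0
    low_width: float = 0.8
    high_width: float = 2.2
    _history: list = field(default_factory=list)   # (mz, time) of past selections
    _excluded: list = field(default_factory=list)  # (mz, expiry time)

    def _in_window(self, center: float, mz: float) -> bool:
        return center - self.low_width <= mz <= center + self.high_width

    def is_excluded(self, mz: float, now: float) -> bool:
        return any(now < expiry and self._in_window(center, mz)
                   for center, expiry in self._excluded)

    def record(self, mz: float, now: float) -> None:
        """Log a selection; exclude the ion once it has been picked repeat_count times."""
        self._history.append((mz, now))
        recent = [t for center, t in self._history
                  if now - t <= self.repeat_duration and self._in_window(center, mz)]
        if len(recent) >= self.repeat_count:
            self._excluded.append((mz, now + self.exclusion_duration))


def select_precursors(survey, now, exclusion, top_n=2, min_signal=0.0):
    """Pick up to top_n most-intense, non-excluded peaks from one survey scan."""
    picked = []
    for mz, intensity in sorted(survey, key=lambda peak: peak[1], reverse=True):
        if len(picked) == top_n:
            break
        if intensity < min_signal or exclusion.is_excluded(mz, now):
            continue
        picked.append(mz)
    for mz in picked:
        exclusion.record(mz, now)
    return picked


# Hypothetical survey scan repeated over several cycles.
survey = [(500.0, 100.0), (600.0, 80.0), (700.0, 10.0)]
exclusion = DynamicExclusion()
for t in (0.0, 0.1, 0.2, 0.3, 0.4):
    print(t, select_precursors(survey, t, exclusion))
```

In this toy run the two most abundant ions are fragmented twice and then excluded, which lets the weaker ion at m/z 700 be selected; this is the effect that dynamic exclusion is meant to achieve for lower-abundance peptides.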
DATABASE SEARCHING
tures, such as cell lysates, nonspecific cleavages can occi
would be missed in the database search. If tryptic per 

trypsin, this adds a degree of confidence in the result 
Search results that lead to positive identification of a 


* High-value SEQUEST Xcorr scores.
* Peptides that are tryptic.


In pao far tA PRES RANE 
tides, good scores start at 2.4.an gou 


larger than 0.08 (Eng et al. 1994). 
It OMNCM to spec dii the 


In these cases, spectra were manually confi 
SEQUEST results fit criteria described by f nk ı 
(1) any given MS/MS spectrum had to. lea 
must be continuity to a b or y ion series ( 


picis dier d ular gee dis : 
> 




IDENTIFICATION OF PROTEINS INTERACTING WITH THE SACCHAROMYCES CEREVISIAE
PROTEASOME BY NANO-LC-MS/MS AND DATABASE SEARCHING


The yeast 26S proteasome is a complex of ~32 proteins contained in a 19S regulatory cap and
a 20S core. This complex is responsible for ubiquitin-dependent proteolysis, but the assembly
of the components in the complex and the mechanism of function are not well understood. A
one-step affinity purification protocol of the 26S proteasome was developed to look at the proteins
that interact with the 26S proteasome and the 19S cap in both the absence of ATP and the
presence of the ATP analog ATP-γ-S, which is much more difficult to hydrolyze. Direct analysis
of large protein complexes (DALPC) (Link et al. 1999), also referred to as multidimensional
protein identification technology (MuDPIT), was used to identify these proteins without
prior separation by gel electrophoresis (Verma et al. 2000). A method for identifying interacting
proteins using 1D nano-LC coupled to MS/MS is given in Protocol 5.


1. The proteasome samples (20 µg from 375 µg/ml) were buffer exchanged into 50 mM Tris-Cl
(pH 8.5) containing 8 M urea.


2. Digestion was initiated by adding 0.3 µg of endoproteinase Lys-C and incubating in the dark
for 4 hours at 37°C.


3. The samples were diluted fourfold by the addition of 50 mM Tris-Cl (pH 8.5) to reduce the
urea concentration to 2 M, where trypsin still maintains activity. In some cases, when proteins
are solubilized in 8 M urea (without prior digestion with Lys-C), the proteins precipitate
upon dilution to 2 M urea. Lys-C therefore provides a useful method for altering the tertiary
structure of some proteins enough that there should not be any aggregation when the
solution is diluted to 2 M urea.


4. CaCl2 (10 mM) was added to a final concentration of 1 mM along with 0.2 µg of trypsin. The
samples were incubated in the dark overnight at 37°C.


5. The samples each were separately loaded onto a nano-LC column after a 5-minute equilibration
in a manner similar to that described in Protocol 5, Steps 12-17.


6. Once the sample was loaded, the column was washed for ~5 minutes. The HPLC pump was
set to ramp the gradient from 2% to 60% Buffer B over a 90-minute period. During this
time, the ThermoFinnigan LCQ Classic was programmed to perform four rounds of MS.
The first was to take an MS scan from m/z 400 to 1400 to measure masses and intensities of
ion signals. This scan was followed by three MS/MS scans: MS/MS of the first most-intense
ion recorded in the MS scan, then MS/MS of the second most-intense ion, and then MS/MS
of the third most-intense ion. This cycle repeated itself for the duration of the gradient. The
exclusion option was set to exclude an ion for 5 minutes after it had been selected.


7. Tandem mass spectra were extracted from the LCQ's raw file and searched using the
SEQUEST program. The search was done against the yeast ORF (open reading frame) database
that is publicly available from Stanford University. Common contaminant sequences
such as keratin, IgG, trypsin, and Lys-C were added to this database. Criteria for a positive
identification followed the method described in the panel above on DATABASE SEARCHING.
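
The volumes implied by Steps 1, 3, and 4 follow from simple dilution arithmetic, sketched below. The function names are hypothetical, and the 90-µl working volume in the last line is an assumed figure chosen only to illustrate the calculation; the protocol does not state the sample volume after buffer exchange.

```python
def sample_volume_ul(mass_ug: float, conc_ug_per_ml: float) -> float:
    """Volume (in ul) that contains `mass_ug` of protein at the given concentration."""
    return mass_ug / conc_ug_per_ml * 1000.0


def diluted_concentration(initial: float, fold: float) -> float:
    """Concentration after an n-fold dilution (same units as `initial`)."""
    return initial / fold


def stock_volume_to_add(current_volume: float, stock_conc: float, final_conc: float) -> float:
    """Volume of stock to add so that the mixture reaches `final_conc`.

    Solves stock_conc * v = final_conc * (current_volume + v) for v.
    """
    if final_conc >= stock_conc:
        raise ValueError("final concentration must be below the stock concentration")
    return current_volume * final_conc / (stock_conc - final_conc)


print(round(sample_volume_ul(20.0, 375.0), 1))  # ul of stock holding 20 ug (Step 1)
print(diluted_concentration(8.0, 4.0))          # M urea after fourfold dilution (Step 3)
print(stock_volume_to_add(90.0, 10.0, 1.0))     # ul of 10 mM CaCl2 for an assumed 90-ul sample (Step 4)
```

The middle line reproduces the figure given in Step 3: a fourfold dilution of 8 M urea gives 2 M.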


The analysis determined that 71 proteins copurified with the cap in the absence of ATP. Of
these proteins, 19 were known 19S subunits, and the remaining 52 proteins were identified as
possible proteasome-interacting proteins. Upon subsequent analysis, 25 were discounted
because they were considered to be nonspecific interactors. This left 27 potential proteasome-interacting
proteins, which were validated by generating fusion proteins with either myc or
hemagglutinin epitopes for coimmunoprecipitation; 24 of these were categorized as novel proteasome
subunits, chaperones, regulatory proteins (including transcriptional regulators), and
abundant proteins (Verma et al. 2000).


PROTOCOL 6 


Analysis of Complex Protein Mixtures Using Multidimensional 
Protein Identification Technology 


THE ANALYSIS OF HIGHLY COMPLEX MIXTURES of proteins, like the ribosomes of yeast (Link et
al. 1999) or the yeast proteome (Washburn et al. 2001), requires resolution capabilities 
beyond that of LC-MS/MS. 2D-PAGE followed by MS is the most commonly used method 
for resolving and identifying proteins from such highly complex mixtures (Hanash 2000; 
Pandey and Mann 2000). 2D-PAGE (see Chapter 4) separates proteins in one dimension by 
their pI values and in the other dimension by their molecular masses. However, proteins with 
extremes in pI and molecular mass (Corthals et al. 2000; Oh-Ishi et al. 2000), low-abundance
proteins (Fountoulakis et al. 1999a,b; Gygi et al. 2000b), and membrane-associated or bound 
proteins (Molloy 2000; Santoni et al. 2000) are rarely identified in a 2D-PAGE study. 

Like 2D-PAGE, an alternative 2D separation system must subject proteins or peptides to 
two independent separation modalities and must during the second dimension maintain the 
separation of any two components resolved during the first dimension (Giddings 1987). A 
variety of efforts are under way to utilize multidimensional chromatography coupled with 
MS to characterize proteomes (Nilsson and Davidsson 2000; Washburn and Yates 2000). Link 
et al. (1999) developed an on-line method called multidimensional protein identification 
technology (MuDPIT), which couples 2D-LC to MS/MS, to resolve and identify peptides 
from complex mixtures. In this method, a pulled capillary microcolumn is packed with two 
independent chromatography phases, a strong cation exchanger and a reversed-phase matrix
material. Once a complex peptide mixture is loaded onto the system, no additional sample 
handling is required, because as the peptides elute from the column, they are directed into 
the ESI ion-trap mass spectrometer, where they are ionized, mass selected, and fragmented. 
Finally, advanced search algorithms match the fragmented peptides to their respective pro- 
teins in a database. Using MuDPIT, 1484 proteins and 5540 peptides from the S. cerevisiae 
proteome were resolved and identified, including low-abundance proteins such as protein 
kinases and transcription factors, proteins with extremes in pI and molecular mass, and inte-
gral membrane proteins (Washburn et al. 2001). For an alternative method for performing
2D-LC-MS, see Appendix 2. 

This protocol, which was kindly provided by David M. Schieltz and Michael P. Washburn 
(The Torrey Mesa Research Institute, San Diego, California), describes the analysis of a pro-
teome using MuDPIT.


MATERIALS 


CAUTION: See Appendix 3 for appropriate handling of materials marked with <!>. 


Reagents
Acetonitrile (5%)/0.5% acetic acid <!>


Ammonium bicarbonate (1 M) <!> 
This solution should be freshly made each time. 




CaCl2 <!>
Chromatography solvents
Solvent A: 5% acetonitrile/0.02% heptafluorobutyric acid (HFBA, Pierce) <!>
Solvent B: 80% acetonitrile/0.02% HFBA
Solvent C: 250 mM ammonium acetate <!>/5% acetonitrile/0.02% HFBA
Solvent D: 500 mM ammonium acetate/5% acetonitrile/0.02% HFBA
Dithiothreitol (DTT) <!>
Endoproteinase Lys-C (sequencing grade)
Iodoacetamide (IAA) <!>


Modified trypsin (sequencing grade) or Poroszyme-immobilized trypsin (Applied Biosystems)
See Step 7.


Urea <!>


Equipment


C18 solid-phase extraction pipette tips (e.g., SPEC Plus PT C18, ANSYS Technologies, Lake
Forest, California)


These C18 solid-phase disk pipette tips have a 0.4-µg sorbent capacity and a loading volume of up
to 800 µl.


Gold wire (0.025 diameter) (Scientific Instrument Services, Inc., Ringoes, New Jersey)
High-performance liquid chromatography equipment 
Quaternary HPLC pump (e.g., a quaternary Hewlett-Packard 1100 series) 


Fused-silica capillary (100 µm I.D. x 365 µm O.D.)
The capillary can be obtained from Agilent Technologies or Polymicro Technologies. 


Laser puller (e.g., P-2000, Sutter Instruments)
Stainless steel pressurization bomb (The Scripps Research Institute and Cytopea, Inc.)
Helium tank with regulator (at least 1000 psi pressure) <!>
C18 reversed-phase packing material (5 µm) (e.g., Zorbax XDB, Agilent Technologies)
CAUTION: Do not inhale; use in a chemical fume hood. 
Strong cation exchange resin (e.g., PartiSphere SCX, Whatman) 
Nano-LC ion sources (ThermoFinnigan, Scripps Research Institute, and Cytopea, Inc.) 
PEEK MicroCross, Microtight tubing sleeves (Upchurch Scientific) 


Tandem mass spectrometer 
A variety of mass spectrometers are suitable, including the LCQ Classic, Deca, Duo, or TSQ series
(ThermoFinnigan), or the QTOF1 or QTOF2 (Micromass, Inc.).


Water bath preset to 50°C 


Biological Sample


Soluble protein fractions
If proteins found in insoluble fractions are to be analyzed using MuDPIT, begin with the ADDI-
TIONAL PROTOCOL: DIGESTION OF INSOLUBLE PROTEIN FRACTIONS FOR MuDPIT ANALYSIS.


Additional Reagents


Step 1 of this protocol requires the reagents listed in Appendix 2, Techniques 1 or 2. 




METHOD 


Digestion of Soluble Protein Extracts for MuDPIT 
1. Adjust the pH of the protein extract to 8.5 with 1 M ammonium bicarbonate. Determine
the protein concentration. 


The concentration of the protein mixture is needed for future reference and to determine the 
amount of protease to add in Steps 5 and 7. 


2. Add solid urea to the protein solution to a final concentration of 8 M. 


3. Add DTT to the protein solution to a final concentration of 1 mM. Incubate the solution 
for 20 minutes at 50°C. 


4. Allow the solution to cool to room temperature. Add iodoacetamide to the denatured 
protein to a concentration of 10 mM. Incubate the solution for 20 minutes at room tem-
perature in the dark.


5. Add endoproteinase Lys-C to a final ratio of substrate to enzyme of 100:1 and incubate 
the reaction overnight at 37°C. 


6. Dilute the sample to 2 M urea with 100 mM ammonium bicarbonate (pH 8.5). Add CaCl2
to a final concentration of 1 mM. 


7. Digest the protein mixture with trypsin using either of the following two methods: 
For Method A: 


a. Add modified trypsin to the protein solution to a final ratio of substrate to enzyme of
50:1. Incubate the reaction overnight at 37°C. 


b. Centrifuge mixture in a microfuge at maximum speed to remove insoluble material. 
c. Transfer the supernatant to a fresh microfuge tube.
For Method B: 


a. Add ~1 µl of Poroszyme immobilized trypsin slurry to every 50 µg of protein start-
ing material. Incubate the reaction overnight at 37°C on a rotating wheel.


b. Centrifuge the reaction in a microfuge at maximum speed to remove insoluble mate- 
rial and the trypsin beads. 


c. Transfer the supernatant to a fresh microfuge tube. 


Method B has the advantage that there is less trypsin contaminating the protein sample 
during the subsequent analysis. 


8. Load the supernatant onto a SPEC Plus PT C18 solid-phase extraction cartridge accord-
ing to the manufacturer's instructions. Exchange the supernatant into 5% acetoni-
trile/0.5% acetic acid.

This essential step removes buffer components from the complex peptide mixture (i.e., ammonium
bicarbonate, urea, etc.) that are incompatible with ESI-MS/MS. Removal of salt from the
sample is critical, because salt prevents the peptides from binding to the strong cation-exchange
portion of the 2D column. The peptides will bypass the ion-exchange resin and bind to the
reversed-phase material, effectively reducing the 2D-LC to a single-dimension reversed-phase
column. Salts are used to bump populations of peptides from the cation exchanger to the
reversed-phase resin in a controlled manner. A sample containing 1 M salt will fail in a MuDPIT
analysis. In addition, the extraction cartridge concentrates the sample, which is important
because it can take ~1.5 hours to load 15 µl onto the 2D microcolumn.
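The reagent amounts in Steps 5 to 7 scale with the protein content measured in Step 1. The following is a minimal sketch of that arithmetic, not part of the original protocol. It assumes that the substrate-to-enzyme ratios are weight ratios and that the sample volume is measured after the urea has dissolved; the function name and the 500-µg example are illustrative only.

```python
def digestion_plan(protein_ug: float, sample_volume_ul: float) -> dict:
    """Reagent amounts for the Lys-C/trypsin digest of a soluble extract."""
    return {
        # Step 5: endoproteinase Lys-C at substrate:enzyme = 100:1 (assumed w/w)
        "lys_c_ug": protein_ug / 100.0,
        # Step 7, Method A: modified trypsin at substrate:enzyme = 50:1 (assumed w/w)
        "trypsin_ug": protein_ug / 50.0,
        # Step 7, Method B: ~1 µl of immobilized trypsin slurry per 50 µg of protein
        "poroszyme_ul": protein_ug / 50.0,
        # Step 6: going from 8 M to 2 M urea is a 4-fold dilution,
        # so three sample volumes of 100 mM ammonium bicarbonate are added
        "diluent_ul": sample_volume_ul * (8.0 / 2.0 - 1.0),
    }


# Hypothetical example: 500 µg of protein in 100 µl of 8 M urea
plan = digestion_plan(protein_ug=500.0, sample_volume_ul=100.0)

# Loading 15 µl in ~1.5 hours corresponds to this average flow (nl/min)
loading_rate_nl_min = 15.0 * 1000.0 / 90.0
```

The loading rate of roughly 170 nl/min is why concentrating the sample on the extraction cartridge matters: every additional microliter adds about 6 minutes of loading time.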


To carry out MuDPIT, a system must be assembled with a tandem mass spectrometer and a
quaternary HPLC pump. In our case, a quaternary Agilent HP1100 series HPLC is directly
coupled to a ThermoFinnigan LCQ Deca ion-trap mass spectrometer equipped with a nano-
LC-ESI source (Gatlin et al. 1998).


9. Prepare the column as described in Protocol 5. In summary:


a. Pull a fused-silica capillary microcolumn (100 µm I.D. x 365 µm O.D.) with a P-2000
laser puller.
b. Pack the microcolumn with 10 cm of C18 reversed-phase material in methanol.


c. Pack the column with 4 cm of 5 µm strong cation-exchange material in methanol.


10. Connect the column to an Upchurch PEEK MicroCross (for details, see Protocol 5, Step
4, and Figure 8.41) to which is also attached

* a gold wire to supply a spray voltage of ~1800 V,

* a microcapillary split line to provide an effective flow rate of 100-300 nl/min, and

* a solvent line from the HPLC.


11. Measure the flow from the tip of the microcolumn by following Protocol 5, Step 6.


12. Equilibrate the column with 100% Solvent A for 5 minutes at a flow rate of 150 µl/min
at the pump.


13. Follow Protocol 5, Steps 12-17 to load the sample onto the column.


Because MuDPIT columns are longer than single-phase LC microcolumns, a pressure of
~1000 psi is required to load the sample.


14. Place the MicroCross with the connections into a stage, as per Protocol 5, Steps 18 and
19, and Figure 8.42.


15. Begin the MuDPIT analysis of the peptide sample.

The key feature of MuDPIT is the 2D capillary microcolumn chromatography, which performs
better than when the resins are in separate columns and the buffers shunted through the
columns. By having the beds integrated into the same column, the sensitivity is much higher
and the sample losses are much fewer. The chromatography is controlled by the mass spec-
trometer itself, through the use of Xcalibur software. A typical analysis is a fully automated, 15-
step chromatography run on highly complex mixtures, but any number of steps can be applied
to a column. The gradient of each individual step is built within the Instrument Setup portion
of Xcalibur (refer to Protocol 5, Steps 20 and 21). The following is a typical 15-step setup:

Step 1
  76-minute gradient from 0% to 80% Solvent B
  10 minutes held at 80% Solvent B

Steps 2-13
  Each of these steps follows the sequence:
  5 minutes of 100% Solvent A
  2 minutes of x% Solvent C*
  3 minutes of 100% Solvent A
  10-minute gradient from 0% to 10% Solvent B
  90-minute gradient from 10% to 45% Solvent B
  *The 2-minute Solvent C percentages (x) in Steps 2-13 are as follows: 10%, 20%, 30%, 40%,
  50%, 60%, 70%, 80%, 90%, 90%, 100%, and 100%, respectively.

Step 14
  5-minute 100% Solvent A wash
  20-minute 100% Solvent C wash
  5-minute 100% Solvent A wash
  10-minute gradient from 0% to 10% Solvent B
  90-minute gradient from 10% to 45% Solvent B

Step 15
  5-minute 100% Solvent A wash
  20-minute 100% Solvent D wash
  5-minute 100% Solvent A wash
  10-minute gradient from 0% to 10% Solvent B
  90-minute gradient from 10% to 45% Solvent B

Also within the Instrument Setup of Xcalibur and along with the building of an individual step 
are built the MS and MS/MS acquisition rules (see Steps 20 and 21 in Protocol 5). Typically, the 
instrument is operated here in a manner very similar to that in Steps 20 and 21, Protocol 5; 
however, instead of the top two ions being selected for MS/MS, the top three ions are selected 
during MuDPIT analysis. 
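To get a feel for the instrument time that the 15-step setup occupies, the program can be written out as data and summed. This is only an illustrative sketch: the segment labels are informal, the function names are hypothetical, and the salt concentration of each pulse assumes that the balance of the mobile phase during the pulse is Solvent A, which the protocol does not state explicitly.

```python
# Solvent C percentages for the 2-minute salt pulses in Steps 2-13
SALT_PULSES = [10, 20, 30, 40, 50, 60, 70, 80, 90, 90, 100, 100]


def build_program():
    """Return the 15 steps as lists of (segment description, minutes)."""
    steps = [[("gradient 0-80% B", 76), ("hold 80% B", 10)]]          # Step 1
    for pct in SALT_PULSES:                                           # Steps 2-13
        steps.append([
            ("100% A", 5),
            (f"{pct}% C", 2),
            ("100% A", 3),
            ("gradient 0-10% B", 10),
            ("gradient 10-45% B", 90),
        ])
    for solvent in ("C", "D"):                                        # Steps 14, 15
        steps.append([
            ("100% A wash", 5),
            (f"100% {solvent} wash", 20),
            ("100% A wash", 5),
            ("gradient 0-10% B", 10),
            ("gradient 10-45% B", 90),
        ])
    return steps


def total_minutes(steps):
    """Sum the duration of every segment in every step."""
    return sum(minutes for step in steps for _, minutes in step)


def pulse_ammonium_acetate_mM(pct_c):
    """Ammonium acetate in a pulse, assuming Solvent C (250 mM) is mixed with Solvent A."""
    return 250.0 * pct_c / 100.0


program = build_program()
run_hours = total_minutes(program) / 60.0
```

Under these assumptions the run totals 1666 minutes, a little under 28 hours of continuous acquisition, which is consistent with the remark that substantial computational resources are needed for the resulting data.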


SEQUEST ANALYSIS 


The SEQUEST (Eng et al. 1994) output from a MuDPIT run is very complex, and substantial com- 
putational resources are required to generate and analyze this output. Due to the large number of 
spectra typically generated from a MuDPIT run, MS/MS data must be analyzed in a conservative 
fashion. The SEQUEST algorithm is run on any data set generated against the pertinent protein, 
DNA, or EST database, depending on the organism analyzed. When cyanogen bromide is used, the 
MS/MS data resulting from the two samples treated with cyanogen bromide/formic acid must be 
independently analyzed twice with SEQUEST because cyanogen bromide cleaves at methionine 
residues and leaves either homoserine or homoserine lactone (Aitken et al. 1989). For each of the 
two runs, engage the differential search modification and set to either -30 for homoserine or -48 
for homoserine lactone. SEQUEST results are evaluated using the criteria described in the panel on 
DATABASE SEARCHING in Protocol 5.
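The two differential modification values follow from the elemental compositions of the residues involved. The sketch below recomputes them from monoisotopic atomic masses; the residue formulas and mass values are standard chemistry supplied here for illustration and are not taken from the protocol.

```python
# Monoisotopic atomic masses (Da)
MONO = {"C": 12.0, "H": 1.00782503, "N": 14.0030740, "O": 15.9949146, "S": 31.9720707}

# Elemental composition of each residue as it occurs within a peptide chain
RESIDUES = {
    "methionine": {"C": 5, "H": 9, "N": 1, "O": 1, "S": 1},
    "homoserine": {"C": 4, "H": 7, "N": 1, "O": 2},
    "homoserine lactone": {"C": 4, "H": 5, "N": 1, "O": 1},
}


def residue_mass(formula):
    """Monoisotopic mass of a residue from its elemental composition."""
    return sum(MONO[element] * count for element, count in formula.items())


met = residue_mass(RESIDUES["methionine"])
# Mass change when a C-terminal methionine is converted by cyanogen bromide
shift_homoserine = residue_mass(RESIDUES["homoserine"]) - met
shift_lactone = residue_mass(RESIDUES["homoserine lactone"]) - met
```

Rounded to the nearest dalton, the shifts are -30 and -48, matching the two search settings given above.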
ADDITIONAL PROTOCOL: DIGESTION OF INSOLUBLE PROTEIN FRACTIONS FOR MuDPIT ANALYSIS


Proteins associated with insoluble particulate matter from a whole-cell lysate or carefully prepared membrane 
samples can also be analyzed using MuDPIT. However, preliminary steps are required to prepare a usable 
complex protein mixture from these types of samples. 


Additional Materials 
CAUTION: See Appendix 3 for appropriate handling of materials marked with <!>.
Ammonium bicarbonate, saturated <!>
Cyanogen bromide <!>
Formic acid (90%) <!>


WARNING: These steps MUST be carried out in a chemical fume hood exercising proper safety precautions.
Method
1. Lyophilize the insoluble fraction.

2. Add 100 µl of 90% formic acid to the lyophilized pellet, and incubate the tube for 5 minutes at room
temperature.

3. Add 100 µg of cyanogen bromide to the protein, mix the solution well, and incubate it overnight at room
temperature in the dark (wrap the tube in aluminum foil).

4. Add saturated ammonium bicarbonate to the tube until the pH reaches 8.5. Add the ammonium bicar-
bonate solution very slowly, because a large amount of bubbling occurs during this step. Use pH paper
to check the pH of the sample.

5. Concentrate the fraction to ~200 µl by lyophilization in a vacuum concentrator.

6. Proceed with Step 2 in the main protocol.


ANALYSIS OF THE S. CEREVISIAE PROTEOME USING MuDPIT


The genome of S. cerevisiae is arguably the most highly characterized of any organism. Ongoing efforts in the
S. cerevisiae community include developing extensive on-line databases that provide functional information
concerning the proteins of S. cerevisiae (Mewes et al. 2000; Ball et al. 2001; Costanzo et al. 2001b). Because of
these resources, the results of a MuDPIT analysis of the S. cerevisiae proteome could be characterized
(Washburn et al. 2001).

1. Strain BJ5460 (Jones 1991) was grown to mid-log phase (O.D. 0.6) in YPD at 30°C and the pellets were washed three
times with 1x phosphate-buffered saline (1.4 mM NaCl, 0.27 mM KCl, 1 mM Na2HPO4, 0.18 mM KH2PO4 at pH 7.4).

2. Three unique fractions were generated by lysing two separate groups of cells in a Mini-BeadBeater (BioSpec Products,
Bartlesville, Oklahoma) followed by centrifugation. Three cycles of lysis and supernatant removal were carried out.

3. After concentration by lyophilization to ~200 µl, the supernatant was digested and prepared for MuDPIT analysis as
described in Protocol 6, Steps 1-7.

4. The remaining two pellets were washed by the addition of 1x phosphate-buffered saline to the tube, vortexed for 2 min-
utes, and pelleted by centrifugation at 14,000 rpm in a microfuge for 10 minutes. One pellet (named the lightly washed
insoluble pellet) was washed once, and a second pellet (named the heavily washed insoluble pellet) was washed three
times in this fashion. Each sample was lyophilized to dryness and prepared for MuDPIT analysis as described in the
ADDITIONAL PROTOCOL: DIGESTION OF INSOLUBLE PROTEIN FRACTIONS FOR MuDPIT ANALYSIS.

5. Each of the three fractions was then analyzed separately by MuDPIT as described in Protocol 6.

The analysis of the yeast proteome by MuDPIT is the most comprehensive proteome analysis to date. After combining
the MS/MS data generated from all three samples, 5540 peptides were assigned to MS spectra leading to the identification
of 1484 proteins from the S. cerevisiae proteome. Each of the three prepared fractions provided unique hits to the final
data set. After analysis of the data set using on-line S. cerevisiae databases (Mewes et al. 2000; Ball et al. 2001; Costanzo et
al. 2001b), MuDPIT proved to be largely unbiased, because proteins predicted to be of low abundance and proteins with
extremes in pI, molecular mass, and hydrophobicity were detected and identified. Of particular note was the detection
and identification of 131 proteins with three or more predicted transmembrane domains, many of which were known
integral membrane proteins (Washburn et al. 2001). These results demonstrated the potential for MuDPIT to rapidly
detect and identify thousands of proteins from an organism in a largely unbiased fashion.


PROTOCOL 7 


Combining 2D Chromatography and MS for 
the Separation of Complex Peptide Mixtures: 
An Off-line Approach 


THE COMBINATION OF MULTIDIMENSIONAL CHROMATOGRAPHY for peptide separation and
MS/MS for further separation and peptide sequence analysis represents an important tool for 
proteome analysis. The following protocol is based on the separation and analysis of 1 mg of 
soluble yeast protein, but the method can be optimized for the separation of any complex 
protein mixture. This protocol was kindly provided by Steven P. Gygi and his colleagues, 
Melanie P. Gygi, Larry J. Licklider, and Junmin Peng (Harvard Medical School, Boston, 
Massachusetts). 


MATERIALS 


CAUTION: See Appendix 3 for appropriate handling of materials marked with <!>.


Reagents
CaCl2 <!>
Denaturing solution
  8 M urea <!>
  0.2% SDS <!>
  10 mM dithiothreitol (DTT) <!>
  10 mM Tris-Cl (pH 8.5)
For resolubilization, the SDS is optional. It is removed in the void volume of the SCX chromatog-
raphy column. Its final concentration in the protein solution must be below 0.1% for effective
trypsin digestion (see Steps 5-7).
Dialysis solution
  2 M urea
  10 mM Tris-Cl (pH 8.5)
Iodoacetamide (IAA) <!>
RP chromatography solvents
  Solvent A
    0.4% acetic acid <!>
    0.005% heptafluorobutyric acid (HFBA) <!>
    5% acetonitrile <!>
  Solvent B
    0.4% acetic acid
    0.005% HFBA
    95% acetonitrile
SCX chromatography buffers
  Buffer A
    5 mM KH2PO4 <!>
    25% acetonitrile (pH 2.7)
  Buffer B
    5 mM KH2PO4
    25% acetonitrile (pH 2.7)
    350 mM KCl <!>
Trifluoroacetic acid (TFA) <!>


Equipment

Dialysis tubing

Gold wire

LCQ DECA ion-trap mass spectrometer (ThermoFinnigan, San Jose, California) or equivalent

MicroCross (Upchurch Scientific)

Restrictor capillary for flow splitting (50 µm I.D. x 50 cm)

RP capillary chromatography column
This column is an in-house-packed 75 µm x 12-cm fused-silica capillary column packed with Magic
C18 (5 µm, 200 Å, Michrom BioResources, Auburn, California).
See Steps 15 and 16.

SCX chromatography column 
This column is a commercially packed 2.1 x 150-mm polysulfoethyl aspartamide (PolyLC, Columbia,
Maryland) column and an in-line guard column of the same material.

SEQUEST software (ThermoFinnigan, San Jose, California) 
This algorithm can be utilized to match acquired tandem mass spectra with peptide sequences from
databases. 


Biological Molecules
Peptide standards (Sigma)
Peptide standards are used to evaluate the performance of the SCX column. The following six pep-
tides were individually dissolved in H2O, mixed together, and diluted to a concentration of 1 pmole
peptide/µl in SCX chromatography Buffer A. The predicted net charge state of each peptide at pH
2.7 is shown in parentheses.
  angiotensin I (4+)
  bradykinin (3+)
  neurotensin (3+)
  bradykinin fragment 1-5 (2+)
  leucine enkephalinamide (2+)
  leucine enkephalin (1+)
Soluble yeast protein or other complex protein mixture
Soluble yeast protein can be obtained as described by Garrels et al. (1994).


Trypsin, TPCK-modified sequencing grade (Promega)


METHOD


Proteolysis of Soluble Yeast Protein


1. Solubilize 1 mg of lyophilized soluble yeast protein for 15 minutes at 37°C in 0.5 ml of
denaturing solution.


2. Add IAA to a final concentration of 50 mM and place the sample in the dark for 30 minutes
at room temperature to alkylate the cysteine residues.


3. Dialyze the solution against 100 ml of dialysis solution for 1 hour at room temperature
to remove salts and other low-molecular-weight material.


4. Repeat Step 3 twice with fresh changes of dialysis solution.


5. Add 20 µg of trypsin to the sample, and incubate it for 10 minutes at 37°C.


6. Dilute the sample twofold (to 1.0 ml) with 10 mM Tris-Cl (pH 8.5). Add CaCl2 to a final
concentration of 1 mM. Use a strip of pH paper to verify that the pH of the sample is
between 8 and 9.


If SDS was included in the denaturation solution (Step 1), its final concentration in the protein
solution must be below 0.1% for effective trypsin digestion.


7. Allow the trypsin to digest the protein sample overnight at 37°C.


8. Stop the digestion by acidifying the sample with 2 µl of TFA. Use a strip of pH paper to
verify that the pH of the solution is <3.


Alternative methods for solubilizing (Step 1) and proteolyzing proteins may be used (e.g.,
omitting SDS, using a different reducing reagent, and omitting dialysis if salt concentration is
low). However, it is critical that the final salt concentration be <20 mM (this includes Tris-Cl
and any salt from protein harvesting) and the solution pH be ~3 immediately prior to peptide
separation by SCX chromatography. If the salt concentration is greater than 20 mM, use either
membrane dialysis (see Steps 3 and 4) or a C18 solid-phase extraction (SPE) column (Vydac)
according to the manufacturer's instructions for removal of salt.


Preparation of SCX Chromatography Column 


9. Set up the HPLC with the SCX buffers and SCX column. 


10. Set the flow rate of the column to 200 µl/min, and acquire a blank gradient.


TABLE 8.12. Shallow gradient for SCX chromatography of tryptic peptides

Time (min)    %B     Flow (µl/min)
    0           0        200
    1           5        200
   80          35        200
   90         100        200
  100         100        300
  116           0        300
  120           0        200

Conditions: 2.1 x 200-mm SCX column. Buffer A was 5 mM KH2PO4, 25% acetonitrile
(pH 2.7). Buffer B was 5 mM KH2PO4, 25% acetonitrile, 350 mM KCl (pH 2.7).


The gradient for peptide elution should be shallow in the area from ~5% to 35% Buffer B (see
Table 8.12). This separates peptides with a net charge of 2+ (which is most tryptic peptides)
better than a linear gradient does.


11. Test the performance of the SCX column. 
a. Load 200 pmoles of each standard peptide in Buffer A onto the SCX column. 
b. Analyze the peptides using a UV spectrophotometer with the absorbance set at 214 nm.


Six completely resolved chromatographic peaks should be observed with peak widths of
≤1 minute.
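The shallow region of Table 8.12 is easier to appreciate when the gradient is converted to a salt concentration over time. The following sketch does this under two assumptions that the protocol does not spell out: the pump interpolates linearly between the programmed time points, and the KCl concentration is simply the Buffer B fraction times 350 mM. The function names are illustrative.

```python
# (time in min, % Buffer B, flow in µl/min), copied from Table 8.12
GRADIENT = [
    (0, 0, 200),
    (1, 5, 200),
    (80, 35, 200),
    (90, 100, 200),
    (100, 100, 300),
    (116, 0, 300),
    (120, 0, 200),
]


def percent_b(t_min):
    """% Buffer B at a given time, assuming linear ramps between table points."""
    if t_min <= GRADIENT[0][0]:
        return float(GRADIENT[0][1])
    for (t0, b0, _), (t1, b1, _) in zip(GRADIENT, GRADIENT[1:]):
        if t0 <= t_min <= t1:
            return b0 + (b1 - b0) * (t_min - t0) / (t1 - t0)
    return float(GRADIENT[-1][1])


def kcl_mM(t_min):
    """KCl concentration delivered to the column (Buffer B contains 350 mM KCl)."""
    return 350.0 * percent_b(t_min) / 100.0


# Slope of the shallow segment (minutes 1 to 80), in mM KCl per minute
shallow_slope = (kcl_mM(80) - kcl_mM(1)) / (80 - 1)
```

Under these assumptions the shallow segment raises the KCl concentration by only about 1.3 mM per minute, compared with roughly 23 mM per minute in the steep segment from 80 to 90 minutes.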


SCX Chromatography of Yeast Peptides 


12. Load the acidified yeast peptide sample onto the column. Wash the column for 5-10 min- 
utes with Buffer A until the UV trace returns to baseline. 


13. Start the gradient and collect 200-µl fractions per minute.


The peptide elution profile for 1 mg of yeast separated by this technique is shown in Figure
8.44. It is possible to load and separate as much as 5 mg of total peptides by this technique.
Other column sizes, either larger or smaller, can be used depending on the amount of starting
material.


14. Reduce the fraction volumes to ~50-100 µl by centrifugal evaporation.


This step also removes most of the acetonitrile, permitting peptides to adsorb to the reversed- 
phase chromatography column. 


[Figure 8.44. x-axis: Fraction number (every minute)]


FIGURE 8.44. Separation of 1 mg of yeast peptides by strong cation exchange (SCX) chromatography. Soluble
yeast protein (1 mg) was harvested, reduced, denatured, and proteolytically cleaved with trypsin as described
in this protocol. Fractions (200 µl) were collected every minute into a 96-well plate. The elution salt gradient
has been superimposed and is from Table 8.12. Specific fractions (16, 32, 48, and 64) were later examined by
reversed-phase nanoscale microcapillary LC-MS/MS for peptide identification (Table 8.13). Conditions: 2.1 x
200-mm SCX chromatography column, 200 µl/min flow rate, monitored UV absorbance at 214 nm.


TABLE 8.13. Peptide sequence analysis of four SCX fractions by nanoscale reversed-phase capillary
LC-MS/MS and database searching

Fraction    Five representative peptides          Peptide charge state   % Peptides identified
number(a)   identified from each fraction(b)      (mean ± S.D.)(c)       with charges (2:3:4:5)

16          (2) R.WAGNANELNAAYAADGYAR.I           2.00 ± 0.10            99:1:0:0
            (2) K.NDLESYLNQAVEVSR.D
            (2) R.SNYDTDVFTPLFER.I
            (2) R.SSVLADALNAINNAEK.T
            (2) E.NPSDITQERYNAFYK.S

32          (3) K.LPLVGGHEGAGVVVGMGENVKE.G        3.00 ± 0.00            0:100:0:0
            (3) R.FPSNGAFAEYSAISSETAYKFPAR.E
            (3) K.IGEIQDAVENQALQLPR.V
            (3) K.ATDGGAHGVINVSVSEAATEASTR.Y
            (3) K.ALENPTRPFLAILGGAK.V

48          (3) K.KPQVTVGAQNAYLE.A                3.02 ± 0.15            0:98:2:0
            (3) R.HLUNDQPNADIVTEGDE.I
            (3) K.SHVVDYDDSSITENTR.C
            (3) R.NEDDVIAQNLIESK.Y
            (3) R.HALALSPLGAEGER.K

64          (3) K.HSNGDAWVEAR.G                   3.92 ± 0.31            0:9:90:1
            (4) R.YGANPHQKPAQAYVSQQDSLPFE.V
            (4) R.IEEELGDNAVFAGENFHHGDRL.-
            (4) K.STSGNTHLGGQDFDTNLLEHFE.A
            (4) K.LNSKPVELKPVSLDNK.T

(a) Fraction numbers correspond to those in Figure 8.44.

(b) Peptides were identified by database matching using the SEQUEST algorithm against the yeast protein database with no
enzyme specified. Matches were considered significant if the Xcorr score was ≥3.0, and the peptide was fully tryptic. Hundreds
of peptides were automatically identified from each fraction. Only five representative peptides are shown here for method
instruction. The predicted solution charge state at pH 2.7 is shown in parentheses and was calculated using the amino acids
underlined and the amino terminus. For thoroughness, the amino acid both immediately before and after the identified peptide
is also shown.

(c) Mean (positive) solution charge state was calculated for each peptide at pH 2.7, with each occurrence of histidine, arginine,
and lysine providing one charge and the amino terminus always providing an additional charge.
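The charge rule in the table footnote is simple enough to express directly. The sketch below is an illustration only: it applies the stated rule (one charge per histidine, arginine, and lysine, plus one for the amino terminus) to bare peptide sequences, and it summarizes a charge distribution with a population standard deviation, which is an assumption because the source does not say which estimator was used.

```python
import math


def solution_charge(peptide):
    """Predicted positive solution charge at pH 2.7: His + Arg + Lys + amino terminus."""
    return 1 + sum(peptide.count(residue) for residue in "HRK")


def mean_and_sd(percent_by_charge):
    """Weighted mean and population S.D. of a {charge: percent of peptides} mapping."""
    total = sum(percent_by_charge.values())
    mean = sum(z * w for z, w in percent_by_charge.items()) / total
    variance = sum(w * (z - mean) ** 2 for z, w in percent_by_charge.items()) / total
    return mean, math.sqrt(variance)


# Sequences taken from Table 8.13 (flanking residues omitted)
charge_fraction_16 = solution_charge("SSVLADALNAINNAEK")
charge_fraction_64 = solution_charge("LNSKPVELKPVSLDNK")

# Charge distribution reported for fraction 64 (2:3:4:5 = 0:9:90:1)
mean_64, sd_64 = mean_and_sd({2: 0, 3: 9, 4: 90, 5: 1})
```

For fraction 64 this reproduces the tabulated 3.92 ± 0.31, which illustrates how cleanly SCX sorts tryptic peptides by solution charge.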


RP Chromatography of SCX Chromatography Fractions


For maximum sensitivity and peptide identification, combine nanoscale reversed-phase 
chromatography capillary LC with on-line detection via an ion-trap mass spectrometer. 
Using this combination, peptide detection levels are ~1-5 fmoles, with tandem mass spectra
(sequencing attempts) being acquired at a rate of more than 2000 spectra/hour. 


15. Pull a 20-cm-long piece of 75-µm I.D. fused-silica capillary to form a needle tip.


There are many ways to pull an electrospray needle. The simplest is to attach a weight (paper
binder) to the needle and then use a microtorch flame to stretch the capillary to a point.
Alternatively, the procedure using a laser puller described in Protocol 5 is effective. In addition,
the pulled tips or the pre-poured columns may be purchased (e.g., New Objective, Cambridge,
Massachusetts).


16. Pack a piece of 75-µm fused-silica capillary that has a flame-drawn needle tip with a bed
of 5 µm, 200 Å C18 particles to a length of 12 cm. (For details on packing the capillary col-
umn, see Protocol 4 or 5.)


The pulled tip serves to contain the packed bed in the capillary and functions as an electrospray
needle for MS.


17. Connect the four arms of a MicroCross as follows:
a. A flow from the HPLC (100 µl/min).
b. A restrictor capillary for flow splitting (50 µm I.D. x 50 cm) carrying 99.7 µl/min.
c. A gold wire connected to a high-voltage supply (1.8 kV).


d. A nanoscale reversed-phase capillary column with pulled tip (from Step 16) receiving
300 nl/min.


18. Operate the HPLC at 100 µl/min and split the flow down to only 300 nl/min using the
MicroCross before the reversed-phase column.


19. Pressure-load off-line 10 µl (or more) from a single 50-µl SCX chromatography fraction
onto the column, followed by gradient elution.


20. Subject eluting peptide peaks to ESI and place them in the ion-trap mass spectrometer. 


The MS operates in two modes, namely, MS mode (measuring the m/z ratios of peptide ions)
and MS/MS mode (sequencing peptides). 


Thousands of sequencing attempts can be made in a single 1-hour reversed-phase LC-MS/MS
analysis by acquiring the MS/MS scans in a data-dependent fashion, where the peptide ions to 
be sequenced from each MS scan are chosen “on the fly” and five peptide ions are selected for 
sequencing (fragmentation) from each MS spectrum. 
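The flow figures quoted in Steps 17 and 18 are related by a simple mass balance at the MicroCross. The short sketch below works through it; the variable names are illustrative, and in practice the actual split depends on the back pressure of the packed column and the restrictor, so the flow at the tip should still be measured.

```python
# Flows at the MicroCross, in µl/min
pump_flow = 100.0          # delivered by the HPLC
column_flow = 0.3          # 300 nl/min through the packed capillary column

# Everything that does not enter the column leaves through the restrictor
restrictor_flow = pump_flow - column_flow

# Split ratio: pump flow per unit of flow that reaches the column
split_ratio = pump_flow / column_flow

# Fraction of the solvent that actually passes through the column, in percent
percent_to_column = 100.0 * column_flow / pump_flow
```

Only about 0.3% of the pumped solvent reaches the column, a split of roughly 333:1, which is why the restrictor carries 99.7 µl/min.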


The setup described above can be further automated by using an autosampler to inject the 
SCX chromatography fractions directly from the 96-well collection plate. A peptide trap 
(LC Packings or Michrom BioResources) can also be incorporated into the strategy before
the resolving reversed-phase chromatography capillary column to allow for increased sam- 
ple volumes and improved concentration and desalting of samples. In addition, we have 
designed a high-sensitivity method for the automated analysis by nanoscale capillary LC- 
MS/MS directly from 96-well plates utilizing a device termed a “V-column,” or “vented col- 
umn,” which is derived from the direct fusion of a peptide trap and a nanoscale capillary 
column (Licklider et al. 2002). 

The separation power of the technique can be appreciated by the total number of 
sequencing attempts that can be obtained from the original peptide sample. When 20 frac- 
tions are analyzed, more than 40,000 peptides could potentially be sequenced. Furthermore, 
the lengths of the analyses in both dimensions are highly scalable, which can result in an
increase in total sequencing attempts of perhaps an order of magnitude. 
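The figure of 40,000 follows from the acquisition rate quoted earlier in this protocol. The sketch below shows the arithmetic; the scaled-up scenario at the end uses hypothetical numbers chosen only to illustrate how an order-of-magnitude increase could arise.

```python
def sequencing_attempts(fractions, hours_per_fraction, spectra_per_hour=2000):
    """Upper bound on MS/MS spectra acquired across all analyzed SCX fractions."""
    return fractions * hours_per_fraction * spectra_per_hour


# As described in the text: 20 fractions, each given a 1-hour LC-MS/MS analysis
baseline = sequencing_attempts(fractions=20, hours_per_fraction=1)

# Hypothetical scale-up: four times as many fractions and 2.5-hour gradients
scaled = sequencing_attempts(fractions=80, hours_per_fraction=2.5)
```

These are counts of sequencing attempts, not of identified peptides; the number of confident identifications will be lower.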


PROTOCOL 8 


Cysteinyl Peptide Capture of Proteins 


ANALYZING AN UNKNOWN PROTEOLYTIC MIXTURE by LC-MS/MS can be difficult because of the
potential complexity of the mixture as well as the dynamic range of proteins that are repre- 
sented. Peptides from the most-abundant proteins may be repeatedly selected by the mass 
spectrometer for fragmentation, thereby preventing the identification of peptides from less- 
abundant proteins. One strategy to aid in the identification of less-abundant proteins from a 
complex proteolytic mixture is to fractionate the mixture prior to its analysis in a mass spec- 
trometer. 

Cysteine represents a mere 1.7% of all amino acids in proteins (Coligan et al. 1999). By 
affinity tagging cysteine residues prior to digesting the protein mixture, the resulting cysteinyl
peptides can be selectively captured by affinity chromatography. The goal is to capture one 
cysteinyl peptide from each protein in the mixture, due to the low abundance of cysteine 
within proteins. The fraction containing cysteinyl peptides can then be analyzed by LC- 
MS/MS and should result in the identification of low-abundance proteins that would ordi- 
narily not be selected for fragmentation within the mass spectrometer. 
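The simplification achieved by cysteinyl peptide capture can be estimated from the 1.7% figure alone. The following is a rough sketch that assumes cysteine occurs independently at every position with the same frequency, which real proteins do not strictly obey; the peptide and protein lengths are arbitrary examples.

```python
CYS_FREQUENCY = 0.017  # cysteine as a fraction of all residues (from the text)


def prob_contains_cys(length):
    """Probability that a sequence of the given length has at least one cysteine."""
    return 1.0 - (1.0 - CYS_FREQUENCY) ** length


# A typical tryptic peptide of about 10 residues
p_peptide = prob_contains_cys(10)

# A protein of about 400 residues
p_protein = prob_contains_cys(400)
```

Under this assumption only about one tryptic peptide in six contains cysteine, whereas nearly every protein of average size contains at least one, so the captured fraction is much simpler while most proteins remain represented.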

Recently, sulfhydryl-specific reagents have been used for simplification of complex pep-
tide mixtures (Spahr et al. 2000) as well as for quantitative analysis (Gygi et al. 1999). The
sulfhydryl-specific reagent used in this protocol, Biotin-HPDP (N-[6-(biotinamido)hexyl]-
3'-[2'-pyridyldithio]propionamide; Pierce Chemical Company), contains a sulfhydryl-reac-
tive moiety separated from a biotin moiety by a thiol-cleavable spacer arm. Reduction of the
biotinylated cysteinyl peptides bound to immobilized avidin permits isolation of the cysteinyl 
peptides in their native state. 

This protocol works well with complex protein mixtures across a broad dynamic range 
even in the presence of undesirable buffer components. However, to optimize the results, the 
protocol must be adapted for specific applications. For the analysis of relatively simple pro- 
teolytic mixtures by MS (especially where identifying components is the goal), it may be 
more suitable to rely on the data-dependent functions of the mass spectrometer for frag- 
menting as many peptides as possible, rather than using the cysteinyl peptide capture strate- 
gy. For simple mixtures, some of the mass spectrometer software utilities such as "dynamic 
exclusion" should be sufficient for looking deeper into a sample. Dynamic exclusion puts an 
ion that has been previously fragmented into a "don't fragment" list for a user-defined peri- 
od of time, thereby allowing the selection of less-intense precursor ions. This protocol was 
provided by Chris Spahr (Amgen Inc., Thousand Oaks, California). 
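The effect of dynamic exclusion on precursor selection can be illustrated with a toy simulation. This is a simplified sketch, not the instrument vendor's algorithm: ions are matched by exact m/z value rather than within a mass tolerance, and the scan times, m/z values, and intensities are invented for the example.

```python
def select_precursors(scans, top_n=1, exclusion_s=60.0):
    """Pick the top_n most intense ions per MS scan, skipping recently fragmented ones.

    scans: list of (time in seconds, {m/z: intensity}) tuples in time order.
    Returns a list of (time, [selected m/z values]).
    """
    excluded_until = {}   # m/z -> time at which the ion becomes selectable again
    selections = []
    for time_s, ions in scans:
        eligible = [
            (intensity, mz)
            for mz, intensity in ions.items()
            if excluded_until.get(mz, -1.0) <= time_s
        ]
        eligible.sort(reverse=True)            # most intense first
        chosen = [mz for _, mz in eligible[:top_n]]
        for mz in chosen:
            excluded_until[mz] = time_s + exclusion_s
        selections.append((time_s, chosen))
    return selections


# Hypothetical survey scans: one abundant ion and two weaker co-eluting ions
ions = {500.3: 1.0e6, 622.8: 5.0e5, 710.4: 1.0e4}
scans = [(0.0, ions), (2.0, ions), (4.0, ions), (62.0, ions)]
picked = select_precursors(scans, top_n=1, exclusion_s=60.0)
```

Without exclusion the most intense ion would be fragmented in every scan. With a 60-second exclusion window the two weaker ions are selected in the second and third scans, and the abundant ion becomes eligible again only after its window expires.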
MATERIALS


P» Reagents 


) Equipment 


CAUTION: See Appendix 3 for appropriate handling of materials marked with <!>. 


Biotinylation reagent (EZ-Link Biotin-HPDP from Pierce} 


Prepare a 4 mM Biotin-HPDP stock solution by adding 2.2 mg of Biotin-HPDP to 1 ml of dimethyl 
sulfoxide (DMSO) «1». This stock can be stored frozen. 


Biotinylation/tryptic digestion buffer (PBS [pH 7.4] containing 1 mm EDTA and 2 M urea) 


Prepare a 100 mm EDTA stock solution by dissolving 0.186 g of EDTA in 5 m! o£ H,O. Combine 2.4 
g of urea <!> with 200 pl af the EDTA stock solution, and add 1x PBS to a final volume of 20 mi. 


Denaturation buffer (40 mM Tris-Cl [pH 7.0j containing 8 M urea) 


Combine 2.4 g of urea with 200 yl of 1 at Tris-Cl (pH 7,0) stock, Add H,O to give a final volume of 
5 ml. 


Denaturation/reducing buffer 
Just prior to use, add lO ul of Lo dithiothreitol «I» to 1 ml of denaturation buffer. 
Dithiothreitol (DIT) <I> 
Formic acid <!> 
Iodoacetamide (IAA, an alkylating reagent) <!> 
Prepare a | mM stock solution by adding 0.184 g of [AA to ] ml of 11,0. 
PBS containing 1 mM EDTA 
Phosphate-buffered saline (PBS} 
Trypsin (sequencing-grade, from Roche Molecular Biochemicals} 


Prepare a 1 ug/ul stock solution by diluting the dry enzyme in H,O. ‘Lhe stock should be stared 
frozen at BC. 


Centricon YM-3 (3000-m.w. cutoff; Millipore) 
Centrifuge (refrigerated) (Beckman GS-6R centrifuge or equivalent) 
ImmunoPure immobilized avidin Affinit yPak columns ( |-ml prepacked columns, Pierce Endo- 
gen) 
Oasis HLB cartridge 
Use !-cc (30 mg) extraction cartridge (Waters, Milford, Massachusetts} or equivalent desalting device. 
Vacuum concentrator (c.g., Speed Vac, Thermo Savant) 
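
The stock recipes listed under Reagents can be sanity-checked with a short calculation. The sketch below converts each weighed mass and final volume into a molar concentration. The molecular weights are assumed standard values, not figures given in the protocol, and the EDTA entry assumes the disodium dihydrate salt.

```python
# Molecular weights (g/mol) are assumed standard values, not taken from the protocol.
MW = {
    "biotin_hpdp": 539.8,
    "edta_disodium_dihydrate": 372.24,  # assumes this salt form of EDTA
    "urea": 60.06,
    "iodoacetamide": 184.96,
}


def molarity(mass_g, mw, volume_ml):
    """Molar concentration (mol/l) of mass_g dissolved to a final volume of volume_ml."""
    return mass_g / mw / (volume_ml / 1000.0)


recipes = {
    "Biotin-HPDP stock (2.2 mg in 1 ml)": molarity(0.0022, MW["biotin_hpdp"], 1.0),
    "EDTA stock (0.186 g in 5 ml)": molarity(0.186, MW["edta_disodium_dihydrate"], 5.0),
    "Urea, digestion buffer (2.4 g in 20 ml)": molarity(2.4, MW["urea"], 20.0),
    "Urea, denaturation buffer (2.4 g in 5 ml)": molarity(2.4, MW["urea"], 5.0),
    "Iodoacetamide stock (0.184 g in 1 ml)": molarity(0.184, MW["iodoacetamide"], 1.0),
}

for name, conc in recipes.items():
    print(f"{name}: {conc:.3f} M")
```

With these assumed molecular weights, the recipes come out close to 4 mM, 100 mM, 2 M, 8 M, and 1 M, respectively.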


Biological Sample


Complex protein mixture
This protocol is based on the analysis of protein released from mouse liver mitochondria following
treatment with atractyloside (Patterson et al. 2000). Protein (120 µg) was used for the cysteinyl pep-
tide capture. Although varying amounts of a complex protein mixture can be used in this protocol,
the limiting factor may be the binding capacity of the immobilized avidin column. The amounts of
denaturant/reductant, biotin, trypsin, and avidin should be optimized for each protein sample.

METHOD


Denaturation, Reduction, and Buffer Exchange 


1. Concentrate 120 µg of the complex protein mixture in a vacuum concentrator until the
sample is completely dry.


2. Resuspend the dried sample in ~1 ml of denaturation/reducing buffer. Denature and
reduce the sample by incubating it for 30 minutes at 37°C.

The precise volume is unimportant, because the sample buffer will be exchanged in the next
step. The smaller the sample volume, however, the less time it will take to exchange the buffer.


3. To remove the reductant, decrease the urea concentration, and remove any undesirable
buffer components, exchange the sample buffer with ~4 ml of biotinylation/tryptic
digestion buffer using a Centricon YM-3 concentrator at 4°C in a centrifuge.


Labeling with Biotin-HPDP and Buffer Exchange 


4. To label all of the free sulfhydryls, add 50 µl of 4 mM Biotin-HPDP in DMSO to the sam-
ple (the volume of the sample from Step 3 was ~1 ml) and incubate the labeling mixture
for 90 minutes at room temperature.


An excess of Biotin-HPDP was used because of the uncertainty as to whether or not the
labeling reaction would work in the presence of 2 M urea. This amount of Biotin-HPDP greatly
exceeds the binding capacity of the immobilized avidin column used (theoretical binding
capacity of 1 mg of avidin is 59 nmoles of biotin).


5. Remove all of the unbound Biotin-HPDP by exchanging the buffer with fresh biotinyla- 
tion/tryptic digestion buffer at 4°C in a Centricon YM-3 concentrator. 


The concentrated sample volume should be ~100 µl.


Unbound Biotin-HPDP will compete with the biotinylated peptides for binding to avidin. The
Centricon YM-3 used in Step 3 can be used again in this step.
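
To see why removing unbound reagent matters, the amount of Biotin-HPDP added in Step 4 can be compared with the stated avidin capacity (59 nmoles of biotin per mg of avidin). This is a minimal sketch; the function names are illustrative, and the amount of avidin actually present in a given column is not stated in the protocol, so only the avidin mass that the reagent alone could saturate is computed.

```python
AVIDIN_CAPACITY_NMOL_PER_MG = 59.0  # theoretical capacity quoted in the note to Step 4


def nmol_added(volume_ul, conc_mM):
    """1 mM equals 1 nmol/µl, so µl x mM gives nmol."""
    return volume_ul * conc_mM


def avidin_mg_saturated(biotin_nmol):
    """Mass of avidin (mg) that this much free biotin reagent could occupy."""
    return biotin_nmol / AVIDIN_CAPACITY_NMOL_PER_MG


reagent_nmol = nmol_added(50, 4)            # 50 µl of 4 mM Biotin-HPDP
avidin_mg = avidin_mg_saturated(reagent_nmol)
print(f"{reagent_nmol:.0f} nmol of reagent could occupy {avidin_mg:.1f} mg of avidin")
```

The 200 nmol added in Step 4 could occupy roughly 3.4 mg of avidin on its own, which is why the buffer exchange in Step 5 precedes affinity capture.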


Digestion of the Labeled Complex Mixture


6. Add 1.5 µg of trypsin to the biotinylated sample and incubate the digest overnight at
37°C.


7. Dilute the biotinylated digest in PBS containing 1 mM EDTA to reduce the urea concen-
tration to <0.5 M.


If the digest is to be applied to the avidin affinity column within a few hours, it can be held at
room temperature, 4°C, or frozen. For extended storage, freeze the peptides at -80°C.
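
The dilution in Step 7 follows from C1V1 = C2V2. The sketch below assumes the digest is ~100 µl (the volume given after Step 5) and still contains 2 M urea from the biotinylation/tryptic digestion buffer. Both values are assumptions about a particular run, and the function name is illustrative.

```python
def min_diluent_ul(sample_ul, conc_start_M, conc_target_M):
    """Volume of urea-free diluent needed to bring conc_start_M down to conc_target_M."""
    if conc_target_M <= 0 or conc_target_M >= conc_start_M:
        raise ValueError("target must be positive and below the starting concentration")
    final_volume_ul = sample_ul * conc_start_M / conc_target_M  # C1*V1 = C2*V2
    return final_volume_ul - sample_ul


# 100 µl of digest in 2 M urea, diluted to exactly 0.5 M
diluent = min_diluent_ul(100.0, 2.0, 0.5)
print(f"Add more than {diluent:.0f} µl of PBS/1 mM EDTA")
```

Because the target is a ceiling (<0.5 M), the computed volume is a minimum; any larger volume of PBS containing 1 mM EDTA also satisfies the step.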


Immobilized Avidin Affinity Capture and Elution 


8. Equilibrate a 1-ml immobilized avidin column with 5 ml of PBS containing 1 mM EDTA. 


9. Apply the biotinylated digest (from Step 7) to the column, close the column, and incu-
bate the material for 30 minutes at room temperature. Do not allow the column to run
dry.


10. To collect noncysteinyl peptides, wash the column with 10 ml of PBS containing 1 mM
EDTA and collect the eluate. This yields the flowthrough fraction.


11. Elute the cysteinyl peptides from the immobilized avidin:


a. Apply 2 ml of PBS containing 50 mM DTT (added just prior to use) to the column
and allow it to enter the column.


b. Close the column and incubate it for 30 minutes at room temperature.


c. Open the column, and collect the eluate in 2-ml fractions. This yields the bound
fraction.


Alkylation and Desalting


12. If LC-MS/MS analysis is desired, alkylate the flowthrough and bound fractions with IAA
(10 mM and 100 mM, respectively) for 30 minutes at room temperature in the dark.

The flowthrough fraction is alkylated in case any cysteinyl peptides made it into this fraction.


13. Acidify both fractions with 90% formic acid (add 20 µl to the bound fraction and 100 µl
to the flowthrough fraction) to give a final concentration of 1% formic acid. Freeze the
fractions at -80°C until they are to be desalted.


14. Desalt each fraction by passing the material through an Oasis HLB cartridge or equiva- 
lent device prior to LC-MS/MS according to the manufacturer's instructions. 
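
The formic acid additions in Step 13 can be checked against the fraction volumes. The sketch below assumes the bound fraction is 2 ml (Step 11) and the flowthrough fraction is 10 ml (Step 10), and ignores the small volumes contributed by the applied digest and the IAA stock. These simplifications are assumptions, not statements from the protocol.

```python
def final_percent(stock_percent, added_ul, sample_ul):
    """Final percentage (v/v) after adding added_ul of stock to sample_ul of sample."""
    return stock_percent * added_ul / (sample_ul + added_ul)


bound = final_percent(90.0, 20.0, 2000.0)          # 20 µl into ~2 ml
flowthrough = final_percent(90.0, 100.0, 10000.0)  # 100 µl into ~10 ml
print(f"bound: {bound:.2f}%  flowthrough: {flowthrough:.2f}%")
```

Under these assumptions both fractions end up at about 0.9% formic acid, consistent with the nominal 1% target.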


PROTOCOL 9 


Isotope-coded Affinity Tagging of Proteins


QUANTITATIVE PROTEOMICS HAS TRADITIONALLY BEEN PERFORMED using 2D gel electrophoresis,
where quantitation is accomplished by recording differences in the staining patterns of pro-
teins derived from two states of cell populations or tissues from a similar biological system.
More recently, MS methods based on stable isotope quantitation have been developed that
show significant potential for differential expression proteomic studies. One such in vitro
method, described in this protocol, involves the use of isotope-coded affinity tags (ICATs).
This approach, illustrated schematically in Figure 8.45, uses a protein tag with three func-
tional moieties: a cysteine-reactive moiety, a linker with either eight hydrogens (the light form
of the reagent) or eight deuteriums (the heavy form of the reagent, having an isotope code or
mass tag of 8 daltons), and a biotin moiety (the affinity tag). Using this technique, the cys-
teine side chains in complex mixtures of proteins from two different states of a cell popula-
tion (e.g., normal vs. disease) are reduced and alkylated using the light form of the reagent
(d0-labeled tag) in one cell state and the heavy form of the reagent (d8-labeled tag) for pro-
teins in the second cell state. The two mixtures (d0- and d8-labeled) are then combined and
subjected to proteolytic digestion (typically, with trypsin and/or Lys-C). The resulting cysteine-
containing peptides are affinity-purified using an avidin column, resulting in a "simplified"
mixture of peptides that contains about tenfold fewer peptides than the original mixture.
These peptides are analyzed by MS, and quantitation information based on the relative abun-
dance of the d0 and d8 isotopes is obtained. The identification of proteins is obtained from
the peptide molecular mass and MS/MS-derived amino acid sequence. This protocol was
kindly provided by Ruedi Aebersold, Timothy J. Griffin, and Sam Donohoe (The Institute for
Systems Biology). Four stages comprise this protocol:


* Stage 1: Labeling Proteins with ICAT

* Stage 2: Cation Exchange Cleanup of ICAT Samples

* Stage 3: Selection of Tagged Proteins Using an Avidin Column

* Stage 4: Analysis of ICAT-labeled Peptides by MS
FIGURE 8.45. ICAT strategy for quantifying differential protein expression. (A) Structure of the ICAT reagent. The reagent consists
of three elements: an affinity tag (biotin), which is used to isolate ICAT-labeled peptides; a linker that can incorporate stable isotopes;
and a reactive group with specificity toward thiol groups (cysteines). The reagent exists in two forms: heavy (contains eight deuteri-
ums) and light (contains no deuteriums). (B) Two protein mixtures representing two different cell states are treated with the iso-
topically light (gray) and heavy (black circle) ICAT reagents, respectively; an ICAT reagent is covalently attached to each cysteinyl
residue in every protein. The protein mixtures are combined and digested to peptides, and ICAT-labeled peptides are isolated utiliz-
ing the biotin tag. These peptides are separated by microcapillary HPLC. A pair of ICAT-labeled peptides are chemically identical and
are easily visualized because they essentially coelute, and there is an 8-dalton mass difference measured in a scanning mass spec-
trometer (four m/z units difference for a doubly charged ion). The ratios of the original amounts of proteins from the two cell states
are strictly maintained in the peptide fragments. The relative quantification is determined by the ratio of the peptide pairs. Every
other scan is devoted to fragmenting and then recording sequence information about an eluting peptide (tandem mass spectrum).
The protein is identified, with the aid of a computer, by searching with the recorded sequence information against large protein data-
bases. (Adapted, with permission, from Gygi et al. 1999.)
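
The m/z spacing quoted in the caption follows directly from the nominal 8-dalton mass tag and the charge state. The sketch below generalizes it to peptides carrying more than one labeled cysteine. The function names are illustrative, and the nominal 8-dalton shift is used rather than the exact deuterium/hydrogen mass difference.

```python
NOMINAL_TAG_SHIFT_DA = 8.0  # d8 minus d0, nominal value from the protocol


def pair_spacing_mz(charge, labeled_cysteines=1):
    """Expected m/z separation between light and heavy forms of one peptide."""
    if charge < 1 or labeled_cysteines < 1:
        raise ValueError("charge and labeled_cysteines must be at least 1")
    return labeled_cysteines * NOMINAL_TAG_SHIFT_DA / charge


def heavy_to_light_ratio(heavy_intensity, light_intensity):
    """Relative abundance of the d8- versus d0-labeled peptide."""
    return heavy_intensity / light_intensity


for z in (1, 2, 3):
    print(f"charge {z}+: pair separated by {pair_spacing_mz(z):.2f} m/z units")
```

A doubly charged peptide with one labeled cysteine gives a 4 m/z spacing, as in the caption; a peptide with two labeled cysteines doubles the spacing at the same charge.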
STAGE 1: Labeling Proteins with ICAT


MATERIALS 


CAUTION: See Appendix 3 for appropriate handling of materials marked with <!>. 


Reagents
Dithiothreitol (DTT) (1 M) <!>
EDTA (0.5 M)


ICAT reagents 
The ICAT reagents are either deuterated (“heavy”) or native (“light”) and can either be purchased 
from Applied Biosystems or synthesized (see Gygi et al. 1999). Dissolve each ICAT reagent in 
methanol <!> prior to adding it to proteins in solution. The volume of methanol used to dissolve 
the reagent should be no more than 10% of the final volume of the proteins in solution.


Tributylphosphine (TBP) (Aldrich Chemical Company) <!> 
If using small amounts of TBP, prepare a 1:10 dilution of TBP in 1-propanol (HPLC grade) <!>.
WARNING: TBP is extremely toxic. Always work in a chemical fume hood. 


Tris(2-carboxyethyl) phosphine hydrochloride (TCEP) <!> can substitute for TBP as the reducing 
agent. 


Tris (50 mM, pH 8.3) containing 0.5% SDS <!>, 5 mM EDTA, and 6 M urea <!>
Trypsin (Promega sequencing-grade modified trypsin or equivalent)
Prepare the trypsin at 1 µg/µl.


Biological Samples
Lyophilized or precipitated protein samples


Additional Reagents
Step 8, if performed, requires the reagents from Chapter 7, Protocol 11. 


METHOD


* Be extremely careful with the protein concentrations throughout the experiment.

* Handle samples on ice unless otherwise noted.

* The pH should remain at ~8.3 during the labeling steps.

* Work with equivalent amounts of sample throughout this stage of the protocol.


1. Solubilize each protein pellet (300 µg) in 50 µl of 50 mM Tris (pH 8.3) containing 0.5%
(w/v) SDS, 6 M urea, and 5 mM EDTA.


The SDS is used to solubilize hydrophobic membrane proteins. If the sample does not include 
such proteins, omit the SDS. 


(Optional) Set aside ~1-2 µg of native/unlabeled protein to load on a gel when checking the
efficiency of the labeling reaction (see Step 8).


2. When the proteins are in solution, dilute each sample 1:10 with H2O, adding EDTA to
keep it at 5 mM. In the end, 0.05% SDS, 5 mM Tris, and 5 mM EDTA are needed.


The urea must be <1.5 M for the tryptic digest (Step 7) to work.


3. Add TBP to the protein samples to a final TBP concentration of 5 mM to reduce the pro-
teins. Incubate the solutions for 10 minutes at room temperature.


TCEP can substitute for TBP to reduce proteins. 


4. Add either light or heavy ICAT reagent to its respective protein sample to a final ICAT
concentration of 1-1.5 mM. Incubate the reaction with stirring for 1.5-2 hours at room
temperature in the dark.


For example, place mini-stir bars in the sample tubes, wrap the tubes in aluminum foil, and 
place the tubes on a magnetic stir plate. 


5. Add DTT to the reaction to a final concentration of 10 mM to inactivate residual ICAT
reagent and limit the formation of adducts between the ICAT reagent and TBP. 


6. Combine equivalent amounts of heavy and light peptide samples.


Methods for determining equivalent amounts vary (e.g., cell counts and protein assay, per-
formed after sample preparation and before labeling).


7. Digest the pooled samples with trypsin at a mass ratio of 1:25 (trypsin:protein) for 3
hours at 37°C. 


8. (Optional) Check the efficiency of the labeling reaction by running the proteins on a
polyacrylamide gel and visualizing the separated proteins with a silver stain (see Chapter
2, Protocol 7 or Chapter 7, Protocol 11). Use a 4-15% or 4-20% gradient minigel. Include
unlabeled protein (from Step 1), labeled protein (from Step 5), and post-digest peptides
on the gel.


If labeling works, some bands should increase in molecular weight (and hence migrate a short-
er distance from the origin) when comparing the unlabeled and labeled protein lanes. If and
how much they increase depends on the number of accessible cysteines within each protein.


9. Proceed to the cation exchange procedure (Stage 2).
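
The arithmetic behind the tenfold dilution (Step 2) and the trypsin addition (Step 7) can be laid out explicitly. This is a sketch under stated assumptions: each pellet is dissolved in 50 µl and brought to 500 µl, EDTA is topped up from the 0.5 M stock, and the pooled digest contains two 300-µg samples. The variable names are illustrative.

```python
START = {"urea_M": 6.0, "SDS_percent": 0.5, "Tris_mM": 50.0}
FOLD = 10.0

# Components that are simply diluted tenfold
after_dilution = {name: value / FOLD for name, value in START.items()}

# EDTA is held at 5 mM, so the shortfall is made up from the 0.5 M (500 mM) stock
start_vol_ul, final_vol_ul = 50.0, 500.0
edta_target_mM, edta_stock_mM = 5.0, 500.0
edta_needed = edta_target_mM * final_vol_ul    # nmol required in the final volume
edta_present = edta_target_mM * start_vol_ul   # nmol carried over from Step 1
edta_stock_ul = (edta_needed - edta_present) / edta_stock_mM

# Trypsin at 1:25 (trypsin:protein) for two pooled 300-µg samples
pooled_protein_ug = 2 * 300.0
trypsin_ug = pooled_protein_ug / 25.0

print(after_dilution)
print(f"EDTA stock to add per sample: {edta_stock_ul:.1f} µl")
print(f"Trypsin for pooled sample: {trypsin_ug:.0f} µg")
```

After dilution the urea is at 0.6 M, comfortably below the 1.5 M ceiling noted for the tryptic digest.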
STAGE 2: Cation Exchange Cleanup of ICAT Samples


MATERIALS


CAUTION: See Appendix 3 for appropriate handling of materials marked with <!>.


Reagents

Buffer A (10 mM KH2PO4, 25% acetonitrile [pH 3.0]) <!>
Buffer B (10 mM KH2PO4, 25% acetonitrile, 350 mM KCl [pH 3.0])


Equipment

HPLC column (e.g., 2.1 mm x 20 cm, 5 µm particles, 300 Å pore size Polysulfoethyl A, a
strong cation-exchange material; PolyLC Inc.)
HPLC system


Biological Sample

ICAT-labeled peptides (from Stage 1, Step 7)


METHOD


1. Set up the cation exchange column.


2. Acidify the peptide samples (from Stage 1, Step 7) to pH ≤3 with Buffer A to a final vol-
ume of 2 ml.


Acidification of the samples is important, because the peptides will not be fully charged at
higher pH values and may not stick to the column!


3. Load from 200 µg to ~5 mg of digested ICAT-labeled total protein onto the column,
using a 2-ml sample loop.


4. Set the flow rate to 200 µl/min and the gradient to the times and percentages listed below:


Time (min)    % Buffer B
0             0
30            25
50            100


5. Collect fractions at 1-2-minute intervals.


The gradient shown is designed to spread out the elution of doubly charged peptides, with
these peptides usually eluting starting at ~8-9 minutes into the run until ~15-16 minutes, after
which triply charged peptides begin to elute.


6. Proceed to the avidin column procedure (Stage 3) when all of the samples have eluted
from the column.


Varying numbers of cation-exchange fractions have been pooled prior to running them over the 
avidin column. If and how much is pooled depends on the complexity of the sample and the 
sensitivity of the mass spectrometer. Consider running a single cation exchange fraction over the
avidin column and analyzing the resulting eluent in the mass spectrometer. The MS data should 
indicate whether or not to subsequently pool the remaining cation exchange fractions. 


The fractions selected for avidin purification should be those that show UV absorbance in 
the cation-exchange HPLC fractionation, indicating a significant amount of peptide in 
those fractions. 
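
The gradient table in Step 4 can be turned into a small lookup that gives the percentage of Buffer B, and hence the approximate KCl concentration, at any time point. The sketch assumes straight-line segments between the tabulated points, that the first row of the table is 0 minutes at 0% B, and that Buffer B contains 350 mM KCl as listed under Reagents. Gradient delay volume is ignored.

```python
GRADIENT = [(0.0, 0.0), (30.0, 25.0), (50.0, 100.0)]  # (minutes, % Buffer B)
KCL_IN_BUFFER_B_MM = 350.0


def percent_b(t_min, table=GRADIENT):
    """Linear interpolation of % Buffer B at t_min; clamps outside the table."""
    if t_min <= table[0][0]:
        return table[0][1]
    for (t0, b0), (t1, b1) in zip(table, table[1:]):
        if t0 <= t_min <= t1:
            return b0 + (b1 - b0) * (t_min - t0) / (t1 - t0)
    return table[-1][1]


def kcl_mM(t_min):
    """Approximate KCl concentration delivered by the pumps at t_min."""
    return KCL_IN_BUFFER_B_MM * percent_b(t_min) / 100.0


for t in (8, 16, 30, 40, 50):
    print(f"{t:>2} min: {percent_b(t):5.1f}% B, ~{kcl_mM(t):5.1f} mM KCl")
```

On this reading, the window in which doubly charged peptides are said to elute (~8 to ~16 minutes) corresponds to roughly 23 to 47 mM KCl at the pump.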
STAGE 3: Selection of Tagged Proteins Using an Avidin Column


MATERIALS


CAUTION: See Appendix 3 for appropriate handling of materials marked with <!>.


Reagents

Biotin blocking reagent (2 mM d-biotin <!> in PBS)

Elution reagent (30% acetonitrile <!> containing 0.4% trifluoroacetic acid [TFA]) <!>
Use glass to handle the elution reagent.

NH4HCO3 (1 M and 50 mM, pH 8.3) <!>
Prepare 50 mM NH4HCO3 in 20% methanol <!>.

2x Phosphate-buffered saline (PBS)

Regeneration buffer (100 mM glycine <!> at pH 2.8)
The regeneration buffer must be sterile, and it does not last long. Autoclave before use, and store at
4°C, where it will last 1 week.


Equipment

Avidin beads (e.g., Ultralink Monomeric Avidin, Pierce Chemical)
Pierce provides helpful information for packing and usage of their avidin columns (see their tech-
nical literature at www.piercenet.com).
At any time during the column preparation, it is acceptable to wash the column with 2x PBS con-
taining 0.01% NaN3 <!> until the pH of the flowthrough is ~7. The column can then be stored for
an extended period at 4°C (for overnight storage, the NaN3 is unnecessary). Make sure that the col-
umn packings are submerged in buffer during storage.

Centrifugal concentrator (e.g., SpeedVac, Thermo Savant)
Glass pipette
Glass wool

Peptide collection tubes (glass, four needed per column)
See Steps 17-20.

pH meter or paper


Biological Sample

Peptide fractions from the cation-exchange column
From Stage 2, Step 5. See the note at Stage 2, Step 6.


METHOD


Preparation of the Avidin Column 


1. Pack a small piece of glass wool into the neck of a glass pipette. Pack 400 µl of avidin into
the tube.


The avidin slurry comes as a 50% dilution, so 800 µl of 50% slurry must be added in order to
get 400 µl of avidin.


The actual amount of avidin used can vary, although 400 µl of avidin beads will bind ~500 µg
of peptides.


2. Allow the avidin beads to settle, and then wash the column with 2x PBS to dispel the
beads from the side of the tube.


3. Wash the column with 30% acetonitrile containing 0.4% TFA until the flowthrough pH
changes to ~1. Continue to wash the column with one additional column volume (400 µl)
of the acetonitrile/TFA.


This acidic wash step removes polymers associated with the beads.


EXPERIMENTAL TIP: Use a glass pipette when pipetting acetonitrile or TFA.


4. Wash the column with 2x PBS (pH 7.2) until the flowthrough pH is ~7.2. Continue to
wash the column with 3 more column volumes (1200 µl) of PBS.


5. Wash the column with 3-4 column volumes of d-biotin blocking reagent.


The d-biotin blocks the more retentive avidin sites on the column, ensuring recovery of the
sample from the remaining binding sites in Steps 17-19.


6. Wash off loosely bound biotin with ~6 column volumes (2400 µl) of regeneration buffer
until the flowthrough pH is ~2.8.


7. Wash the column with 6 column volumes of 2x PBS to return the column to an approx-
imate pH of 7.2 (as measured by the pH of the flowthrough).
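
Because every wash in the column preparation is specified in column volumes, the working volumes scale with the amount of avidin packed. The sketch below converts the stated column-volume counts into microliters for the 400-µl column described in Step 1. The dictionary keys are illustrative labels, and a different bed volume can be passed in if less or more avidin is used.

```python
# (minimum, maximum) column volumes for each fixed-volume wash in the preparation
WASHES_CV = {
    "extra acetonitrile/TFA wash (Step 3)": (1, 1),
    "extra 2x PBS wash (Step 4)": (3, 3),
    "d-biotin blocking reagent (Step 5)": (3, 4),
    "regeneration buffer (Step 6)": (6, 6),
    "2x PBS re-equilibration (Step 7)": (6, 6),
}


def wash_volumes_ul(column_volume_ul, washes=WASHES_CV):
    """Return {wash name: (min µl, max µl)} for a given packed bed volume."""
    return {
        name: (lo * column_volume_ul, hi * column_volume_ul)
        for name, (lo, hi) in washes.items()
    }


def slurry_needed_ul(bed_volume_ul, slurry_fraction=0.5):
    """Volume of slurry to pipette to obtain bed_volume_ul of settled beads."""
    return bed_volume_ul / slurry_fraction


volumes = wash_volumes_ul(400.0)
for name, (lo, hi) in volumes.items():
    label = f"{lo:.0f} µl" if lo == hi else f"{lo:.0f}-{hi:.0f} µl"
    print(f"{name}: {label}")
print(f"50% slurry to pack: {slurry_needed_ul(400.0):.0f} µl")
```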


Preparation of the Biotinylated Protein 


8. Place the sample (from Stage 2, Step 5) in a centrifugal concentrator to remove the ace-
tonitrile. Reduce the original sample volume by at least 25%.


9. Adjust the pH of the sample to 7.2 with 1 M NH4HCO3 (<10 µl should be sufficient to
adjust the pH).


10. Add 2x PBS to increase the sample volume to approximately three quarters of the total
column volume (sample volume = 250-300 µl).


Loading and Incubation of the Sample on the Column 


11. Load the sample onto the column, and allow it to flow through until the solution is level
with the top of the column.


12. Add 100 µl of 2x PBS to the loaded sample, and cover the top of the column to stop the
flow.


13. Allow the sample to incubate in the column for ~20 minutes.


Washing Unbound Material from the Column 


14. Wash the column with 5 column volumes of 2x PBS (pH 7.2) and start collecting frac-
tions.


The unbound material washed from the column in this step and the two subsequent steps
can be pooled into a single fraction.


15. Wash the column with 5 column volumes of 1x PBS to reduce the salt concentration.


16. Wash the column with 6 column volumes of 50 mM NH4HCO3 (pH 8.3) in 20%
methanol while continuing to collect fractions.


The NH4HCO3 further reduces the salt concentration and the methanol removes hydrophobic
peptides.


Collection of the Biotinylated Peptides 


17. Place the first peptide collection tube underneath the outlet of the column. Add 1 column
volume of elution reagent onto the top of the column, and allow it to flow into the col-
umn until the reagent is flush with the top of the column.


18. Add another column volume of elution reagent while placing the second peptide collec-
tion tube beneath the column.


19. Repeat Step 18 two more times so that a total of four fractions have been collected.


20. The fractions that include the peptides will be the first collection tube to have an acidic
pH and the collection tube that follows it. Pool these two samples.


21. Proceed to analyze the peptides by MS (Stage 4).

STAGE 4: Analysis of ICAT-labeled Peptides by MS


MATERIALS 


CAUTION: See Appendix 3 for appropriate handling of materials marked with <!>. 


Reagents

Reversed-phase Solvent A
0.5% acetic acid <!>
0.005% heptafluorobutyric acid <!>
5% acetonitrile <!>

Reversed-phase Solvent B
0.4% acetic acid
0.005% heptafluorobutyric acid
100% acetonitrile


Equipment
Centrifugal concentrator (e.g., SpeedVac, Thermo Savant)
Mass spectrometer
Reversed-phase column packing material (100 Å spherical silica, C18 [5 µm mean particle
size])
RP-HPLC column
Use a column with one of the following sets of dimensions:
50 µm I.D./200 µm O.D.
75 µm I.D./360 µm O.D.
100 µm I.D./350 µm O.D.


Biological Sample


Pooled fractions from avidin column 
From Stage 3, Step 20. 


METHOD 


1. Concentrate the pooled avidin fractions in a centrifugal concentrator. Do not allow the 
sample to dry out. 


2. Centrifuge the sample in a microfuge at 14,000g for 15 minutes to pellet any particulates 
(there should be no visible or a barely visible pellet). 


3. Transfer the supernatant to a fresh tube. 


4. Pressure-load varying percentages of the sample onto a reversed-phase column that is in-
line with the mass spectrometer (for additional details on using an LC-MS setup, see
Protocols 4 and 5).


The percentages of sample to load will depend on the complexity of the sample and should be
determined empirically.

5. Wash the column with Reversed-phase Solvent A for at least 10 minutes to remove salts.


6. Elute the sample from the column directly into the mass spectrometer using the follow- 
ing elution and MS conditions: 


Flow rate: At the column tip, the flow rate is 200—300 nl/min (i.e., the HPLC pump is run 
at ~0.1 ml/min and the flow is split prior to the column). 


Gradient protocol (example): 


% Solvent B Minutes 
0 0
i 5 
40 45 
80 50 
80 55 
0 56 
0 66


Mass spectrometer operating conditions: The total mass spectrum (m/z 400-1800) is 
scanned in 1 second. If the most-intense peak in the full scan is above a dictated thresh-
old, the following scan will be an MS/MS scan of that most-intense peak. If no single peak
surpasses the MS/MS threshold value, then a full scan per second is repeatedly made until 
a large enough peak is found. An example of a threshold value is 500,000 counts mini- 
mum signal (required for MS/MS). Masses of analyzed peaks are put into an exclusion 
table for several minutes to prevent recurrent MS/MS analysis of the same peak. 


Heated capillary temperature: 170°C 


Collision energy to form MS/MS fragments: 27 


These operating parameters were established on an LCQ ion trap mass spectrometer 
(ThermoFinnigan MAT [San Jose, California]). Other instruments may require slightly dif- 
ferent parameters that will have to be determined empirically. 
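
The acquisition logic described under the mass spectrometer operating conditions (intensity threshold plus a timed exclusion table) can be illustrated in a few lines. This is a simplified sketch, not the instrument's control software: the 3-minute exclusion time and the 1.5 m/z matching tolerance are assumed values standing in for "several minutes" and for whatever mass window the instrument applies, and the most intense peak is taken among peaks not currently excluded.

```python
THRESHOLD_COUNTS = 500_000   # example threshold from the protocol
EXCLUSION_MIN = 3.0          # assumed; the protocol says "several minutes"
MZ_TOLERANCE = 1.5           # assumed matching window for the exclusion table


def pick_precursor(peaks, now_min, exclusion):
    """Choose the precursor for the next MS/MS scan, or None for another full scan.

    peaks: list of (mz, intensity) from the full scan.
    exclusion: list of (mz, expires_at_min); updated in place.
    """
    # Drop exclusion entries whose time has run out
    exclusion[:] = [(mz, exp) for mz, exp in exclusion if exp > now_min]
    candidates = [
        (intensity, mz)
        for mz, intensity in peaks
        if intensity >= THRESHOLD_COUNTS
        and all(abs(mz - ex_mz) > MZ_TOLERANCE for ex_mz, _ in exclusion)
    ]
    if not candidates:
        return None
    _, chosen_mz = max(candidates)
    exclusion.append((chosen_mz, now_min + EXCLUSION_MIN))
    return chosen_mz


scan = [(652.3, 2.0e6), (810.4, 8.0e5), (500.2, 1.0e5)]
table = []
first = pick_precursor(scan, 10.0, table)    # most intense peak
second = pick_precursor(scan, 10.1, table)   # next peak, first one is excluded
third = pick_precursor(scan, 10.2, table)    # remaining peak is below threshold
later = pick_precursor(scan, 13.5, table)    # exclusions have expired
print(first, second, third, later)
```

The exclusion table is what lets the second, weaker precursor be fragmented while the dominant ion is still eluting.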


PROTOCOL 10 


In Vivo Isotopic Labeling of Proteins for Quantitative 
Proteomics 


STABLE ISOTOPE CODING STRATEGIES ARE OF IMMENSE VALUE in determining protein concentra-
tion changes in cells and tissues triggered by regulatory stimuli (e.g., drugs and toxins) and
disease (e.g., caused by mutational changes). Recognizing and identifying the small number
of proteins whose expression levels differ as a consequence of disease or external stimuli are
complicated by the complexity of biological extracts and the fact that protein posttransla-
tional modifications are numerous and can occur at many sites on a protein.

One way of quantifying global protein expression patterns involves in vivo labeling.
Using the following protocol, stable isotopes can be incorporated into metabolic products,
and the relative difference between these products from cells grown in normal or isotope-
enriched media can be readily quantified by MS analysis. This protocol was provided by
Yoshiya Oda (Eisai Co., Ltd., Ibaraki, Japan).


MATERIALS


CAUTION: See Appendix 3 for appropriate handling of materials marked with <!>.


Reagents


Acetone, ice-cold <!> 

Acetonitrile <!> 

Acetonitrile/H2O/TFA (66/33/0.1 [v/v/v] and 5/95/0.1 [v/v/v])

Ammonium bicarbonate (50 mM) <!>

Ammonium bicarbonate (25 mM) containing 0.05 µg of trypsin and 0.1% n-octylglucoside

Ammonium bicarbonate (25 mM) containing 0.1% n-octylglucoside

Cell culture growth media 
The experiment requires growth medium capable of supporting the cells under study (bacterial, 
yeast, other fungi, insect, or mammalian). The medium must be available in two varieties, which 
are identical except for the inclusion of 15N-labeled metabolites in one. Two sources of such media
are Spectra Stable Isotopes (http://www.spectrastableisotopes.com), which produces the Celtone
and Yeastone products; and Cambridge Isotope Laboratories, Inc. (Andover, Massachusetts,
http://www.isotope.com/cil/index.html), which produces the Bio-Express line of growth media.
Experience with Bio-Express 1000 medium indicates that it is preferable to add tryptophan to the
medium to a final concentration of 40 mg/ml. Bio-Express 1000 media are provided as sterile, 10x
concentrates in H2O. Dilution experiments indicate a linear growth response over the range of
0.25x to 2x concentrations. Because growth rates of S. cerevisiae are slightly slower in Bio-Express
1000 medium, prepare a 2x solution as the working stock. However, growth conditions must be
optimized empirically for each cell type and medium. 

Dry ice or liquid nitrogen <!> 

H2O, ice-cold

Ice

2-Mercaptoethanol (10 mM) <!>

Methanol (50%) <!>/40% H2O/10% acetic acid
Potassium ferricyanide (15 mM) <!>

Protein extraction reagent (e.g., Y-PER reagent, Pierce)
Protease inhibitor cocktail 

Sodium thiosulfate (50 mM) 


MATERIALS FOR PROTEIN SEPARATION USING 2D GEL ELECTROPHORESIS


See Step 22. For additional details on running 2D gel electrophoresis, see Chapter 4. 
Rehydration buffer A 
0.5 M Tris-Cl (pH 8.3) 
8 M urea <!> 
0.5% Triton X-100 <!>
2 mM dithiothreitol (DTT) <!>
0.2% Ampholine (Amersham Biosciences)*
*Choose the Ampholine set with the appropriate pH range for the proteins to be separated.
Immobiline dry strip gel (Amersham Biosciences)
Polyacrylamide gel (12.5%T, 2.7%C)
Electrophoresis apparatus (Hoefer DALT system, Amersham Biosciences, or equivalent) 


MATERIALS FOR PROTEIN SEPARATION USING HPLC AND SDS-PAGE


See Step 22. For additional details on running ion-exchange HPLC and SDS-PAGE, see Chapters 5 
and 2, respectively. 
Rehydration buffer B
20 mM Tris-Cl (pH 8.3) 
8 M urea <!> 
0.5% Triton X-100 <!> 
Mono-Q column (HR 5/5, Amersham Bioscience, or equivalent) 
Mobile phase A: 20 mM Tris-Cl (pH 8.3) containing 1 M urea 
Mobile phase B: 20 mM Tris-Cl (pH 8.3) containing 1 M urea and 1 M NaCl
HPLC system
Trichloroacetic acid (TCA) (10%) <!>
SDS-PAGE sample buffer (2x sample buffer for SDS-PAGE: 125 mM Tris-Cl [pH 6.8], 20% glyc-
erol, 4% SDS <!>, 100 mM DTT <!>, 0.04% bromophenol blue)
Triglycine 8—16% gradient gel (Invitrogen-NOVEX) 


MATERIALS FOR PROTEIN SEPARATION USING 2D HPLC


See Step 22. For additional details on running HPLC, see Chapter 5.
Rehydration buffer B (see recipe above) 
Mono-Q column (HR 5/5, Amersham Bioscience, or equivalent) 

Mobile phase A: 20 mM Tris-Cl (pH 8.3) containing 1 M urea <!> 

Mobile phase B: 20 mM Tris-Cl (pH 8.3) containing 1 M urea and 1 M NaCl

HPLC system 

RP-HPLC column (YMC-Pack CN-AP 4.6 x 150 mm, Ap-502 [YMC Inc., Kyoto, Japan] or equiv-
alent)

Solvent A: 5% acetonitrile containing 0.1% TFA 

Solvent B: 95% acetonitrile containing 0.1% TFA 

Trypsin

Equipment
Centrifugal evaporator (SpeedVac, Thermo Savant or equivalent) 
Centrifuge tubes (100-ml capacity) 


Hemocytometer 
Optional, see Step 8. 


Incubator preset to 30°C


MALDI-TOF mass spectrometer and an LC-MS/MS system 
See Step 34 and Protocols 4—7. 


Sonicator 
Spectrophotometer 


Biological Sample


Saccharomyces cerevisiae, freshly grown colonies on a Petri plate
Other types of cells can be labeled using this protocol, provided the growth conditions are adjust-
ed to optimize growth of the cells under study.


Additional Reagents


If a gel is used for the separation of proteins, Step 22 will require the silver stain reagents from 
Chapter 2, Protocol 7 or Chapter 4, Protocol 12. 


METHOD 


Culturing of Cells 


1. Beginning with a freshly grown plate, choose a single yeast colony, and inoculate 2 ml of
15N-labeled culture medium with the colony. Choose a second colony as a control sam-
ple, and inoculate 2 ml of unlabeled culture medium with that colony.


2. Vigorously vortex the tubes of medium for ~1 minute to disperse the cells thoroughly.

3. Incubate the two cultures with shaking at 230-270 rpm overnight at 30°C.


4. Following overnight growth, measure the culture growth with a spectrophotometer. The
cultures should be dense (OD600 >1.5).


5. Vortex the overnight cultures for ~1 minute to disperse any cell clumps.


6. Inoculate 50 ml of 15N-labeled medium with the entire overnight culture grown in 15N-
labeled medium. Likewise, inoculate 50 ml of the unlabeled medium with the entire con-
trol culture.


7. Incubate with shaking at 220-250 rpm at 30°C until the cultures have reached mid-log
phase (OD600 <1.0).


Preparation of Cultures for Protein Extraction


8. Determine the cell density by measuring the culture at 600 nm in a spectrophotometer or
by counting cells in a hemocytometer.


9. Using as much of each culture as possible, combine equal numbers of cells from the
15N-labeled culture and the unlabeled control culture.


10. Quickly chill the culture by pouring it into two chilled 100-ml centrifuge tubes filled
halfway with ice.


11. Immediately place the tubes in a chilled rotor and centrifuge them at 1000g for 5 minutes
at 4°C.


12. Pour off the supernatant and resuspend the cell pellet in 50 ml of ice-cold H2O.


Any unmelted ice pours off with the supernatant.


13. Recover the pellet by centrifugation at 1000g for 5 minutes at 4°C.


14. Again, discard the supernatant, and immediately freeze the cell pellet by placing the tubes
on dry ice or in liquid nitrogen. Store the cells at -80°C until they are needed.
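
Combining equal numbers of cells (Step 9) while using as much of each culture as possible reduces to a proportion. The sketch below assumes that cell number scales linearly with OD600 times volume, which holds only within the linear range of the spectrophotometer, and that 50 ml of each culture is available. Hemocytometer counts (cells/ml) can be substituted for the OD values. The function name is illustrative.

```python
def volumes_for_equal_cells(density_labeled, density_unlabeled, available_ml=50.0):
    """Return (ml of labeled culture, ml of unlabeled culture) holding equal cell numbers.

    Densities may be OD600 readings or cells/ml, as long as both use the same unit.
    All of the less dense culture is used; the denser one is scaled down to match.
    """
    if density_labeled <= 0 or density_unlabeled <= 0:
        raise ValueError("densities must be positive")
    limiting = min(density_labeled, density_unlabeled) * available_ml
    return limiting / density_labeled, limiting / density_unlabeled


labeled_ml, unlabeled_ml = volumes_for_equal_cells(0.8, 1.0)
print(f"Mix {labeled_ml:.1f} ml of 15N culture with {unlabeled_ml:.1f} ml of control culture")
```

In this example the slower-growing labeled culture is used in full, and the control culture is reduced to 40 ml.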


Preparation of Protein Extracts 


15. Add an appropriate volume of protein extraction reagent containing a protease inhibitor
cocktail and 10 mM 2-mercaptoethanol to the frozen cell pellet. Usually 2.5-5 ml of the
extraction reagent, Y-PER reagent, is used for a 1-ml (1 g) cell pellet.


16. Incubate the cells gently for 30 minutes at room temperature.

17. Spin down the debris and collect the supernatant.

18. Add 5 volumes of ice-cold acetone to the supernatant and vortex the tube.

19. Incubate the protein extract for 10 minutes at -80°C.

20. Centrifuge the proteins in a microfuge at 20,000g for 5 minutes at 4°C.


21. Discard the supernatant and immediately carry out Step 22 or store the precipitated pro-
teins at -80°C until they are needed.


Separation of Proteins 


22. Three different ways to separate the protein mixture into individual proteins are 2D gel
electrophoresis, HPLC combined with SDS-PAGE, and 2D HPLC. Perform one of these
procedures for separating the proteins.


For 2D gel electrophoresis: The following steps are a summary of the 2D gel electrophoresis 
procedure. For step-by-step instructions, see Chapter 4. 


a. Dissolve 500 µg of precipitated protein (Step 21) in 400 µl of Rehydration buffer A.
b. Rehydrate on an Immobiline dry strip gel. 


c. Perform IEF in the first dimension. 
d. Apply the strip to a 12.5%T, 2.7%C polyacrylamide gel, and separate the proteins in
the second dimension. 


e. Following the completion of 2D electrophoresis, stain the gel with silver using a pro- 
cedure that is compatible with subsequent analysis of the proteins by MS (see Chapter 
4, Protocol 12). 


f. Proceed to Step 23. 


For HPLC-SDS/PAGE: The following steps are a summary of the ion-exchange HPLC and 1D 
gel electrophoresis procedures. For step-by-step instructions, see Chapters 5 and 2, respec- 
tively. 


a. Dissolve 10 mg of precipitated protein (Step 21) in 2 ml of Rehydration buffer B. 


b. Establish a column flow rate of 1 ml/min. Set the concentration gradient for Mobile
phase B at: 


0-5 minutes, 0% B 
5-45 minutes, 0-50% B 
45-60 minutes, 50-100% B


c. Collect 30 2-ml fractions. 


d. After the HPLC is complete, precipitate the proteins from each 2-ml fraction with 1
ml of 10% (w/v) TCA. Vortex the tubes and incubate for 30 minutes on ice.


e. Centrifuge the tubes at 3000g for 30 minutes at room temperature. 


f. Discard the supernatant. Add SDS-PAGE sample buffer, and run each fraction in a
separate lane of an 8-16% gradient Tris-glycine gel.


g. Stain the gels with silver using a procedure that is compatible with subsequent analy- 
sis of the proteins by MS (see Chapter 2, Protocol 7). 


h. Proceed to Step 23. 
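
The ion-exchange gradient given in step b is piecewise linear, so the mobile-phase composition at any time, and hence for any collected fraction, can be found by interpolation. The following is a minimal sketch that assumes no delay volume between the pump and the fraction collector; the helper names are hypothetical.

```python
# (time in minutes, % Mobile phase B) breakpoints taken from step b.
GRADIENT = [(0.0, 0.0), (5.0, 0.0), (45.0, 50.0), (60.0, 100.0)]


def percent_b(t_min, gradient=GRADIENT):
    """Linearly interpolate %B at time t_min within the programmed gradient."""
    if not gradient[0][0] <= t_min <= gradient[-1][0]:
        raise ValueError("time lies outside the programmed gradient")
    for (t0, b0), (t1, b1) in zip(gradient, gradient[1:]):
        if t0 <= t_min <= t1:
            return b0 + (b1 - b0) * (t_min - t0) / (t1 - t0)


def fraction_window(n, flow_ml_min=1.0, fraction_ml=2.0):
    """Start and end time (minutes) of fraction n, counting from 1."""
    duration = fraction_ml / flow_ml_min
    return (n - 1) * duration, n * duration


start, end = fraction_window(13)
print(f"Fraction 13 elutes between {percent_b(start):.1f}% and {percent_b(end):.1f}% B")
```

At 1 ml/min, thirty 2-ml fractions span exactly the 60 minutes of the gradient, which is consistent with step c.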


For 2D-HPLC: The following steps are a summary of the ion-exchange and RP-HPLC proce- 
dures. For step-by-step instructions, see Chapter 5. 


a. Dissolve 10 mg of precipitated protein (Step 21) in 2 ml of Rehydration buffer B. 


b. Establish a column flow rate of 1 ml/min. Set the concentration gradient for mobile 
phase B at: 


0-5 minutes, 0% B
5-45 minutes, 0-50% B 
45-60 minutes, 50-100% B 


c. Collect 30 2-ml fractions. 


d. Inject each 2-ml fraction into the second HPLC column, a reversed-phase column.


Solvent A is 5% acetonitrile containing 0.1% TFA, and Solvent B is 95% acetonitrile
containing 0.1% TFA.

e. Establish a column flow rate of 1 ml/min. Set the concentration gradient for Mobile
phase B at:

0-5 minutes, 0% B
5-60 minutes, 0-55% B

f. Collect 30 2-ml fractions.

g. After the second HPLC is complete, dry the fractions in a centrifugal evaporator.

h. Dissolve the fractions in 20 µl of 20 mM NH4HCO3 and 5 mM n-octylglucoside.
Vortex and sonicate the samples. Check that the pH is ~8.5.

i. Add 0.05-1 µg of trypsin in 1 µl to the fractions. Incubate overnight at 37°C.

The amount of trypsin to use will depend on the protein concentration. The final
substrate-to-trypsin ratio should be 50:1.

j. Proceed to Step 34.


In-gel Digestion

23. Destain the silver-stained gels for a few minutes with a freshly prepared mixture of 15 mM
potassium ferricyanide and 50 mM sodium thiosulfate.

24. Wash the gels vigorously with 500 µl of 50% methanol/40% H2O/10% acetic acid for 30
minutes.

25. Repeat Step 24 four more times, using fresh washing solution each time.

26. Incubate the gels with 500 µl of 50 mM ammonium bicarbonate solution for 5 minutes.

27. Incubate the gels with 500 µl of acetonitrile for 5 minutes.

28. Dry the gel pieces completely in a centrifugal evaporator.

29. Swell the dried gel pieces in 2 µl of 25 mM ammonium bicarbonate containing 0.05 µg of
trypsin and 0.1% n-octylglucoside.

30. After all solvents have penetrated into the gels (usually 5-10 minutes), add 10 µl of 25 mM
ammonium bicarbonate containing 0.1% n-octylglucoside, and allow the mixture to
stand for 2 hours at 37°C.

31. Extract the tryptic peptides twice with 40 µl of acetonitrile/H2O/TFA (66/33/0.1 [v/v/v])
solution in a 350 W sonicator for 5-10 minutes.

32. Dry the combined extracts (~90 µl) in a centrifugal evaporator.


Mass Spectrometric Analysis of the Proteins 


33. Dissolve the dried tryptic peptides in 5 µl of acetonitrile/H2O/TFA (5/95/0.1 [v/v/v])
solution.

34. Subject 10% of the digest to MALDI-TOF-MS, and use the remainder for LC-MS/MS
analysis.

Proteins separated by 2D-HPLC and digested in solution must be passed through a ZipTip
prior to MALDI or loaded onto an RP-HPLC column prior to MS/MS.


PROTOCOL 11


Anhydrotrypsin Affinity Capture of Proteins


MATERIALS 


MODIFICATION OF SER-195, LOCATED WITHIN THE ACTIVE SITE of trypsin, to a dehydroalanine
residue renders the enzyme catalytically inert. However, the resulting anhydrotrypsin
derivative retains a strong affinity for peptides that contain a carboxy-terminal arginine or lysine
residue. This interaction is such that a column prepared of the immobilized form of
anhydrotrypsin can be effective in capturing peptides with basic carboxyl termini from a typical
protein digest. This anhydrotrypsin affinity capture technique was developed more than two
decades ago (Yokosawa and Ishii 1976; Ishii et al. 1979) and remains useful today, especially
when used in conjunction with MS for the characterization of the carboxyl termini of
proteins (Sechi and Chait 2000; Bures et al. 2001). Anhydrotrypsin affinity capture is a reliable
technique for isolating the carboxy-terminal peptide of a tryptically digested protein
(Kumazaki et al. 1986, 1987). Because the carboxy-terminal tryptic peptides of proteins
typically do not contain a carboxy-terminal arginine or lysine, these peptides can be selectively
isolated from complex tryptic digests of proteins by their inability to bind to an
anhydrotrypsin affinity capture column (all other peptides, which carry a carboxy-terminal
arginine or lysine, will be retained). The following protocol, which was kindly provided by Ted Bures (Amgen
Inc., Thousand Oaks, California), involves five stages:


* Preparation of an affinity resin (salmine-trypsin [ST]-Sepharose) containing a tryptic
digest of salmine, an arginine-rich polypeptide, coupled to Sepharose.

* Preparation of anhydrotrypsin from the alkaline treatment of phenylmethylsulfonyltrypsin.

* Purification of anhydrotrypsin on ST-Sepharose.

* Preparation of an anhydrotrypsin affinity column.

* Use of the anhydrotrypsin affinity column for capturing basic carboxy-terminal peptides.
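
The selection principle behind the final stage can be illustrated in silico. The sketch below assumes ideal trypsin specificity (cleavage after K or R, but not when the next residue is P, a common simplification) and uses a made-up example sequence; it is an illustration only and not part of the protocol provided by Bures.

```python
import re


def tryptic_peptides(sequence):
    """Split a protein sequence after K or R, except when followed by P."""
    return [p for p in re.split(r"(?<=[KR])(?!P)", sequence) if p]


def partition_by_anhydrotrypsin(sequence):
    """Return (bound, flow_through) peptides for an idealized affinity column.

    Peptides ending in K or R are predicted to bind; the rest pass through.
    """
    bound, flow_through = [], []
    for peptide in tryptic_peptides(sequence):
        (bound if peptide[-1] in "KR" else flow_through).append(peptide)
    return bound, flow_through


# Hypothetical sequence, used only to illustrate the idea.
bound, flow_through = partition_by_anhydrotrypsin("MKTAYIAKQRQISFVKSHFSRQLEERLGLIEVQ")
print("Retained:", bound)
print("Flow-through (candidate C-terminal peptide):", flow_through)
```

If the protein itself ends in arginine or lysine, its carboxy-terminal peptide would be predicted to bind as well, so the flow-through would then be empty in this idealized model.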


» Reagents 


CAUTION: See Appendix 3 for appropriate handling of materials marked with <!>. 


Acetic acid (1 M) <!> 

Bovine trypsin (Roche Molecular Biochemicals) 

CaCl2 (1 M) <!>

Cyanogen-bromide-activated Sepharose 4B (Amersham Biosciences) <!> 

HCl (1 M) <!>

KOH (1 M) <!>

NaHCO3 (1 M, pH 8.3)

NaHCO3 (1 M, pH 8.3) containing 5 M NaCl

NaOH (20 mM) <!>

Phenylmethylsulfonyl fluoride (PMSF) (87.5 mg) <!> in 2.5 ml of acetone <!>




Phosphate-buffered saline (PBS) containing 0.02% NaN3 <!>

Salmine sulfate (protamine sulfate, Grade X, isolated from salmon, Sigma)
Sodium acetate (0.1 M, pH 4.0) containing 0.5 M NaCl

Sodium acetate (50 mM, pH 5.0) containing 0.02 M CaCl2

Trifluoroacetic acid (TFA) (0.05% v/v) <!>

Tris-Cl (0.05 M, pH 7.0) containing 20 mM CaCl2

Tris-Cl (0.1 M, pH 8.0) containing 0.5 M NaCl 


» Equipment 


Dialysis membrane (6000-8000 m.w. cutoff) 
Sintered glass funnel 
Water bath (boiling) 


METHOD 


Preparation of ST-Sepharose 


1. Dissolve 10 mg of bovine trypsin and 2.5 g of salmine sulfate in 75 ml of 0.1 M NaHCO3
(pH 8.3) and incubate for 100 minutes at 37°C.

2. Heat the solution for 3 minutes to 100°C to arrest the digestion. Allow the
salmine-trypsin solution to cool to room temperature.

3. Weigh out 14.3 g of cyanogen-bromide-activated Sepharose 4B resin.

4. Swell and wash the resin in 3 liters of 1 mM HCl on a sintered glass filter for 15 minutes.

5. Mix the salmine-trypsin solution (from Step 2) with the washed
cyanogen-bromide-activated Sepharose 4B on a rotating wheel for 1 hour at room temperature. This allows the
salmine-trypsin to bind to the activated Sepharose.

6. Add 10 gel volumes of 0.1 M NaHCO3 (pH 8.3) to the Sepharose solution to dilute the
excess ligand (i.e., salmine-trypsin) and aid in its subsequent removal.

7. Block the remaining active groups by allowing the resin to stand in 50 ml of 0.1 M Tris
(pH 8.0) containing 0.5 M NaCl for 2 hours at room temperature.

8. Aspirate all of the liquid from the top of the settled resin.

9. Carry out three cycles of alternating pH washes using:

0.1 M sodium acetate (pH 4.0) containing 0.5 M NaCl
0.1 M Tris-Cl (pH 8.0) containing 0.5 M NaCl

Perform each wash with 50 ml of buffer. Between each washing, allow the resin to settle,
and aspirate off the liquid.

10. Store the completed ST-Sepharose in PBS containing 0.02% NaN3.

The ST-Sepharose may be stored for up to 1 year at 4°C with no loss of binding affinity.

As an alternative to batch processing of the ST-Sepharose, Steps 6-10 can be carried out in a
column.

Preparation of Anhydrotrypsin

11. Inactivate 500 mg of bovine trypsin (in 250 ml of 0.05 M Tris [pH 7.0] containing 20 mM
CaCl2) with 2.5 ml of PMSF (87.5 mg in acetone) for 30 minutes at room temperature
with gentle stirring. Using a pH probe kept continuously in the solution, maintain the pH
at 7.0 with 0.02 M NaOH throughout this step (Ishii et al. 1983).

12. To terminate the reaction, add 1 M HCl until the pH of the trypsin solution is 3.0.

13. Dialyze the trypsin overnight at 4°C against 15 liters of 1 mM HCl. Use a dialysis
membrane rated with a 6000-8000 molecular-weight cutoff.

14. Adjust the resulting 290 ml of phenylmethylsulfonyl trypsin to an alkaline pH with the
addition of a 1/19th volume (or 15.3 ml) of 1 M KOH and stir for 20 minutes at 0°C.

15. Adjust the pH to 5.0 with the addition of ~15.6 ml of 1 M acetic acid. Add 6.4 ml of 1 M
CaCl2 to the phenylmethylsulfonyl trypsin.

The final mixture should contain 50 mM acetate and 20 mM CaCl2.

16. Filter the 327 ml of phenylmethylsulfonyl trypsin solution through a 0.22-µm filter.

17. Purify the anhydrotrypsin on an ST-Sepharose column (Kasai and Ishii 1975):

a. Prepare a column of ST-Sepharose (using the resin from Step 10). Wash the resin
thoroughly to remove the NaN3 and equilibrate the column in 50 mM sodium acetate
buffer (pH 5.0) containing 0.02 M CaCl2.

b. Load the phenylmethylsulfonyl trypsin solution onto the column at a flow rate of 1
ml/min.

c. Wash the column with 15 column volumes of 50 mM sodium acetate buffer (pH 5.0)
containing 0.02 M CaCl2. Wash with a column flow rate of 3 ml/min.

d. Elute the anhydrotrypsin from the column with 3 column volumes of 5 mM HCl,
operating the column at a flow rate of 1 ml/min. Monitor the eluent with a
spectrophotometer set at A280.
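
The volumes quoted in Steps 14-16 can be checked with simple bookkeeping. The sketch below assumes that the volumes are additive and treats all added acetic acid as total acetate; the helper name is illustrative.

```python
def final_mM(stock_molar, added_ml, total_ml):
    """Final concentration (mM) of a stock after dilution into total_ml."""
    return 1000.0 * stock_molar * added_ml / total_ml


dialysate_ml = 290.0                      # volume recovered after dialysis (Step 14)
koh_ml = round(dialysate_ml / 19, 1)      # 1/19th volume of 1 M KOH
acetic_ml = 15.6                          # approximate volume of 1 M acetic acid (Step 15)
cacl2_ml = 6.4                            # 1 M CaCl2 (Step 15)

total_ml = dialysate_ml + koh_ml + acetic_ml + cacl2_ml
print(f"KOH added: {koh_ml} ml; total volume: {total_ml:.1f} ml")
print(f"Acetate: {final_mM(1.0, acetic_ml, total_ml):.1f} mM")
print(f"CaCl2:   {final_mM(1.0, cacl2_ml, total_ml):.1f} mM")
```

The total comes to about 327 ml, matching Step 16, and the calculated concentrations (roughly 48 mM acetate and 20 mM CaCl2) agree with the nominal 50 mM and 20 mM stated in the protocol.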


Preparation of Anhydrotrypsin Affinity Column 


18. Dilute 10 ml of the concentrated anhydrotrypsin solution with 10 ml of 0.1 M NaHCO3
(pH 8.3) containing 0.5 M NaCl.

The following coupling method is based on the technique of Axen and Ernback (1971).

19. Add 1 ml of 1 M NaHCO3 (pH 8.3) containing 5 M NaCl to the diluted anhydrotrypsin,
and filter the solution through a 0.22-µm filter.

This adjusts the solution to 0.1 M NaHCO3 (pH 8.3) containing 0.5 M NaCl. Filtering the
anhydrotrypsin removes any precipitates and prevents the possibility of clogging the column.

20. Wash 0.314 g of cyanogen-bromide-activated Sepharose 4B with 70 ml of 1 mM HCl.

21. Mix the diluted anhydrotrypsin (from Step 19) with the cyanogen-bromide-activated
Sepharose 4B for 3 hours at room temperature on a rotating wheel.

22. Pack a column with the affinity resin according to the manufacturer's instructions.

23. Wash excess ligand from the affinity resin with 5 column volumes of 0.1 M NaHCO3 (pH
8.3) containing 0.5 M NaCl.

24. Block any free active groups on the activated Sepharose with 10 column volumes of 0.1
M Tris-Cl (pH 8.0) containing 0.5 M NaCl.

25. Carry out three cycles of alternating pH washes for ~5 minutes each at room
temperature using:

20 ml of 0.1 M sodium acetate (pH 4.0) containing 0.5 M NaCl
20 ml of 0.1 M Tris-Cl (pH 8.0) containing 0.5 M NaCl

26. Store the completed anhydrotrypsin-Sepharose in PBS containing 0.02% NaN3.

The anhydrotrypsin-Sepharose can be stored for up to 1 year at 4°C.


Sample Preparation, Loading, and Elution

27. Before loading the digested protein sample onto the anhydrotrypsin column, acidify the
protein sample to pH 5.0 with 0.1 N acetic acid.

28. Divide the sample into 2-ml aliquots (2 ml is the maximum load capacity of the column).

A sample containing a calibrated amount of enkephalin-Arg was passed through the column to
determine the column capacity. The amount of sample added is such that the column
capacity is not exceeded. The column is equilibrated and washed with 50 mM sodium acetate (pH
5.0)/20 mM CaCl2/0.02% NaN3.

29. Elute bound peptides from the anhydrotrypsin affinity column using 0.05% (v/v) TFA.

Alternative elution methods include eluting with 5 mM HCl or with 0.1 M formic acid
(Kumazaki et al. 1987), or generating a pH gradient from pH 5.0 to pH 2.5 (Kumazaki et al.
1986).


PROTOCOL 12 


Analysis of Proteomes Using the Molecular Scanner 




THE MOLECULAR SCANNER IS A HIGHLY AUTOMATED PROTEOMICS method, which simultaneously
processes thousands of proteins under identical conditions and offers a flexible and
powerful visualization tool that can create a fully annotated 2D gel electrophoresis map
(Bienvenut et al. 1999). Proteins separated by 2D gel electrophoresis are simultaneously
digested while undergoing electrotransfer from the gel to a membrane. The peptides are
subjected to peptide mass fingerprint (PMF) analysis to identify proteins directly from the
PVDF membranes by MALDI-TOF-MS scanning. An ensemble of dedicated tools is then
used to create, analyze, and visualize a proteome as a multidimensional image (Binz et al.
1999). The molecular scanner method reduces to a minimum the sample handling prior to
mass analysis and decreases the sample size to a few tens of micrometers, i.e., the size of the
MALDI-TOF-MS laser beam impact. This protocol has been kindly provided by Catherine
Déon, Willy Bienvenut, Jean-Charles Sanchez, and Denis F. Hochstrasser (Geneva University
Hospital, Geneva, Switzerland) and Markus Müeller, Robin Gras, and Ron D. Appel (Swiss
Institute of Bioinformatics, Geneva, Switzerland). The process can be divided into four parts
(see Figure 8.46): separation and digestion of proteins, acquisition of PMF data, processing
of the MS data and protein identification, and creation of multidimensional proteome maps
(see below).


* Separation and digestion of proteins. Digestion of gel-electrophoresis-separated proteins
occurs while they are being transferred onto a PVDF membrane. Following either 1D or 2D
separation of proteins, electrotransfer of the proteins is performed through a field of
trypsin covalently bound to a membrane interposed between the gel and the PVDF
membrane (Figure 8.46a). Consequently, the proteins separated on the gel are
endoproteolytically cleaved during their migration and the corresponding peptides are collected on the
PVDF membrane. The protein digestion is improved by performing an in-gel predigestion
step prior to the electrotransfer. The combination of digestion steps (called the
double-parallel digestion process or the DPD process) improves the proteolysis of
high-molecular-weight and basic proteins. Furthermore, an alternating square-shape voltage is applied
during the electrotransfer to increase the migration time of the proteins through the
enzymatic membrane, which improves tryptic hydrolysis efficiency.

This technique allows additional analyses to be carried out. For example, the collecting 
membrane can be reused for MS scans under different conditions or additional chemistry 
to improve protein identification (e.g., derivatization of peptides). The DPD process can 
also be used with other enzymes such as phosphatase or glycosidase followed by any
endoproteinase. Finally, with the development of new mass spectrometric technologies such as
the MALDI-TOF/TOF (Medzihradszky et al. 2000) and the MALDI-QqTOF (Loboda et al.
2000), promising future developments of the molecular scanner process could combine the
MALDI scanning technique with MS/MS acquisition and generation of sequence data for
protein identification.

FIGURE 8.46. Scheme of the molecular scanner. (a) Parallel digestion and simultaneous electrotransfer of proteins from a 2D
electrophoresis gel using the DPD process. (IAV) Immobilon AV membrane. (b) MALDI-TOF-MS scanning of PVDF collecting
membrane after spraying with matrix solution. (x, y) refers to the position where MS spectra were measured on the PVDF membrane.
(c) Identification procedure. The peak detection and mass calibration yields sets of PMF. The MS signal measured at each (x, y)
coordinate is represented by its m/z value m_i and its MS intensity I_i. The x_i and y_i values are interpreted as pI and Mr values. The
PMF data are submitted to SmartIdent. Identification results are collected together with the PMF data. (d) A visualization tool
allows representation of the analyzed data in different forms. Three examples of typical queries and representations are described
here. (1) An MS intensity image can be created that contains the identification data as database labels. It is generated in a
Melanie-readable format. (2) Other options allow the user to search for a particular protein and to visualize it as an intensity plot. In this
plot, the gray levels represent the number of masses identified that belong to the protein at each position. (3) The program further
allows the user to search for a set of predefined masses and to generate an intensity image where the gray levels represent the total
intensity of the mass peaks found at each (x, y) position. This image can be smoothed if needed. (Reprinted, with permission, from
Binz et al. 1999 [©American Chemical Society].)

MATERIALS


* Acquisition of PMF data. The PVDF membrane, containing the digestion products of all
the proteins at discrete positions on the membrane surface, is sprayed with an MS matrix
solution and then scanned directly by MALDI-TOF-MS to obtain PMF data over the entire
surface of the membrane (Figure 8.46b).


* Processing of the MS data and protein identification. The identification process is
managed by an interactive tool, which automatically treats all of the generated MS data and
performs the various steps of the analysis, starting with peak detection and calibration (Figure
8.46c). PMF data combined with pI and Mr information (extracted from the position on
the 2D gel) are sent for identification to SmartIdent-adapted software, an identification
tool developed by the Swiss Institute of Bioinformatics (Gras et al. 1999; http://ch.expasy.
org/tools/).


* Creation of virtual maps. A multidimensional image is generated corresponding to a
virtual annotated "2D map" (Figure 8.46d). x and y coordinates are related, respectively, to pI
and Mr values. A z axis is added that represents different data or information obtained
during the acquisition and interrogation processes. The resulting images containing
identification results are readable by the Melanie 2D electrophoresis analysis software. The user
can choose to filter and visualize only particular features from all of the data contained in
the multidimensional image (Figure 8.46d).
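
The principle of peptide mass fingerprinting used in the identification step can be sketched in a few lines. The code below is not SmartIdent and does not reproduce its scoring, which is not described here; it simply counts observed masses that match theoretical tryptic peptide masses within a tolerance. It assumes ideal trypsin specificity, singly protonated monoisotopic ions, no modifications, and no pI or Mr filtering. The sequences and names are hypothetical.

```python
import re

# Monoisotopic residue masses (Da).
MONO = {
    "G": 57.02146, "A": 71.03711, "S": 87.03203, "P": 97.05276, "V": 99.06841,
    "T": 101.04768, "C": 103.00919, "L": 113.08406, "I": 113.08406, "N": 114.04293,
    "D": 115.02694, "Q": 128.05858, "K": 128.09496, "E": 129.04259, "M": 131.04049,
    "H": 137.05891, "F": 147.06841, "R": 156.10111, "Y": 163.06333, "W": 186.07931,
}
WATER = 18.01056
PROTON = 1.00728


def tryptic_peptides(sequence):
    """Cleave after K or R, except when the next residue is P."""
    return [p for p in re.split(r"(?<=[KR])(?!P)", sequence) if p]


def mh_plus(peptide):
    """Monoisotopic m/z of the singly protonated peptide, [M+H]+."""
    return sum(MONO[aa] for aa in peptide) + WATER + PROTON


def count_matches(observed_mz, sequence, tol_ppm=50.0, min_len=5):
    """Count observed masses lying within tol_ppm of any theoretical peptide."""
    theoretical = [mh_plus(p) for p in tryptic_peptides(sequence) if len(p) >= min_len]
    return sum(
        any(abs(mz - t) / t * 1e6 <= tol_ppm for t in theoretical)
        for mz in observed_mz
    )


def rank_candidates(observed_mz, database, tol_ppm=50.0):
    """Rank database entries (name -> sequence) by number of matched masses."""
    scores = {name: count_matches(observed_mz, seq, tol_ppm) for name, seq in database.items()}
    return sorted(scores.items(), key=lambda item: item[1], reverse=True)


# Hypothetical two-entry database and a simulated peak list.
database = {
    "protA": "MKTAYIAKQRQISFVKSHFSRQLEERLGLIEVQ",
    "protB": "GASPVTCLINDQKEMHFRYWGASPVTK",
}
peaks = [mh_plus("TAYIAK"), mh_plus("QISFVK"), mh_plus("LGLIEVQ")]
print(rank_candidates(peaks, database))
```

A real identification tool would additionally weigh the pI and Mr inferred from the gel position, as described above, and would score matches statistically instead of counting them.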


» Reagents


CAUTION: See Appendix 3 for appropriate handling of materials marked with <!>. 


Acetonitrile (preparative HPLC grade, 50% and 80%) <!>

α-cyano-4-hydroxycinnamic acid (HCCA) (10 mg/ml in 70% methanol and 0.1% TFA) <!>
α-tosyl-L-arginine methyl ester (TAME) (10 mM)

Amido Black (0.5% w/v)


Dissolve the dye in 25% (v/v) 2-propanol <!> and 10% (v/v) acetic acid <!>.


Capping reagent 


1 M ethanolamine (pH 10.5) <!> 
1 M sodium bicarbonate buffer (pH 9.5) 


Coomassie Brilliant Blue R250 (0.1% w/v) 


Prepare the dye in a 60/30/10 (v/v/v) mixture of H2O, methanol, and acetic acid.
Destain solution (50/40/10 [v/v/v] mixture of H2O, methanol, and acetic acid)


HCl (1 mM) <!>
Paraffin oil
Phosphate-buffered saline (PBS) containing Tween (PBS-Tween, pH 7.4)


20 mM NaH2PO4 <!>
140 mM NaCl
0.5% Tween 20


0.5x Transblotting buffer (Towbin buffer containing 0.01% w/v of SDS) <!>
Tris-Cl (46 mM, pH 8.1) containing 1 mM CaCl2 <!> and 0.1% sodium azide <!>
Tris-Cl (460 mM, pH 8.1) containing 11.5 mM CaCl2

Trypsin (2 mg/ml) in 20 mM NaH2PO4 (pH 7.8)

Trypsin (0.05 mg/ml) in 10 mM Tris-Cl (pH 8.2)


All of the trypsin used in this experiment should be TPCK-treated trypsin (Type IX from porcine
pancreas, Sigma Chemical).

Gel electrophoresis reagents:
* One-dimensional minigel
  60 mM Tris-Cl (pH 6.8) containing 10% (v/v) glycerol, 2% (w/v) SDS, and 3%
  (v/v) mercaptoethanol
  12%T, 2.6%C, 1-mm-thick linear polyacrylamide gels
* Two-dimensional minigel
  2D Equilibration solution
    50 mM Tris-Cl (pH 6.8)
    6 M urea <!>
    30% (v/v) glycerol
    2% (w/v) SDS
    2% (w/v) dithioerythritol (DTE)
  Sulfhydryl blocking solution
    50 mM Tris-Cl (pH 6.8)
    6 M urea
    30% (v/v) glycerol
    2% (w/v) SDS
    2.5% (w/v) iodoacetamide <!>
    a trace of bromophenol blue
  Overlay solution
    0.5% (w/v) agarose
    25 mM Tris (pH 8.3)
    198 mM glycine <!>
    0.1% (w/v) SDS
    Shortly before use, boil the entire overlay solution to melt the agarose.
    Hold the solution at ~70°C.
  2D gel rehydration solution
    8 M urea
    4% (w/v) CHAPS
    2% (v/v) Resolytes 4-8
    65 mM DTE
    a trace of bromophenol blue
* Two-dimensional full-size
  First-dimension sample loading buffer
    8 M urea
    4% (w/v) CHAPS
    40 mM Tris
    65 mM DTE
    0.05% (w/v) SDS
    a trace of bromophenol blue
  Sodium thiosulfate (5 mM)
  2D Equilibration solution
    50 mM Tris-HCl (pH 6.8)
    6 M urea
    30% (v/v) glycerol
    2% (w/v) SDS
    2% (w/v) DTE
  Sulfhydryl blocking solution
    50 mM Tris-HCl (pH 6.8)
    6 M urea
    30% (v/v) glycerol
    2% (w/v) SDS
    2.5% (w/v) iodoacetamide
    a trace of bromophenol blue
  Overlay solution (for recipe, see Two-dimensional minigel above)


» Equipment
Electroblotting apparatus
The apparatus can be purchased (e.g., Bio-Rad) or laboratory made (Bienvenut et al. 1999).
Gel electrophoresis system:
* One-dimensional minigel
  1D minigel system (e.g., Mini-Protean II Cell, Bio-Rad)
  Water bath preset to 95°C
* Two-dimensional minigel
  2D minigel system (e.g., Multiphor II, Amersham Biosciences)
  Ready-made IPG strips (7-cm) (Amersham Biosciences)
  Water bath preset to 70°C
  Precast or homemade vertical or horizontal SDS slab gels for the second dimension
    Dimension: 80 x 60 x 1.0 mm or 100 x 80 x 1.0 mm
    Resolving gel: acrylamide/piperazinediacrylamide (12%T/2.6%C)
    Stacking gel: no stacking
    Leading buffer: 0.375 M Tris-Cl (pH 8.8)
    Trailing buffer: 25 mM Tris, 198 mM glycine, 0.1% (w/v) SDS (pH 8.3)
    Additive: 5 mM sodium thiosulfate
    Polymerization agents: 0.05% TEMED, 0.1% ammonium persulfate
* Two-dimensional full size
  Ready-made IPG strips (18-cm) (Amersham Biosciences)
  Sample cups (Amersham Biosciences)
  Large electrophoresis system (e.g., Protean II Cell, Bio-Rad)
  Water bath preset to 70°C
Immobilon AV membrane (Millipore)


The IAV membrane is a modified PVDF matrix presenting activated carboxylic groups able to react
with nucleophiles such as amine groups of proteins.


MALDI sample plate (modified)
See Step 24.


Mass spectrometer
See Step 26.


PVDF membrane

Rotating hybridizer (e.g., model HB-2D, Techne, Cambridge, England)
UV-visible spectrophotometer (Ultrospec III, Amersham Biosciences)
Water bath preset to 35°C


» Additional Reagents and Equipment
If using high-resolution 2D gel electrophoresis to separate proteins:
* Step 9a of this protocol requires the reagents and equipment in Chapter 4, Protocol 5
(Method A)
* Step 9b, those in Chapter 4, Protocol 7
* Step 9c, those in Chapter 4, Protocol 8
* Step 9f, those in Chapter 4, Protocol 9

METHOD


Preparation of IAV-Trypsin Membranes

1. Submerge a 10 x 12-cm IAV membrane in a solution of 2 mg/ml trypsin in 20 mM
NaH2PO4 (pH 7.8); incubate in a rotating hybridizer for 3 hours at room temperature.

2. Wash the membrane three times briefly but with agitation in 10 ml of PBS-Tween to
remove unbound trypsin.

3. Incubate the membrane in 10 ml of capping reagent for 3 hours at 4°C to block the
remaining free carboxylic groups.

4. Wash the membrane three times briefly but with agitation in 10 ml of PBS-Tween to
remove the capping reagent.

5. Gently wash the membrane twice for 30 minutes each in 10 ml of PBS-Tween.

Membranes can be stored for 2-3 years at 4°C in a 46 mM Tris-HCl, 1 mM calcium chloride,
0.01% sodium azide buffer (pH 8.1).


Testing the Activity of the Immobilized Trypsin

6. Add 1 cm² of IAV-trypsin membrane to a mixture of:

2.6 ml of 460 mM Tris-HCl (pH 8.1) containing 11.5 mM CaCl2
0.3 ml of 10 mM TAME
0.1 ml of 1 mM HCl

7. Stir the mixture for 40 seconds. Measure the absorbance of the solution at 247 nm with
a UV-visible spectrophotometer.

8. After 3 minutes of constantly stirring the solution, take a second absorbance reading. The
value of ΔA247/minute is used to calculate the equivalent amount of active trypsin
expressed per unit of surface area as described by Hummel (1959).
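
The rate calculation in Step 8 can be sketched as follows. The 3.0-ml assay volume is the sum of the components in Step 6. The differential extinction coefficient of 540 M^-1 cm^-1 at 247 nm is an assumption based on the value commonly quoted for TAME hydrolysis and should be checked against Hummel (1959); the 1-cm path length, the example readings, and the function name are likewise illustrative.

```python
DELTA_EPSILON_247 = 540.0  # M^-1 cm^-1; assumed value, verify against Hummel (1959)


def tame_activity(a_first, a_second, minutes_between,
                  volume_ml=3.0, path_cm=1.0, area_cm2=1.0):
    """Return (delta A247 per minute, umol TAME hydrolyzed per minute per cm^2)."""
    if minutes_between <= 0:
        raise ValueError("the interval between readings must be positive")
    rate = (a_second - a_first) / minutes_between
    # Beer-Lambert: rate / (eps * l) is mol/liter/min; scale by volume to umol/min.
    umol_per_min = rate * volume_ml * 1000.0 / (DELTA_EPSILON_247 * path_cm)
    return rate, umol_per_min / area_cm2


# Hypothetical readings taken 3 minutes apart.
rate, activity = tame_activity(0.100, 0.370, 3.0)
print(f"dA247/min = {rate:.3f}; activity = {activity:.2f} umol/min per cm^2")
```

The interval between the two readings is passed in explicitly because it depends on how the timing in Steps 7 and 8 is carried out on the bench.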


Protein Separation by Gel Electrophoresis 
9. Separate the proteins by either 1D or 2D gel electrophoresis. The choice of method will 
depend on the complexity of the sample and the abundance of the protein(s) of interest. 


For 1D gel electrophoresis:
a. Dilute proteins to the appropriate amount in 60 mM Tris-HCl (pH 6.8) containing
10% glycerol (v/v), 2% SDS (w/v), and 3% mercaptoethanol (v/v).
b. Reduce the proteins by heating the sample for 5 minutes at 95°C just before loading
the samples onto the gel.


12%T, 2.6%C, 1-mm-thick linear polyacrylamide gels are typically used.


c. Separate the proteins at 200 V for 45 minutes. 

For 2D minigel electrophoresis:

a. Combine 1 mg of the protein sample with 150 µl of rehydration solution.

b. Apply the entire diluted protein sample to 7-cm ready-made IPG strips.

c. Peel off the protective cover sheets from the IPG strips, and position them in the
rehydration chamber such that the gel of the strip is in contact with the sample.

d. Cover the IPG strips with low-viscosity paraffin oil, and allow them to rehydrate for
at least 6 hours (overnight is fine) at room temperature.

e. Use tweezers to remove the rehydrated IPG gel strips from the chamber, rinse them
with H2O, and place the strips gel side up in the electrophoresis running tray
according to the manufacturer's instructions.

f. Increase the voltage linearly from 300 to 3500 V over a 10-minute period, followed by
1 additional hour at 3500 V.

g. Equilibrate the strips for 12 minutes in the rehydration tray using 3 ml per groove of
2D equilibration solution. Discard the equilibration solution and replace it with 3 ml
per groove of sulfhydryl blocking solution. Incubate the strips for 5 minutes.

h. After the equilibration, cut the IPG gel strips to size.

i. Overlay the second-dimension gels with ~70°C overlay solution.

j. Immediately load the IPG gel strips through the overlay solution.

k. Run the gels at 200 V (constant) for 30 minutes at 12°C using a minigel
electrophoresis system.


For high-resolution 2D gel electrophoresis:

a. Follow the steps in Chapter 4, Protocol 5 (Method A) to separate the proteins by IEF
(the first dimension). Incorporate the following details:

* Use IPG strips that are 3 mm x 18 cm.

* Rehydrate the strips in the absence of the protein sample, which should be applied
using sample cups.

* Mix 4 mg of protein sample with 60 µl of first-dimension sample loading buffer.

* Separate the proteins using a linear voltage gradient that increases from 300 to 3500
V over 3 hours, followed by 3 more hours at 3500 V. Then increase the voltage to 5000
V. Separate the proteins by electrophoresis overnight for a total of 100,000
volt-hours.

b. Follow the steps in Chapter 4, Protocol 7 to prepare the second-dimension SDS-PAGE
gels. Incorporate the following details:

* Gel dimensions are 160 x 200 x 1.5 mm.

* Each gel is 9-16%T/2.6%C acrylamide and bisacrylamide.

* Sodium thiosulfate (5 mM) is added to the acrylamide/bisacrylamide to initiate/catalyze
polymerization.

* Pour the gels to within 0.7 cm from the top of the plates and overlay them with
1-butanol.

* Allow the gels to polymerize for ~2 hours.

* Remove the butanol, overlay the gels with H2O, and incubate them overnight.



c. Follow the steps in Chapter 4, Protocol 8 to prepare the proteins for the
second-dimension separation by SDS-PAGE. Incorporate the following details:

* Equilibrate the strips for 12 minutes with 100 ml of 2D equilibration solution.

* Block all sulfhydryl groups within the proteins for 5 minutes with 100 ml of sulfhydryl
blocking solution.

d. Cut the IPG gel strips to size so that they fit atop the SDS-PAGE gels. Remove 6 mm
from the anodic end and 14 mm from the cathodic end.

e. Layer each SDS-PAGE gel with overlay solution heated at ~70°C.

f. Immediately apply the IPG gel strips to the top of the SDS-PAGE gels, according to
Chapter 4, Protocol 9.

g. Run the gels at 40 mA/gel under constant current for 5 hours at 8-12°C using a large
electrophoresis system.


Staining the Gel-bound Proteins 


10. Stain the proteins with 0.1% (w/v) of Coomassie Brilliant Blue R250 solution by
submerging the gel in the stain solution for 30 minutes.

11. Destain the gel in repeated washes of destain solution.


Double-parailel Digestion of Proteins 


12. Soak the gel in H2O for 5 minutes.

13. Repeat Step 12 two more times.

14. Soak the gel in 50% acetonitrile for 20 minutes, followed by 10 minutes in 80% acetonitrile.

15. Air-dry either the entire wet gel or a selected piece at room temperature.

16. Incubate the gel with 0.05 mg/ml of trypsin in 10 mM Tris-Cl (pH 8.2) for 30 minutes at 35°C.

The volume of trypsin solution to use equals 3-5 times the initial volume of the gel. This step both rehydrates the gel and begins digesting the proteins.

17. After 30 minutes of incubation, discard the trypsin solution.

18. Just prior to electroblotting the proteins, equilibrate the IAV-trypsin membrane and the PVDF membrane (in separate containers) in transblotting buffer for 5 minutes.

19. Carry out the electrotransfer overnight (12-18 hours) at room temperature in a semi-dry apparatus by inserting a double layer of IAV-trypsin membrane between the gel and the PVDF membrane.

The migration time of the proteins through the enzymatic membranes can be increased by applying an alternating square-shape voltage during the transfer: +12.5 V for 125 msec followed by -5 V for 125 msec.

20. Wash the PVDF membrane in H2O for 5 minutes.
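
The alternating square-shape voltage mentioned in the note on electrotransfer can be summarized numerically. The following is a minimal sketch, not part of the original protocol; the function names are hypothetical, and it assumes an ideal waveform with instantaneous switching between the two voltage levels.

```python
def square_wave_summary(v_pos=12.5, t_pos_ms=125.0, v_neg=-5.0, t_neg_ms=125.0):
    """Return period, frequency, and time-averaged voltage of one cycle."""
    period_ms = t_pos_ms + t_neg_ms
    return {
        "period_ms": period_ms,
        "frequency_hz": 1000.0 / period_ms,
        # Time-weighted mean of the two voltage levels.
        "mean_voltage": (v_pos * t_pos_ms + v_neg * t_neg_ms) / period_ms,
    }


def voltage_at(t_ms, v_pos=12.5, t_pos_ms=125.0, v_neg=-5.0, t_neg_ms=125.0):
    """Voltage applied at time t_ms, with the positive phase first in each cycle."""
    phase = t_ms % (t_pos_ms + t_neg_ms)
    return v_pos if phase < t_pos_ms else v_neg


if __name__ == "__main__":
    print(square_wave_summary())
```

With the values quoted in the text, one cycle lasts 250 msec and the net (time-averaged) voltage remains positive, so proteins still migrate toward the collecting membrane, only more slowly.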

Staining the Membrane-bound Proteins


21. (Optional) Stain the PVDF membranes with 0.5% Amido Black for 1 minute, and destain them with repeated washes of H2O.


Acquisition of PMF Data 


22. Wash the PVDF membrane containing the bound proteins with H2O and air-dry.

23. Spray the membrane surface with a solution of 10 mg/ml of HCCA until it is completely wetted. Allow the membrane to air-dry.

24. Use a very small amount of high-vacuum grease to attach a 4 x 4-cm² piece of PVDF membrane onto a modified MALDI sample plate. Make sure that the grease does not penetrate the membrane.

25. Define absolute coordinates on the membrane, and establish a grid that represents positions where mass spectra will be acquired.

For a given experiment, the distance between distinct mass spectrum acquisitions should be defined as a constant ranging between 0.2 and 0.5 mm, the exact distance determined empirically.

26. Insert the MALDI plate into the mass spectrometer and set the parameters for mass spectra acquisition, according to the manufacturer's instructions.

Our mass spectra were acquired on a Voyager DE-STR MALDI-TOF mass spectrometer (Applied Biosystems) equipped with a 337-nm UV nitrogen laser, a delayed extraction device, and an acquisition rate of 20 Hz. The acquisition was performed with an acceleration voltage of 20 kV and a delayed extraction time set to 150 nsec. The mass range was generally defined from 850 to 4000 daltons with a low-mass gate fixed at 750 daltons. The laser power was set ~20% above threshold. Spectra (50-100) were acquired at each position depending on the amount of analyzed material.
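
The acquisition grid described in Step 25 can be sketched in a few lines. This is an illustrative example only: the function name, the row-major ordering, and the choice of the lower-left corner as origin are assumptions, not details given in the protocol.

```python
def acquisition_grid(width_mm, height_mm, spacing_mm, origin=(0.0, 0.0)):
    """Return (x, y) acquisition positions in mm, row by row.

    The spacing is held constant and must lie in the 0.2-0.5 mm range
    recommended in the protocol.
    """
    if not 0.2 <= spacing_mm <= 0.5:
        raise ValueError("spacing should be between 0.2 and 0.5 mm")
    # Number of positions that fit along each axis (small epsilon guards
    # against floating-point round-off in the division).
    nx = int(width_mm / spacing_mm + 1e-9)
    ny = int(height_mm / spacing_mm + 1e-9)
    x0, y0 = origin
    return [(x0 + i * spacing_mm, y0 + j * spacing_mm)
            for j in range(ny) for i in range(nx)]


if __name__ == "__main__":
    grid = acquisition_grid(12.0, 8.0, 0.25)
    print(len(grid), "positions; first:", grid[0], "last:", grid[-1])
```

Each returned coordinate would then be translated into stage coordinates by whatever means the instrument software provides.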


Protein Identification 


27. Using the mass spectrometer's acquisition software, set the peak detection threshold to the value optimized for a set of calibrated spectra (for standards, see Tables 8.9 and 8.10). Then, convert the positions on the MALDI sample plate to apparent molecular mass (Mr) and pI values.

An interactive tool that enables the steps of the identification process to be performed automatically has been developed (Gras et al. 1999; Perkins et al. 1999).

28. Combine the PMF data from all of the spectra with calculated pI and Mr, and also with user-defined interrogation criteria, such as peptide mass tolerance and chemical modifications. The combined data are sent automatically to SmartIdent (http://ch.expasy.org/tools/).

The process of identification takes into account weak expression of some proteins but also overlapping protein spots, so the minimal number of matching peptides for the PMF search should be set as low as possible (i.e., 3 peptides). Set the number of missed cleavages to 1. In any case, peptides with one missed cleavage, or other peptides carrying artifactual modifications such as propionamide cysteine, will have a lower weight in the final protein identification score than peptides with no missed cleavage and no modification. It is well known that the position of mass spectrum acquisition on the MALDI sample plate influences the calibration error (Egelhofer et al. 2000). In addition, the membrane pasted on the sample plate could also be slightly warped, resulting in a variation in the calibration of the spectra. Therefore, expect mass differences up to 0.6 dalton, although usually this will not hinder protein identification because the SmartIdent tool is able to compensate for calibration errors.


29. Write peak-list and identification results into a text file. 
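
The search settings discussed in Step 28 (one missed cleavage, a minimum of three matching peptides, a mass tolerance of about 0.6 dalton) can be illustrated with a small in-silico digestion. This is a simplified sketch and not the SmartIdent algorithm, whose scoring is not described here; the function names are hypothetical, trypsin is assumed to cleave after K or R except before P, and no modifications are considered.

```python
import re

# Monoisotopic residue masses (daltons).
MONO = {
    "G": 57.02146, "A": 71.03711, "S": 87.03203, "P": 97.05276,
    "V": 99.06841, "T": 101.04768, "C": 103.00919, "L": 113.08406,
    "I": 113.08406, "N": 114.04293, "D": 115.02694, "Q": 128.05858,
    "K": 128.09496, "E": 129.04259, "M": 131.04049, "H": 137.05891,
    "F": 147.06841, "R": 156.10111, "Y": 163.06333, "W": 186.07931,
}
WATER = 18.01056
PROTON = 1.00728


def tryptic_peptides(sequence, missed_cleavages=1):
    """Return (peptide, number_of_missed_cleavages) pairs."""
    # Zero-width split after K/R unless the next residue is P (Python 3.7+).
    fragments = [f for f in re.split(r"(?<=[KR])(?!P)", sequence) if f]
    peptides = []
    for i in range(len(fragments)):
        for n in range(missed_cleavages + 1):
            if i + n < len(fragments):
                peptides.append(("".join(fragments[i:i + n + 1]), n))
    return peptides


def mh_mass(peptide):
    """Monoisotopic mass of the singly protonated peptide, [M+H]+."""
    return sum(MONO[aa] for aa in peptide) + WATER + PROTON


def match_peaks(peaks, peptides, tol_da=0.6):
    """Return (peak, peptide, missed) for every peak within tol_da of a peptide."""
    return [(peak, pep, missed)
            for peak in peaks
            for pep, missed in peptides
            if abs(peak - mh_mass(pep)) <= tol_da]


def enough_matches(matches, minimum=3):
    """Apply the minimum-matching-peptides criterion to distinct peptides."""
    return len({pep for _, pep, _ in matches}) >= minimum
```

A scoring function could then down-weight matches whose `missed` value is 1, in the spirit of the weighting described in the text; the actual weights used by the identification tool are not given in this section.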


Creation of Virtual Maps 


30. Analyze and cluster the identified proteins according to the procedure described by 
Bienvenut et al. (2001). Several criteria are defined to trace and exclude false identifica- 
tions from the final list of identified proteins. 


* First, identifications of the same protein should be clustered in regions that have the shape of a spot on a 2D electrophoresis gel. Therefore, identifications that are isolated or spread out over a large portion of the scanned membrane should be discarded.

* Second, identifications that require matches with a matrix cluster or impurity masses should be excluded, as well as weakly expressed proteins that required peptide masses of abundant proteins.

* Third, identifications that have a low average score are not considered.
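
Two of the criteria above (spatial clustering and a minimum average score) lend themselves to a simple filter. The sketch below is only one possible reading of those criteria and is not the procedure of Bienvenut et al. (2001); the function name, the 8-neighbor definition of a cluster, and both threshold values are hypothetical. It does not address the second criterion (matrix cluster or impurity masses).

```python
from collections import defaultdict


def filter_identifications(hits, min_cluster_size=3, min_mean_score=0.5):
    """Keep proteins whose hits form a cluster with an adequate mean score.

    hits: iterable of (ix, iy, accession, score), where ix and iy are
    integer grid indices of the acquisition position.
    """
    by_protein = defaultdict(list)
    for ix, iy, accession, score in hits:
        by_protein[accession].append((ix, iy, score))

    kept = {}
    for accession, points in by_protein.items():
        positions = {(ix, iy) for ix, iy, _ in points}
        # A hit counts as clustered if any of its 8 grid neighbors
        # carries the same identification.
        clustered = [
            (ix, iy, s) for ix, iy, s in points
            if any((ix + dx, iy + dy) in positions
                   for dx in (-1, 0, 1) for dy in (-1, 0, 1)
                   if (dx, dy) != (0, 0))
        ]
        if len(clustered) < min_cluster_size:
            continue  # isolated or too few hits: discard
        mean_score = sum(p[2] for p in clustered) / len(clustered)
        if mean_score >= min_mean_score:
            kept[accession] = clustered
    return kept
```

In practice the thresholds would need to be tuned against identifications that are known to be correct.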


RESULTS 


* The identification results are represented as a virtual, annotated 2D electrophoresis map. The program generates a multidimensional image where x and y coordinates are related to pI and Mr, respectively, and the z axis can be related to numerous data or information associated with the identified protein such as PMF spectra intensity, SmartIdent identification score, number of matching peptides, etc.

* The image files are stored in a graphical format that can be read by the Melanie 2D image analysis software. Those images contain numerous distinct attributes such as pI, Mr, identification labels (SwissProt or TrEMBL AC numbers, I.D. labels), peptide masses, and MS intensities. The user can choose to filter and visualize only particular aspects, as desired.


APPLICATION: ANALYSIS OF A 2D GEL FROM E. COLI


The molecular scanner approach was applied to the analysis of E. coli proteins. Three 1-mg 
samples of E. coli were separated on mini-2D gels. 


* The first gel was stained with Coomassie Brilliant Blue (Figure 8.47a).


* The proteins on the second gel were electrotransferred to a PVDF membrane without 
any exposure to trypsin. The PVDF membrane containing the bound proteins was 
stained with Amido Black (Figure 8.47b). 


* The proteins on the third gel were treated as in the protocol. The results are:

1. No proteins were detectable after staining the membrane with Amido Black (Figure 8.47c).

2. A 9 x 13-mm² piece of the collecting PVDF membrane corresponding to the 5.1-5.2 pI range and the 35,000-45,000-dalton Mr range was cut out and pasted on the modified sampling plate of a MALDI-TOF mass spectrometer. The membrane was scanned on a 48 x 32-point grid with a sampling distance of 0.25 mm in both horizontal and vertical directions.

3. Sixty-four laser shots were fired at a frequency of 3 Hz, leading to an acquisition time of ~9 hours. Almost 1540 spectra were acquired. The disc space needed to store all the spectra was 350 MB.

4. All of the acquired PMF data (Figure 8.47e) were used for protein identification in the SwissProt protein database with the SmartIdent program, and seven distinct proteins were unambiguously identified including proteins from overlapping 2D electrophoresis spots:

Isocitrate dehydrogenase (IDH_ECOLI, P08200)

Isocitrate lyase (ACEA_ECOLI, P05313)

FIGURE 8.47. DPD result from a mini-2D electrophoresis of E. coli extract. (a) Coomassie Blue-stained mini-2D electrophoresis gel. (b) Amido Black-stained PVDF membrane after electrotransfer. The image corresponds to the dashed box in a. (c) Amido Black-stained PVDF membrane after DPD transfer. The image corresponds to the dashed box in a. (d) 3D image obtained after MS scanning of the straight line delimited area of c (pI range: 5.1-5.2, m.w. range: 35-45 kD, real size: ~9 x 13 mm, 1536 MS spectra). The vertical axis corresponds to the SmartIdent identification score. Proteins identified in the E. coli scanning experiment are (1) isocitrate dehydrogenase (IDH_ECOLI, P08200); (2) isocitrate lyase (ACEA_ECOLI, P05313); (3) aldehyde dehydrogenase A (ALDA_ECOLI, P25553); (4) 6-phosphogluconate dehydrogenase (6PGD_ECOLI, P00350); (5) S-adenosylmethionine synthetase (METK_ECOLI, P04384); (6) phosphoglycerate kinase (PGK_ECOLI, P11665); (7) hypothetical protein (YBHE_ECOLI, P52697). Note that proteins 1 and 6 appear as two different spots. (e) Peptide mass fingerprint spectra obtained during the scanning process for isocitrate dehydrogenase, 6-phosphogluconate dehydrogenase, and phosphoglycerate kinase.
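
The figures quoted for the E. coli scan are mutually consistent, as a short calculation shows. This sketch assumes that "48 x 32" counts sampling positions, that 64 laser shots are fired at every position, and that stage movement and data transfer take no time; the function name is hypothetical.

```python
def scan_summary(nx=48, ny=32, spacing_mm=0.25,
                 shots_per_spectrum=64, laser_hz=3.0, disc_mb=350.0):
    """Derive spectrum count, scanned area, laser time, and storage per spectrum."""
    n_spectra = nx * ny
    laser_seconds = n_spectra * shots_per_spectrum / laser_hz
    return {
        "n_spectra": n_spectra,
        "width_mm": nx * spacing_mm,
        "height_mm": ny * spacing_mm,
        "hours": laser_seconds / 3600.0,
        "mb_per_spectrum": disc_mb / n_spectra,
    }


if __name__ == "__main__":
    for key, value in scan_summary().items():
        print(key, round(value, 3))
```

The result is 1536 spectra over a 12 x 8-mm area and roughly 9.1 hours of laser time, in line with the "almost 1540 spectra", the ~9 x 13-mm membrane piece, and the ~9-hour acquisition reported above.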


PROTOCOL 13 


Peptide Mass Fingerprinting Using the Chemical Printer 


THE MOST POPULAR METHOD FOR THE ANALYSIS OF PROTEINS separated by 2D-PAGE is "in-gel" digestion (Pandey and Mann 2000). This method typically involves the excision of the protein spot from the gel and its deposition into the well of a microtiter plate. A number of liquid-handling steps are performed before and after endoproteinase digestion. The most sensitive results are achieved if the sample is cleaned and concentrated with microtips packed with reversed-phase resin, such as the Millipore ZipTip (see also Protocol 2). The chemical printer approach for peptide mass fingerprinting differs significantly from traditional "in-gel" digestion methods in that the endoproteinase chemistry is brought to the membrane-bound proteins, rather than excising the gel-entrapped protein spots and delivering them to a microtiter plate. Peptide mass fingerprinting is achieved by inserting the manipulated membrane into the MALDI-TOF mass spectrometer and desorbing the peptides directly from the membrane surface.

Electroblotting gel-separated proteins onto membranes is an ideal method for generating high-resolution protein arrays that are suitable for long-term storage. When coupled with the chemical printer technology, membrane-based protein arrays represent a robust medium for chip-based proteomics. Archiving protein arrays on membranes is particularly important as proteomic studies move into clinical studies where patient samples are unique and rarely available for subsequent collections. Microdispensing reagents with piezoelectric devices onto protein spots at precise coordinates ensures minimal cross-contamination of the samples, with the additional advantage that unused protein within the spot can be archived for future analyses.

This protocol was provided by Andrew A. Gooley and his colleagues, Femia G. Hopwood, Janice L. Duff, Parag S. Ghandi, Cameron J. Hill, Wendy L. Holstein, Paul E. Smith, Andrew J. Sloane, and Keith L. Williams (Proteome Systems, Sydney, Australia), and Patrick W. Cooley and David B. Wallace (MicroFab Technologies Inc., Plano, Texas).


MATERIALS


CAUTION: See Appendix 3 for appropriate handling of materials marked with <!>.


Reagents

ACTH peptide
Denaturing sample buffer
  The composition of the denaturing sample buffer will vary depending on the source of the proteins. Example buffers:
  For E. coli: 7 M urea, 2 M thiourea, 1% (v/v) ASB-14, 2 mM TBP (tributylphosphine), 40 mM Tris
  For human plasma proteins: 7 M urea, 2 M thiourea, 2% (w/v) CHAPS, 5 mM Tris-Cl
Direct Blue-71 (Sigma)
  Direct Blue-71 is more sensitive than Coomassie Blue and has a very low background. A fluorescent stain is not suitable because the blot cannot be registered on the chemical printer.
IEF-strip equilibration buffer
  6 M urea <!>
  2% SDS <!>
  375 mM Tris-Cl (pH 8.8)
Iodoacetamide <!>
Matrix solution (α-cyano-4-hydroxycinnamic acid) <!>
  Before using, recrystallize the matrix and dissolve it to 10 mg/ml in an organic solvent mixture of 30% (v/v) acetonitrile <!>, 20% (v/v) isopropanol <!>, and 20% (v/v) butanol <!>/0.5% (v/v) formic acid <!>.
Octyl-glucopyranoside (OGP) (1% v/v)
Porcine modified trypsin, sequencing grade (Promega)
TBP (tributylphosphine)


Equipment

Centrifuge
Chemical printer
Conductive adhesive tape (3M, product 9073)
Humidified container (see Step 21)
Immobilized pH gradient strips (7 cm, pI 4-7) (Amersham Biosciences)
MALDI target
Mass spectrometer (e.g., Axima-CFR MALDI-TOF, Kratos, Manchester)
NuPAGE (4-12% minigels) (Invitrogen)
Off-line scanner
PVDF membrane (e.g., Immobilon-PSQ, Millipore, and other protein-sequencing grade PVDF membranes)


Biological Sample

Cells or proteins of interest


Additional Equipment and Reagents

Steps 5 and 6 require the materials used in Chapter 4, Protocol 5 (Method A); Step 8, those in Chapter 4, Protocol 7; Step 9, those in Chapter 4, Protocol 8; Step 10, those in Chapter 4, Protocol 9; and Step 11, those in Chapter 4, Protocol 15 or 16.


METHOD


2D PAGE and Electroblotting of Samples 


1. Resuspend the cells or proteins in denaturing sample buffer to a concentration of ~3 
mg/ml. Incubate the sample for 2 hours at room temperature. 


2. If necessary, reduce the proteins with 3 mM TBP (final concentration from 20 mM stock solution) for 2 hours at room temperature.

3. Alkylate the proteins by adding iodoacetamide (1 M stock solution) to a final concentration of 15 mM, and incubating the sample in the dark for 1 hour at room temperature.

4. Centrifuge the sample at 21,000g for 5 minutes at 4°C.

5. Apply 115 µl of the supernatant to a 7-cm IPG (pH 4-7) strip, and rehydrate the strip for 4 hours at room temperature.

6. Follow the steps in Chapter 4, Protocol 5 (Method A) to separate the proteins by IEF (the first dimension). Incorporate the following detail: Separate the proteins by electrophoresis for a total of 36,000 volt-hours.

7. Following IEF, equilibrate the strip for 20 minutes in IEF-strip equilibration buffer.

8. Follow the steps in Chapter 4, Protocol 7 to prepare the second-dimension SDS-PAGE gels.

9. Follow the steps in Chapter 4, Protocol 8 to ready the proteins for the second-dimension separation by SDS-PAGE.

10. Apply the IPG gel strip to the top of the 4-12% minigel SDS-PAGE gel, and separate the proteins in the second dimension according to Chapter 4, Protocol 9.

11. Following electrophoresis, electroblot the gel onto an Immobilon-PSQ membrane at 400 mA for 1.3 hours, according to Chapter 4, Protocol 15 or 16.

12. Stain the membrane with Direct Blue-71 (see Hong et al. 2000).

13. Dry the membrane under vacuum, and store it in a sealed plastic bag to prevent keratin contamination.
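
The reduction and alkylation steps (Steps 2 and 3) each require diluting a stock solution to a stated final concentration. A minimal sketch of that C1·V1 = C2·V2 calculation follows. The function name and the 1000-µl sample volume are hypothetical, and the iodoacetamide example assumes a 1 M stock, since a stock must be more concentrated than the 15 mM target.

```python
def stock_volume_ul(final_conc_mM, stock_conc_mM, final_volume_ul):
    """Volume of stock (µl) needed so that the final mixture has final_conc_mM.

    Solves C_stock * V_stock = C_final * V_final for V_stock, where
    V_final is the total volume after the stock has been added.
    """
    if stock_conc_mM <= final_conc_mM:
        raise ValueError("stock must be more concentrated than the target")
    return final_conc_mM * final_volume_ul / stock_conc_mM


if __name__ == "__main__":
    # 3 mM TBP from a 20 mM stock in a 1000-µl final volume.
    print(stock_volume_ul(3.0, 20.0, 1000.0))
    # 15 mM iodoacetamide from an assumed 1 M (1000 mM) stock.
    print(stock_volume_ul(15.0, 1000.0, 1000.0))
```

The guard clause makes explicit that a dilution can only lower a concentration, which is a quick sanity check on any stock and final values read from a protocol.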


Preparation of the Membrane for Chemical Printing 


14. Attach the dried membrane to a MALDI target with conductive adhesive tape (3M).

15. Acquire an image of the electroblotted proteins adhered to the MALDI target using an off-line scanner.

16. Detect the protein spots either automatically using a proprietary segmentation analysis algorithm or manually with a simple "point and click" on the scanned electroblot image.

The commercial version of this instrument (Shimadzu Biotech) includes this software.

17. To achieve registration of the MALDI-membrane target on the chemical printer, select three registration points of the MALDI target (see Figure 8.48). View these points with an overhead digital camera and use the XY position of each registration point to calculate the exact positions of the proteins that have been selected for digestion.

The overhead camera can also be used to view the protein spots after microdispensing solutions onto the membrane.
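
Three non-collinear registration points are exactly enough to determine an affine mapping (translation, rotation, scaling, and shear) from scanned-image coordinates to printer coordinates. The sketch below illustrates that idea in plain Python; it is an assumption about how such a registration could be computed, not a description of the instrument's software, and all names are hypothetical.

```python
def _det3(m):
    """Determinant of a 3 x 3 matrix given as nested lists."""
    return (m[0][0] * (m[1][1] * m[2][2] - m[1][2] * m[2][1])
            - m[0][1] * (m[1][0] * m[2][2] - m[1][2] * m[2][0])
            + m[0][2] * (m[1][0] * m[2][1] - m[1][1] * m[2][0]))


def _solve3(a, rhs):
    """Solve a 3 x 3 linear system by Cramer's rule."""
    d = _det3(a)
    if abs(d) < 1e-12:
        raise ValueError("registration points are collinear")
    solution = []
    for col in range(3):
        m = [row[:] for row in a]
        for r in range(3):
            m[r][col] = rhs[r]
        solution.append(_det3(m) / d)
    return solution


def fit_affine(image_pts, printer_pts):
    """Return a function mapping image (x, y) to printer (x, y).

    image_pts and printer_pts are the same three registration points
    expressed in the two coordinate systems.
    """
    a = [[x, y, 1.0] for x, y in image_pts]
    cx = _solve3(a, [p[0] for p in printer_pts])
    cy = _solve3(a, [p[1] for p in printer_pts])

    def transform(pt):
        x, y = pt
        return (cx[0] * x + cx[1] * y + cx[2],
                cy[0] * x + cy[1] * y + cy[2])

    return transform


if __name__ == "__main__":
    to_printer = fit_affine([(0, 0), (100, 0), (0, 100)],
                            [(10, 20), (60, 20), (10, 70)])
    print(to_printer((50, 50)))
```

Every protein spot selected on the scanned image can then be passed through the returned function to obtain its dispensing position.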


Automated On-membrane Tryptic Digestion of Protein Spots 


18. Program the chemical printer to dispense 1.5 nl (15 drops) of 1% (v/v) OGP onto each protein spot for three iterations (total volume equals 4.5 nl).

The use of the nonionic detergent OGP is necessary to wet the hydrophobic PVDF membrane surface, as well as to serve as a blocking reagent to minimize adsorption of the endoproteinase to the PVDF membrane.

FIGURE 8.48. An electroblot of E. coli proteins separated on a 2D polyacrylamide gel and attached to a MALDI target with conductive tape. Proteins are identified with PMF directly from the intact membrane. The corners of the MALDI target, labeled 1, 2, and 3, are used to register the protein spots in the chemical printer and in the Axima-CFR mass spectrometer. (Inset) Enlarged view of a section of the membrane where endoproteinase solutions have been dispensed onto the electroblotted proteins.

19. Program the chemical printer to deliver 1 nl (10 drops) of 200 ng/µl trypsin in 25 mM ammonium hydrogen carbonate on top of the detergent for 50 iterations (a final volume of 50 nl). This is equivalent to 10 ng of trypsin delivered to each protein spot.

For experiments acquiring PMF data from two different endoproteinases dispensed onto the same protein spot, deliver 0.1% (w/v) polyvinylpyrrolidone/50% (v/v) methanol using one iteration of 60 drops.

20. Place the printed membrane into a humidified container and incubate it for 3 hours at 37°C.

21. Following digestion, return the membrane to the chemical printer.

22. Add 50 fmoles/µl ACTH peptide to the matrix solution.

23. Program the printer to dispense 2 nl (20 drops) of matrix solution onto the digested protein spots with 50 iterations.

Using the dispensing iterations described above, the digested protein area on the protein spots is ~0.2 mm² (400-500-µm diameter). If the solutions were dispensed continuously, the digested area would be significantly larger and the resulting peptides diffused over the area. Figure 8.48 shows the PVDF electroblot of the E. coli proteins adhered to the MALDI target after the matrix has been applied to the digested proteins. The expanded region clearly shows the positions where the protein has been digested.
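
The dispensing quantities in Steps 18 to 23 follow from simple arithmetic, sketched below. The function names are hypothetical, and the spot is assumed to be a perfect circle when converting diameter to area.

```python
import math


def total_volume_nl(volume_per_iteration_nl, iterations):
    """Total volume dispensed onto one spot."""
    return volume_per_iteration_nl * iterations


def delivered_amount(total_nl, conc_per_ul):
    """Amount delivered, in the mass or mole unit of conc_per_ul."""
    return total_nl / 1000.0 * conc_per_ul


def drop_volume_pl(volume_nl, drops):
    """Average volume of a single drop in picoliters."""
    return volume_nl * 1000.0 / drops


def spot_area_mm2(diameter_um):
    """Area of a circular spot of the given diameter."""
    return math.pi * (diameter_um / 2000.0) ** 2


if __name__ == "__main__":
    print("OGP total (nl):", total_volume_nl(1.5, 3))
    trypsin_nl = total_volume_nl(1.0, 50)
    print("trypsin (ng):", delivered_amount(trypsin_nl, 200.0))
    matrix_nl = total_volume_nl(2.0, 50)
    print("ACTH (fmol):", delivered_amount(matrix_nl, 50.0))
    print("drop (pl):", drop_volume_pl(1.5, 15))
    print("area at 500 µm (mm²):", spot_area_mm2(500.0))
```

These values reproduce the 4.5 nl of OGP and 10 ng of trypsin stated in the protocol, imply about 100 pl per drop, and show that a 500-µm spot corresponds to roughly 0.2 mm².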


MALDI-TOF-MS 


24. Introduce the entire PVDF membrane (adhered to the MALDI target) into the mass spectrometer. A coordinate file corresponding to the jetting positions is converted into an Axima-CFR stage file (see Figure 8.49a,b), with the Axima-CFR software Ascii2plate.

25. Acquire all of the mass spectra in positive-ion mode. Ionize the peptides with a 337-nm nitrogen laser and accelerate them with a 25-kV time-delayed extraction pulse.

Despite the membrane having a surface that is not flat, quality spectra are obtained by acquiring all spectra from a single position (see Figure 8.49c).

FIGURE 8.49. Automated peptide mass collection from proteins digested on-membrane. (a) Axima CFR control window; (b, inset) a MALDI target coordinate file (defining the positions of each of the piezoelectric dispensing locations) created using the chemical printer and converted into an Axima CFR stage file; (c) resultant PMF spectrum of albumin fragment (see Table 8.15) acquired directly from the membrane.

26. To obtain mass spectra, average 50-100 individual laser shots on on-membrane-digested samples. Use an internal 2-point calibration for on-membrane-digested samples on one trypsin autolysis peptide (monoisotopic mass of 842.51) and ACTH peptide (monoisotopic mass of 2465.19).

27. Perform monoisotopic peak picking using a custom peak harvesting program (Breen et al. 2000) and identify proteins using either a PMF search engine (which searches GenBank, www.ncbi.nlm.nih.gov/Genbank/index.html) or ExPASy PeptIdent (www.expasy.ch).
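
The internal 2-point calibration in Step 26 can be pictured as a straight-line correction fixed by the two reference peaks. The sketch below makes a simplifying assumption: it corrects measured m/z values linearly, whereas instrument software typically calibrates in the time-of-flight domain. The function name and the example observed values are hypothetical.

```python
def two_point_calibration(observed, reference=(842.51, 2465.19)):
    """Return a function that maps measured m/z onto calibrated m/z.

    observed: measured m/z of the trypsin autolysis peptide and of the
    ACTH peptide, in that order. reference: their expected
    monoisotopic values as given in the protocol.
    """
    (o1, o2), (r1, r2) = observed, reference
    if o1 == o2:
        raise ValueError("the two calibrant peaks must differ")
    slope = (r2 - r1) / (o2 - o1)
    intercept = r1 - slope * o1
    return lambda mz: slope * mz + intercept


if __name__ == "__main__":
    # Hypothetical measured positions of the two calibrant peaks.
    calibrate = two_point_calibration((842.81, 2465.79))
    print(round(calibrate(1500.00), 3))
```

Because both calibrants are present in every spectrum, the correction can be recomputed for each spot, which helps with the position-dependent calibration error of a membrane that is not perfectly flat.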

PEPTIDE MASS FINGERPRINT ANALYSIS OF E. COLI AND HUMAN PLASMA PROTEOMES


MALDI-TOF mass spectrometric analysis of the proteins digested on an E. coli electroblot (Figure 8.50) resulted in peptide mass fingerprint (PMF) identification of 56 protein spots from a search of the SwissProt database (see Table 8.14). The mass range covered was evenly distributed from DNA-binding protein H-NS (predicted mass 15,530 daltons) to HSP-70 protein (predicted mass 68,983 daltons). The average coverage was 31.5% with a sample high of 65.4% (adenylate kinase, predicted mass 23,571) and a sample low of 8.3% (Hsp70 protein, predicted mass of 68,983 daltons). Several protein spots appeared to be in low abundance based on the staining of Direct Blue-71: spot 46 (stringent starvation protein, 53.3% amino acid coverage), spot 7 (prolyl-tRNA synthetase, 11.4% amino acid coverage), and spot 23 (coding sequence of gene ipd, 21.1% amino acid coverage).

As part of an ongoing study to understand the influence of glycosylation on the separation properties of glycoproteins, proteins were identified from a 2D electroblot of a human plasma sample (Figure 8.51). Whereas E. coli proteins migrate essentially as predicted from their gene sequence (the correlation coefficient for the apparent mass of the proteins in Figure 8.50 was R² = 0.966), the same is not apparent for plasma-derived glycoproteins. The correlation coefficient for the apparent mass of the plasma proteins in Figure 8.51 is R² = 0.74. Whereas the pI is often difficult to measure, the observed correlation coefficient for the E. coli proteins was quite good at R² = 0.8. For the plasma proteins, there is essentially no correlation with an R² = 0.38.
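
The R² values quoted above compare predicted and apparent (observed) properties of the identified proteins. A minimal sketch of that calculation follows, taking R² as the square of the Pearson correlation coefficient; whether the authors computed it exactly this way is an assumption, and the numbers in the usage example are toy values, not data from Tables 8.14 or 8.15.

```python
def r_squared(predicted, observed):
    """Square of the Pearson correlation between two equal-length lists."""
    if len(predicted) != len(observed) or len(predicted) < 2:
        raise ValueError("need two lists of equal length (at least 2 values)")
    n = len(predicted)
    mean_p = sum(predicted) / n
    mean_o = sum(observed) / n
    s_po = sum((p - mean_p) * (o - mean_o) for p, o in zip(predicted, observed))
    s_pp = sum((p - mean_p) ** 2 for p in predicted)
    s_oo = sum((o - mean_o) ** 2 for o in observed)
    return s_po * s_po / (s_pp * s_oo)


if __name__ == "__main__":
    # Toy masses in kD, for illustration only.
    predicted_mass = [15.5, 23.6, 45.0, 69.0]
    apparent_mass = [16.0, 24.5, 43.0, 71.0]
    print(round(r_squared(predicted_mass, apparent_mass), 3))
```

A value near 1 means that migration is well predicted by the sequence, as reported for the E. coli proteins, whereas glycosylation lowers the value for plasma proteins.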

Despite the poor predictability of glycoprotein migration, 32 protein spots were identified with PMF MALDI-TOF-MS and the results are shown in Table 8.15. The mass range covered was distributed from transthyretin (predicted mass 13,761 daltons) to serotransferrin (predicted mass 75,181 daltons). The average coverage was 37%, with a sample high of 66.3% (apolipoprotein A-I, predicted mass 28,078 daltons) and a sample low of 15.2% (α-2-HS-glycoprotein, predicted mass of 30,221 daltons).

Overall, the sequence coverage for proteins identified with on-membrane digestion is typically slightly lower than that of the standard "in-gel" digestion procedures practiced in our laboratories. This is not unexpected for two reasons. First, the area digested for "in-gel" digestion is typically three times larger than the "on-membrane" digestion: ~0.6 mm² compared to ~0.2 mm². Second, the in-gel digests are normally purified and concentrated with C18 ZipTips. Both of these factors would contribute to a larger coverage obtained for in-gel-digested samples. However, the advantage of the on-membrane approach is that most of the protein remains available for subsequent experiments, such as a second endoproteinase digestion (see panel on MULTIPLE ENDOPROTEINASE REACTIONS WITH THE CHEMICAL PRINTER following Table 8.15) or characterization of the sites of N-linked glycosylation using sequential PNGase F and trypsin digestion (Sloane et al. 2002).

FIGURE 8.50. A 2D polyacrylamide gel of E. coli proteins electroblotted onto PVDF (Immobilon-PSQ). Labeled protein spots were digested on-membrane using the chemical printer, and peptide masses were collected using MALDI-TOF-MS.

FIGURE 8.51. Protein spots identified on a 2D polyacrylamide gel of human plasma proteins electroblotted onto Immobilon-PSQ. Labeled protein spots were digested on-membrane using the chemical printer and peptide masses were collected using MALDI-TOF-MS. Protein spots 1 and 7 were identified as human serum albumin that had not completely focused due to the high concentration of albumin in plasma.


MULTIPLE ENDOPROTEINASE REACTIONS WITH THE CHEMICAL PRINTER


Control of the ink-jet devices allows exquisite precision of the volumes dispensed. Hence, 
multiple endoproteinase digestions can be performed on many of the protein spots. 
Proteins from a human plasma sample were separated on an 11-cm IPG with a pI range of 5-6.5. Under these conditions, apolipoprotein A-IV is clearly resolved from the β-chain of haptoglobin (Figure 8.52) and was analyzed on the chemical printer with both endoproteinase Glu-C and trypsin. The two printed positions are shown in Figure 8.52.

Despite the successful dispensing of the acetonitrile-based matrix solution, this reagent diffuses into the membrane easily due to its nonpolar properties. Combined with a small error in the registration between the chemical printer point file and the Axima-CFR MALDI stage file, as well as the added variable of the size of the MALDI laser beam (~110-µm diameter), we discovered that better quality results were obtained by aspirating the multiple digests from the membrane surface onto a micro-C18 ZipTip (Millipore) rather than desorbing them directly in the Axima-CFR MALDI-TOF-MS. Peptides were desorbed from the micro-C18 ZipTip directly onto an Axima CFR MALDI target, and the resulting PMF data obtained a sequence coverage of 29.8% for trypsin and 21.5% for endoproteinase Glu-C, a combined amino acid coverage of 41.2% (see Figure 8.52). This increase, as well as the matching being derived from two endoproteinases, increases the confidence of protein identification.
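
The combined coverage (41.2%) is smaller than the sum of the two individual coverages (29.8% + 21.5%) because some residues are covered by peptides from both digests. A minimal sketch of how a combined coverage can be computed as the union of covered residues is given below; the function names are hypothetical and the sequence and peptides in the usage example are toy values, not apolipoprotein A-IV data.

```python
def covered_residues(protein, peptides):
    """Set of 0-based residue positions covered by any of the peptides."""
    covered = set()
    for peptide in peptides:
        start = protein.find(peptide)
        while start != -1:
            covered.update(range(start, start + len(peptide)))
            start = protein.find(peptide, start + 1)
    return covered


def coverage_percent(protein, *peptide_sets):
    """Percent of residues covered by the union of all peptide sets."""
    covered = set()
    for peptides in peptide_sets:
        covered |= covered_residues(protein, peptides)
    return 100.0 * len(covered) / len(protein)


if __name__ == "__main__":
    protein = "MKTAYIAKQR"        # toy sequence
    tryptic = ["MKT"]             # toy peptides from digest 1
    glu_c = ["TAY"]               # toy peptides from digest 2
    print(coverage_percent(protein, tryptic))
    print(coverage_percent(protein, glu_c))
    print(coverage_percent(protein, tryptic, glu_c))
```

In the toy example each digest covers 30% of the sequence, but the combined coverage is 50% rather than 60% because one residue is shared.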

FIGURE 8.52. Apolipoprotein A-IV was digested with the endoproteinases trypsin and Endo Glu-C on the marked positions of the protein spot. Peptide mass fingerprinting analysis generated a combined amino acid coverage of 41.2%. The gray-shaded peptides were identified with trypsin, and the blue-shaded peptides were identified with Endo Glu-C.


"TOP-DOWN" PROTEIN SEQUENCE ANALYSIS USING MS/MS*


Gavin E. Reid and Scot A. McLuckey 
Department of Chemistry, Purdue University, Indiana 


Recently, approaches toward protein identification and characterization have been developed that enable primary sequence information to be obtained directly from the gas-phase dissociation of intact protein ions without prior recourse to extensive separation or proteolytic digestion steps [1]. In this scenario, intact multiply charged protein ions derived from ESI are isolated and then fragmented in the mass spectrometer; if necessary, the resulting product ions are subjected to further fragmentation, until sufficient product ions are generated to allow for the identification of the protein. Protein identification can be carried out by derivation of a "sequence tag" [2-4], via database searching of the uninterpreted product ion spectrum [5,6], or through "de novo" assignment of the complete amino acid sequence [7,8]. This approach has been termed "top-down" protein characterization (Figure 8.53) [1]. The major advantage of the top-down approach is that performing an MS/MS experiment on intact protein ions makes the entire sequence available for examination, better enabling complete characterization of the protein and any associated posttranslational modifications [5,9-12]. Additionally, the large number of redundant identifications that are usually obtained with conventional "bottom-up" peptide sequencing strategies can be avoided. Furthermore, the masses of intact proteins are spread over a wider range than those of proteolytically derived peptides, potentially simplifying the task of analyzing proteins in complex mixtures.

FIGURE 8.53. Schematic overview comparing "bottom-up" and "top-down" protein characterization approaches.

*The numbers in brackets indicate reference citations.

In the first whole-protein ion-dissociation studies, triple-quadrupole instrumentation was used to collect MS/MS data from multiply charged protein ions derived from ESI [13-19]. Product ion spectra derived from these ions are typically composed of ions with charges ranging from unity up to the charge of the parent ion. However, the limited resolving power associated with mass measurements in triple-quadrupole instruments was insufficient to resolve the isotope spacings in product ion signals, thereby precluding charge state determination of the individual product ions. The fact that product ion charge is not known a priori can complicate interpretation of the spectrum. Therefore, the application of MS/MS to whole-protein ions using triple-quadrupole MS has been largely restricted to protein fingerprinting measurements (i.e., the comparison of uninterpreted product ion spectra).


Fourier Transform-Ion Cyclotron Resonance Mass Spectrometry


The issue of product ion charge state determination can be overcome by the use of high-magnetic-field-strength Fourier transform-ion cyclotron resonance (FT-ICR) MS [1,2,4,5,7,8,10-12,20-50]. This form of instrumentation enjoys sufficient resolving power (typically >10^5) to measure the isotope spacings in high-mass product ions and provides high mass accuracy (<10 ppm) for facile product ion identification. This technology therefore allows for the interpretation of product ion spectra such that reliable sequence information can be obtained from the fragmentation of highly multiply charged protein ions. A variety of activation techniques for protein ions have been examined in conjunction with FT-ICR, including collision-activated dissociation (CAD) [20-31], photodissociation [32-36], surface-induced dissociation (SID) [37], and blackbody infrared radiative dissociation (BIRD) [38].
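
The reason resolved isotope spacings reveal the charge state is that adjacent isotope peaks of an ion with charge z are separated by about 1.00335/z on the m/z axis (the 13C-12C mass difference divided by the charge). A minimal sketch of that inference follows; the function names and the example peak list are hypothetical, and the neutral-mass formula assumes that all charges are protons.

```python
ISOTOPE_SPACING = 1.00335   # mass difference between 13C and 12C, daltons
PROTON = 1.00728            # mass of a proton, daltons


def charge_from_isotope_peaks(mz_peaks):
    """Estimate the charge state from consecutive isotope peak positions."""
    if len(mz_peaks) < 2:
        raise ValueError("at least two isotope peaks are required")
    spacings = [b - a for a, b in zip(mz_peaks, mz_peaks[1:])]
    mean_spacing = sum(spacings) / len(spacings)
    return round(ISOTOPE_SPACING / mean_spacing)


def neutral_mass(mz, z):
    """Neutral mass of an [M+zH]z+ ion observed at the given m/z."""
    return z * (mz - PROTON)


if __name__ == "__main__":
    peaks = [1000.0, 1000.10034, 1000.20067]   # hypothetical isotope cluster
    z = charge_from_isotope_peaks(peaks)
    print(z, round(neutral_mass(peaks[0], z), 3))
```

An instrument that cannot separate peaks about 0.1 m/z unit apart, as in this example, cannot assign the charge in this way, which is the limitation described above for triple-quadrupole measurements.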

An alternative dissociation technique for implementation on FT-ICR instruments, termed
electron-capture dissociation (ECD), was introduced by McLafferty et al. [39-50]. ECD
involves capture of low-energy (<1 eV) electrons by multiply charged [M+nH]ⁿ⁺ protein ions
to yield odd-electron reduced [M+nH]⁽ⁿ⁻¹⁾⁺• products, which rapidly dissociate by cleavage
of the N-Cα linkage of the peptide backbone. ECD has been shown to result in more
random fragmentation along the peptide backbone for small proteins compared to CAD
[44,45]. Additionally, the loss of labile posttranslational modifications, such as carboxylation,
glycosylation, phosphorylation, and sulfation, is less evident in ECD-derived product ion
spectra, thereby better enabling their characterization [42,43,48]. Recently, an improvement
of the ECD method has been described [47], whereby ions are collided with a background
gas while subjecting them to electron capture. This activated ion ECD (AI-ECD) method has
the effect of increasing the number of observed fragment ions, presumably by breaking
intramolecular noncovalent bonds of the secondary and tertiary structures of the ions that
otherwise would prevent separation of the fragmentation products arising from ECD cleavage
of the protein backbone. For example, Figure 8.54 shows the AI-ECD spectrum obtained
from the [M+20H]²⁰⁺ to [M+40H]⁴⁰⁺ ions of the 29-kD protein carbonic anhydrase. This
activation method produces 116 inter-residue cleavages of the possible 258 inter-residue
amino acid bonds, whereas CAD of this protein produced only 66 of 258 cleavages [1], and
ECD of the isolated [M+34H]³⁴⁺ ion produced no cleavages [46]. To date, extensive sequence
information on proteins as large as 45 kD has been obtained using the AI-ECD technique
[50].

FIGURE 8.54. Activated ion-electron capture dissociation (AI-ECD) spectrum of ions from ESI of
carbonic anhydrase B subjected to in-beam collisions during exposure to low-energy electrons.
(Reproduced, with permission, from [47] Horn et al. 2000a [©American Chemical Society].)
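
As a quick check on the figures quoted above, the fraction of backbone bonds cleaved by each activation method can be computed directly. The following is a minimal Python sketch; the function name `cleavage_coverage` is illustrative only, and the counts are those given in the text for carbonic anhydrase.

```python
def cleavage_coverage(observed: int, possible: int) -> float:
    """Return the fraction of inter-residue bonds at which cleavage was observed."""
    if possible <= 0 or not 0 <= observed <= possible:
        raise ValueError("need 0 <= observed <= possible and possible > 0")
    return observed / possible


# Counts quoted in the text for carbonic anhydrase (258 inter-residue bonds).
POSSIBLE_BONDS = 258
coverage = {
    "AI-ECD": cleavage_coverage(116, POSSIBLE_BONDS),
    "CAD": cleavage_coverage(66, POSSIBLE_BONDS),
}

for method, fraction in coverage.items():
    print(f"{method}: {fraction:.1%} of backbone bonds cleaved")
```

With these counts, AI-ECD cleaves roughly 45% of the inter-residue bonds and CAD roughly 26%.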

Two recent papers have demonstrated the utility of FT-ICR MS for the identification of
unknown proteins from simple mixtures. Kelleher and co-authors have identified a number
of archaeal or bacterial proteins ranging from 7 kD to 36 kD, present in a mixture of modest
complexity. Interrogation of the uninterpreted product ion spectra, using only three or four
nonadjacent fragment ions, a mass accuracy of ±0.1 dalton, and no intact mass bias, resulted
in correct protein identification from a database containing 5000 proteins [5]. Fenselau et al.
have also used intact protein ion dissociation in an FT-ICR, coupled with database searches
using a sequence tag, to identify the major ~7-kD biomarker derived from an extract of
Bacillus cereus T spores, as well as its methionine-oxidized derivative [4].


Ion/Ion Reactions and Quadrupole Ion-trap Mass Spectrometry


An alternative approach to resolving product ion charge state ambiguity relies on the use of
gas-phase ion chemistry, rather than high resolving power, as a tool for measuring product
ion masses. Such an approach enables the use of relatively low-resolving-power mass analyzers
for performing whole-protein MS/MS measurements. The most fully developed example
in which chemical reactions are used to overcome product ion charge state ambiguities
involves the use of ion/ion proton transfer reactions to reduce the multiply charged product
ions largely to the +1 charge state [51-72]. Conversion of product ions to the +1 charge state
after parent ion dissociation greatly relaxes the mass resolution requirements for product ion
identification, thereby allowing interpretable MS/MS spectra of proteins of at least 20 kD to
be obtained in instruments with modest resolving power (e.g., M/ΔM = 1000-2000) [63-72].
To date, this approach has been used exclusively in quadrupole ion-trap instrumentation.
Figure 8.55a provides an illustrative case whereby the multiply charged ion of a protein of
modest size, i.e., the [M+11H]¹¹⁺ ion of the 17-kD protein apomyoglobin, can be readily
dissociated in the ion trap, but the spectrum cannot be interpreted a priori due to charge state
ambiguity associated with the limited mass resolving power of the ion-trap mass analyzer.
Note that it is only with the benefit of the post-ion/ion MS/MS data (discussed below) that
the identity of many of the multiply charged product ions may be assigned. Additionally, a
number of the potential product ions labeled in the spectrum cannot be unambiguously
assigned. For example, a resolving power of greater than 16,000 would be required to
unambiguously assign the y-type (m/z 1635.39) and b-type (m/z 1635.49) product ions, whose
multiply charged pre-ion/ion m/z values differ by only 0.1. This problem is readily overcome
by the use of ion/ion proton transfer reactions to reduce the multiply charged products largely
to +1 charge states. The post-ion/ion proton transfer reaction product ion spectrum of the
[M+11H]¹¹⁺ ion of apomyoglobin, acquired after subjecting the multiply charged product
ions shown in Figure 8.55a to ion/ion proton transfer reactions with singly charged anions
derived from atmospheric sampling glow discharge ionization (ASGDI) of
perfluoro-1,3-dimethylcyclohexane (PDCH), is shown in Figure 8.55b. These product ions may
be readily assigned as either b- or y-type product ions by comparison with the expected
product ion masses of apomyoglobin [69].

FIGURE 8.55. Quadrupole ion-trap CAD MS/MS spectra of the [M+11H]¹¹⁺ ions of apomyoglobin. (a)
MS/MS product ion spectra obtained using on-resonance collisional activation. (b) Post-ion/ion reaction
MS/MS product ion spectra using the same collisional activation conditions employed in a, and obtained
after ion/ion reactions with PDCH anions. (Adapted, with permission, from [69] Newton et al. 2002.)
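
The resolving-power figure quoted above follows from the usual definition M/ΔM applied to the two m/z values in the text. The Python sketch below reproduces that arithmetic and also shows why charge reduction helps: once each product ion carries a single charge, ions of different mass no longer share an m/z value. The charge states used in the second half (5+ and 8+) are hypothetical values chosen only for illustration, since the text does not give them, and the function names are likewise illustrative.

```python
PROTON_MASS = 1.007276  # mass of a proton in Da


def required_resolving_power(mz_a: float, mz_b: float) -> float:
    """Resolving power M/dM needed to separate two peaks at mz_a and mz_b."""
    return ((mz_a + mz_b) / 2.0) / abs(mz_a - mz_b)


def mz_after_charge_reduction(mz: float, z: int) -> float:
    """m/z of the +1 ion formed when a z+ ion gives up (z - 1) protons."""
    neutral_mass = z * (mz - PROTON_MASS)
    return neutral_mass + PROTON_MASS


# m/z values quoted in the text for the two unresolved product ions.
resolving_power = required_resolving_power(1635.39, 1635.49)
print(f"required M/dM: {resolving_power:.0f}")

# Hypothetical charge states (NOT from the text), used only to illustrate
# how far apart the two ions move once both are reduced to the +1 state.
y_singly = mz_after_charge_reduction(1635.39, z=5)
b_singly = mz_after_charge_reduction(1635.49, z=8)
print(f"after charge reduction: {y_singly:.2f} vs {b_singly:.2f}")
```

Under these assumed charge states, the two ions end up thousands of m/z units apart, which is well within the capability of an analyzer with M/ΔM of 1000-2000.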

To date, the fragmentation of a wide range of proteins has been studied, including
ribonuclease A [9], melittin [59], lysozyme [63], hemoglobin β-chain [64], insulin [66], ubiquitin
[59,68], holo- and apomyoglobin [69,70], ferri-, ferro-, and apocytochrome c [67,71], and
bacteriophage MS2 coat protein [65,72]. In several of these cases, collisional activation of
individual precursor ion charge states in the quadrupole ion trap has resulted in dissociation of more
than 50% of the amide bonds along the protein backbone [69], with more than 80% coverage
obtained from several charge states [71]. The degree of sequence coverage obtained for a
protein of interest can be further extended by the use of multistage MS/MS (MSⁿ) of selected
first-generation product ions [68]. It has also been demonstrated that dissociation of whole protein
ions under ion-trap collisional activation conditions can be used to characterize
posttranslational modifications, such as the N-linked glycosylation site of ribonuclease B [9].

Importantly, ion/ion reactions may also be used to manipulate the charge states of
multiply charged precursor ion populations, to form a range of precursor ion charge states lower
than those produced by ESI alone [9,68,69,71,72], as well as to facilitate the gas-phase
purification of individual components of complex protein mixtures. In the latter case, a
"double-isolation" experiment is performed, whereby an initial m/z region containing a charge state of
the protein of interest is first isolated and then subjected to a short ion/ion reaction period.
Following the ion/ion reaction, isolation of a second m/z region corresponding to a
lower-charge state of the protein of interest can be used to "charge state" resolve the protein from
all other proteins of different mass initially present in the first isolated m/z region [6,65,7?].
One limitation of this "double-isolation" technique is that ions from a given protein charge
state are distributed over several lower-charge states during the ion/ion reaction, thereby
diluting the protein ion signal and decreasing the sensitivity for subsequent dissociation.
Recently, however, it has been demonstrated that the rates of ion/ion reactions in a
quadrupole ion trap may be selectively inhibited in a mass-to-charge selective fashion during the
ion/ion reaction period, allowing essentially all of the ion current of a protein of interest to
be concentrated into a single-charge state. This technique is referred to as "ion parking" [73].
When combined with the double-isolation technique, the ion-parking approach can be used
to facilitate the gas-phase concentration and purification of selected precursor ions from
complex protein mixtures for subsequent dissociation. In an example of this approach,
multiply charged precursor ions of five of the abundant protein components present in a
relatively complex mixture derived from a whole-cell lysate HPLC fraction of E. coli containing
~30 components, ranging in mass from 7 kD to 10 kD, were subjected to this
concentration/purification procedure and then dissociated by collisional activation. Interrogation of
the uninterpreted product ion spectra against a partially annotated protein sequence
database, coupled with a scoring scheme based on the relative abundances of the experimentally
observed product ions and the frequency of fragmentations occurring at preferential cleavage
sites, resulted in the unambiguous identification of four out of five of these proteins [6].
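
The logic of the double-isolation experiment described above can be illustrated with the relation m/z = (M + z × 1.007276)/z. The Python sketch below uses two hypothetical proteins (8,000 Da and 10,000 Da, values invented for illustration and not taken from the text) whose ions coincide in the first isolation window and are separated after both lose the same number of protons. It follows a single charge-reduction step only; it does not model the spread over several charge states or ion parking.

```python
PROTON_MASS = 1.007276  # mass of a proton in Da


def mz_of(neutral_mass: float, charge: int) -> float:
    """m/z of an [M+zH]z+ ion for a given neutral mass and charge."""
    if charge < 1:
        raise ValueError("charge must be a positive integer")
    return (neutral_mass + charge * PROTON_MASS) / charge


# Hypothetical proteins (illustrative masses, not from the text).
protein_a_mass = 8000.0
protein_b_mass = 10000.0

# First isolation: the 8+ ion of A and the 10+ ion of B share one m/z window.
first_window = (mz_of(protein_a_mass, 8), mz_of(protein_b_mass, 10))

# After an ion/ion reaction period in which each ion loses three protons,
# the two proteins appear at clearly different m/z values.
protons_removed = 3
second_window = (
    mz_of(protein_a_mass, 8 - protons_removed),
    mz_of(protein_b_mass, 10 - protons_removed),
)

print(f"before reaction: {first_window[0]:.2f} and {first_window[1]:.2f}")
print(f"after reaction:  {second_window[0]:.2f} and {second_window[1]:.2f}")
```

Isolating a second window around the reduced-charge ion of the protein of interest then excludes the other component, which is the "charge state" resolution referred to in the text.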
Future Developments in Top-Down Protein Characterization


Understanding the fragmentation behavior of whole-protein ions 


Protein ion dissociation behavior is currently less well understood than peptide ion
dissociation, in part because the body of observations associated with protein ion dissociations is
still relatively small. Therefore, if intact protein sequencing approaches are to find more
general use, the factors governing the fragmentation of multiply charged protein ions, such as
the influence of precursor ion charge state on fragmentation, the role of primary, secondary,
and tertiary structures, as well as the presence of posttranslational modifications, must be
explored. A number of studies on the factors affecting the dissociation of multiply charged
protein ions have been performed in a quadrupole ion-trap mass spectrometer. It has been
shown that the fragmentation of whole-protein ions under ion-trap collisional activation
conditions is strongly influenced by the precursor ion charge state [63-72]. Generally, a
limited number of intermediate charge states give rise to nonspecific cleavages of the protein
backbone, thereby allowing derivation of sequence tags for subsequent database searching. At
other charge states, the facile loss of NH₃ or H₂O (very low-charge states) and preferential
cleavage at the carboxyl terminus of aspartic acid and lysine residues (low-charge states) and
at the amino terminus of proline residues (high-charge states) are often the dominant
fragmentation products observed. Although many of these cleavages are also observed in the
fragmentation of peptide ions, cleavage at the carboxyl terminus of lysine residues,
particularly when present at the amino terminus of an adjacent histidine residue or in a sequence of
residues containing a number of adjacent basic sites, has not been noted previously [69,71].
Thus, further examination of the fragmentation of whole proteins is warranted in order to
identify the general charge state dependence of whole-protein ion dissociations and the
mechanisms behind these fragmentation reactions, and to maximize the amount of
structural information that can be obtained from selected charge states.
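
The residue-specific preferences described above can be turned into a simple lookup of candidate backbone bonds. The following Python sketch lists, for a given sequence, the bonds carboxy-terminal to Asp and Lys and amino-terminal to Pro. The function name, the bond-numbering convention (bond i lies between residues i and i+1), and the test sequence are all assumptions made for illustration; the sketch only locates such bonds and makes no prediction of ion abundances.

```python
def preferential_cleavage_bonds(sequence: str) -> dict:
    """Locate backbone bonds at the residue-specific sites named in the text.

    Bond i (1-based) is the amide bond between residue i and residue i + 1.
    """
    sites = {
        "C-terminal to Asp": [],
        "C-terminal to Lys": [],
        "N-terminal to Pro": [],
    }
    seq = sequence.upper()
    for i in range(len(seq) - 1):
        bond = i + 1
        left, right = seq[i], seq[i + 1]
        if left == "D":
            sites["C-terminal to Asp"].append(bond)
        if left == "K":
            sites["C-terminal to Lys"].append(bond)
        if right == "P":
            sites["N-terminal to Pro"].append(bond)
    return sites


# Hypothetical sequence used only to exercise the function.
example_sites = preferential_cleavage_bonds("AKDGPLKHV")
for label, bonds in example_sites.items():
    print(f"{label}: bonds {bonds}")
```

For this test sequence the Lys-His bond (bond 7) is flagged, which corresponds to the lysine cleavage adjacent to histidine that the text notes as characteristic of whole-protein ions.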


Novel instrumentation 


The application of MS/MS for peptide-sequencing applications, coupled with the
development of modern biomolecular ionization methods, has been the major driving force in
commercial instrument development throughout much of the past decade. Although new
ionization methods have enabled the study of large ionic species in the gas phase, with the
increasing recognition of the value of “top-down” MS/MS approaches for protein
identification/characterization, it is likely that future instrument developments will be directed toward
improving the instrument capabilities of MS/MS specifically for high-mass ions. Mass
analyzer characteristics that are most important for high-mass ions include mass resolution,
mass accuracy, mass range, sensitivity, dynamic range, and speed (i.e., mass spectral
acquisition rate) [61]. To date, most protein ion dissociation studies have been conducted using
high-field FT-ICR, triple-quadrupole tandem mass spectrometers, and specialized
quadrupole ion-trap instrumentation. The use of high-magnetic-field-strength FT-ICR for tandem
mass spectrometry of high-mass ions will continue to grow due to an increasing number of
practitioners making use of the remarkable flexibility and mass analysis capabilities of this
tool. However, it seems likely that new instruments of lower cost and, perhaps, higher speed
than FT-ICR will be developed for MS/MS of high-mass ions. For example, state-of-the-art
commercial tandem mass spectrometers employing high-performance time-of-flight mass
analyzers are beginning to be applied to whole-protein ion analysis. The significantly
superior resolving power and mass accuracy afforded to product ion measurement in modern
quadrupole time-of-flight instruments relative to triple-quadrupole instruments allow for
improved performance in “top-down” applications using beam-type tandem mass
spectrometers [74]. Electrodynamic ion-trapping tools, in both two- and three-dimensional
versions, are likely to be used prior to the time-of-flight instrument to facilitate precursor ion
accumulation, purification, and dissociation.


Novel chemistries 


Realization of the potential of mass spectrometry for whole-protein identification and
characterization will require further advances in the understanding of the behavior of gas-phase
macro-ions derived from proteins. Given the high dimensionality presented by gaseous
protein ions (i.e., primary, secondary, tertiary, and quaternary structures, charge state,
posttranslational modifications, etc.), it is expected that a rich array of ion chemistries might be
utilized. By far, the most widely studied aspect of protein ion chemistry has been the
unimolecular dissociation behavior of the ions. Although a variety of ion activation methods
intended to induce dissociation have been applied to protein ions, only a few have been
characterized with a number of different proteins. Therefore, the relative merits of the vast array
of activation methods thus far developed for simple polyatomic ions have not yet been
clearly delineated for macro-ions.

It has already been demonstrated that ion/electron and ion/ion reactions can have important
roles in “top-down” protein identification approaches. To date, ion/ion reactions have
been used largely as a means for manipulation of ion charge. However, it is likely that novel 
ion/ion chemistries will be developed that can provide structural information. One such 
example might be the selective covalent modification of a protein ion in the gas phase followed 
by ion activation/dissociation. Such reactions might prove to be an attractive alternative to the 
modification of protein ions in solution for subsequent tandem mass spectrometry. 

The use of ion/molecule reactions also has a potential role in protein structural analysis.
To date, ion/molecule reactions involving protein ions have largely been restricted to proton
transfer reactions [75-77] and hydrogen/deuterium exchange [78-82]. However, other types
of ion/molecule reactions may also prove to be useful in the study of high-mass ions. For
example, it has been shown that the sum of the protein ion charge and the maximum
number of molecules of hydroiodic acid (HI) that attach to an ion in the gas phase correlates with
the number of basic sites in the protein, therefore providing information about the
composition of a protein [83-87]. Other ion/molecule reactions of potential use to protein sequence
analysis involve gas-phase derivatization [88-90]. Although these types of reactions have thus
far been applied only to peptides, this class of ion/molecule reaction could be of use in
directing the dissociation of whole protein ions to desired fragmentation channels, thereby
facilitating sequence determination. Given the relatively limited attention thus far paid to the
ion/ion and ion/molecule chemistries of protein ions, it seems likely that other types of
reactions with relatively high specificity might also be discovered that could have useful roles in
“top-down” sequencing strategies.
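
The HI attachment relationship mentioned above (ion charge plus maximum number of attached HI molecules tracks the number of basic sites) lends itself to a short worked example. The Python sketch below counts basic sites in a sequence and estimates the maximum HI uptake at a given charge state. It assumes that the basic sites are the Arg, Lys, and His side chains plus the amino-terminal amine; that definition, the function names, and the test sequence are assumptions for illustration rather than details given in the text.

```python
BASIC_RESIDUES = frozenset("RKH")


def count_basic_sites(sequence: str) -> int:
    """Count Arg, Lys, and His side chains plus the amino-terminal amine.

    Treating the amino terminus as a basic site is an assumption of this sketch.
    """
    side_chains = sum(1 for residue in sequence.upper() if residue in BASIC_RESIDUES)
    return side_chains + 1


def max_hi_attachment(sequence: str, charge: int) -> int:
    """Estimate maximum HI uptake as (basic sites - charge), never below zero."""
    return max(count_basic_sites(sequence) - charge, 0)


# Hypothetical peptide used only to exercise the functions.
peptide = "GKRHAAKLV"
basic_sites = count_basic_sites(peptide)
for z in (1, 3, 5):
    print(f"charge {z}+: up to {max_hi_attachment(peptide, z)} HI attached")
```

Read in the other direction, an observed charge and maximum HI uptake give an estimate of the number of basic residues, which is the compositional information referred to in the text.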
FRAGMENTATION MECHANISMS OF PROTONATED PEPTIDES IN THE GAS PHASE*


Richard A.J. O'Hair
School of Chemistry, University of Melbourne, Australia 


The successful sequencing of peptides using MS/MS hinges upon the formation of a complete
set of sequence ions via random cleavage of each of the peptide bonds. This is typically
achieved via collision-induced dissociation (CID) of protonated peptides, which yields the
complementary b and y sequence ion series (see the section on CID in the chapter
introduction). The universally accepted nomenclature for the sequence ions formed in these
experiments is shown in Figure 8.56 [1,2]. In an ideal world, each of the sequence ions would be
formed in equal abundance, allowing the peptide sequences to be "read out" from the MS/MS
spectrum. In reality, this is not the case, because some sequence ions are not observed and
some side reactions or rearrangements complicate the mass spectrum, which in the worst case
prevents the investigator from being able to assign the sequence of a peptide. To understand
the causes of these complications, it is necessary to examine the mechanisms of protonated
peptide fragmentation and how they control the types of fragment ions observed in an
MS/MS experiment. One of the most important considerations is the location of the site of
protonation (governed by the local proton affinities of the various sites within a peptide [3])
and how this relates to the fragmentation reactions observed. To illustrate key concepts for
understanding fragmentation processes of protonated peptides, the structures and
mechanisms associated with the formation of five key types of ions are briefly discussed: (1) a₁ ions,
(2) b₁ ions, (3) bₙ ions (n ≥ 2), (4) yₙ ions, and (5) nonsequence ions. For more detailed
discussions of fragmentation mechanisms of simple protonated molecules and peptides, the
reader is directed to several recent reviews [3-7].

FIGURE 8.56. Sequence ion nomenclature for a protonated tetrapeptide.


a₁ Ions


A common fragment ion that identifies the amino acid residue at the amino terminus is the
a₁ ion (see Equation 1). Several experimental and theoretical studies have been carried out on
simple systems ranging from glycine to simple dipeptides and tripeptides to determine the
mechanisms of these reactions, and key findings are discussed in this section.
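
To make the sequence ion nomenclature concrete, the Python sketch below computes singly charged a₁, b, and y ion m/z values for a tetrapeptide from standard monoisotopic residue masses. The peptide GAVL is a hypothetical example chosen for illustration, and the function name `fragment_ions` is not from the text. The b ions are taken as the sum of residue masses plus a proton, the y ions as the sum of residue masses plus water plus a proton, and a₁ as b₁ minus CO.

```python
# Standard monoisotopic residue masses (Da) for the 20 common amino acids.
RESIDUE_MASS = {
    "G": 57.02146, "A": 71.03711, "S": 87.03203, "P": 97.05276,
    "V": 99.06841, "T": 101.04768, "C": 103.00919, "L": 113.08406,
    "I": 113.08406, "N": 114.04293, "D": 115.02694, "Q": 128.05858,
    "K": 128.09496, "E": 129.04259, "M": 131.04049, "H": 137.05891,
    "F": 147.06841, "R": 156.10111, "Y": 163.06333, "W": 186.07931,
}
PROTON = 1.007276
WATER = 18.010565
CO = 27.994915


def fragment_ions(sequence: str) -> dict:
    """Return m/z values of the singly charged a1, b, and y ions of a peptide."""
    masses = [RESIDUE_MASS[residue] for residue in sequence.upper()]
    n = len(masses)
    if n < 2:
        raise ValueError("need at least two residues to form sequence ions")
    ions = {}
    running = 0.0
    for i in range(1, n):  # b ions grow from the amino terminus
        running += masses[i - 1]
        ions[f"b{i}"] = running + PROTON
    running = 0.0
    for i in range(1, n):  # y ions grow from the carboxyl terminus
        running += masses[n - i]
        ions[f"y{i}"] = running + WATER + PROTON
    ions["a1"] = ions["b1"] - CO  # a1 is b1 after loss of CO
    return ions


# Hypothetical tetrapeptide, chosen only for illustration.
ions = fragment_ions("GAVL")
for name in sorted(ions):
    print(f"{name}: {ions[name]:.4f}")
```

For an amino-terminal glycine the a₁ value is m/z 30.03, the iminium ion whose formation from glycine is examined in the sections that follow.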
Glycine (X = OH)


Three possible mechanisms for the formation of the a₁ ion of glycine are shown in Scheme 1,
and these can be distinguished by their neutral losses [8]. Each of these losses occurs via
different sites of protonation within glycine, which are governed by their relative "local proton
affinities." Mechanism A involves the 1,2 elimination of formic acid from N-protonated
glycine, 1, which is the thermodynamically favored site of protonation. In contrast, the other
two mechanisms involve intramolecular proton transfer to form CO-protonated glycine, 2 (the
second most favored site of protonation), or HO-protonated glycine, 3 (the least favored site of
protonation), which then fragment via dihydroxycarbene loss (Mechanism B) or the combined
losses of H₂O and CO (Mechanism C). Using CID ionization (CIDI), Wesdemiotis [8]
discovered neutral products due to H₂O and CO loss, consistent with Mechanism C, but could find
no neutral products that would support the other mechanisms (i.e., Mechanism A or B).


SCHEME 1. Mechanisms A, B, and C for formation of the a₁ ion from protonated glycine.
Support for Mechanism C comes from experimental [9] and theoretical [10] estimates
of the activation energy required to induce formation of the a₁ ion. Using collision-induced
threshold energy measurements, Klassen and Kebarle have shown that 43.9 kcal
mol⁻¹ is required to induce this reaction [9], which agrees well with a recent theoretical
estimate from ab initio calculations of 39.1 kcal mol⁻¹ [10]. In contrast, ab initio calculations
predict that Mechanism A has an activation barrier of >90 kcal mol⁻¹ and Mechanism B also has
a higher activation barrier (67.0 kcal mol⁻¹) [10].
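
Because collision energies are usually set in electron volts, it can help to see the barriers quoted above in other units. The Python sketch below converts them using 1 kcal = 4.184 kJ and 1 eV = 96.485 kJ mol⁻¹; the dictionary labels and function names are illustrative, and the value for Mechanism A is the lower bound given in the text.

```python
KJ_PER_KCAL = 4.184          # 1 kcal = 4.184 kJ (thermochemical calorie)
KJ_PER_MOL_PER_EV = 96.485   # 1 eV per particle = 96.485 kJ/mol


def kcal_to_kj(energy_kcal_per_mol: float) -> float:
    """Convert kcal/mol to kJ/mol."""
    return energy_kcal_per_mol * KJ_PER_KCAL


def kcal_to_ev(energy_kcal_per_mol: float) -> float:
    """Convert kcal/mol to eV per ion."""
    return energy_kcal_per_mol * KJ_PER_KCAL / KJ_PER_MOL_PER_EV


# Barriers quoted in the text (kcal/mol).
barriers = {
    "Mechanism C, threshold measurement [9]": 43.9,
    "Mechanism C, ab initio [10]": 39.1,
    "Mechanism B, ab initio [10]": 67.0,
    "Mechanism A, ab initio lower bound [10]": 90.0,
}

for label, kcal in barriers.items():
    print(f"{label}: {kcal:.1f} kcal/mol = "
          f"{kcal_to_kj(kcal):.1f} kJ/mol = {kcal_to_ev(kcal):.2f} eV")
```

On this scale the measured barrier for Mechanism C is about 1.9 eV, whereas Mechanisms B and A lie at roughly 2.9 eV and above 3.9 eV, respectively.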

Even a simple system such as protonated glycine thus demonstrates that several
experimental techniques further bolstered by theoretical calculations are required to prove
mechanistic details of fragmentation reactions. Furthermore, it demonstrates that the initial site
of protonation may not be that at which fragmentation occurs. Instead, "mobilization" of the
proton may be required to induce fragmentation at another site (i.e., Mechanism C) [6].
Further support for proton mobility under CID conditions comes from experiments on
[M+D]⁺ ions from simple amino acids such as alanine and leucine, which lose the combined
elements of (H₂O + CO) and (HOD + CO) in a statistical ratio. This finding indicates that
proton transfer from 1 to 3 is reversible [11].
Glycinamide (X = NH₂)


Using a combination of experiment and theory, Kinser et al. [12] have shown that a similar
mechanism (compare Mechanism C in Scheme 1) operates for the formation of the a₁ ion
from glycinamide. Reaction of neutral glycinamide with deuteron (D⁺) donors again
demonstrates the mobilization of the proton (both CH₂=NH₂⁺ and CH₂=NHD⁺ are observed). By
changing the energy of protonation, these authors were able to demonstrate that ≥35.8 kcal
mol⁻¹ is required to induce fragmentation, which is consistent with their own ab initio
prediction of 41.1 kcal mol⁻¹ as well as the activation energy of 44.2 kcal mol⁻¹ measured by
Klassen and Kebarle [9] using collision-induced threshold energy measurements.
Glycylglycine (X = NHCH₂CO₂H) and other dipeptides


Simple di- and tripeptides can also fragment to form a₁ ions directly, which are often in
competition with y₁ ions [9,12-15]. Recently, Paizs and Suhai have suggested the mechanism
shown in Scheme 2, which accounts for the combined formation of a₁ and y₁ ions in
protonated glycylglycine using ab initio calculations [12]. Thus, intramolecular proton transfer
from the thermodynamically favored amino-terminal protonated species 4 to the N-amide
protonated form 5 precedes expulsion of CO. Once CO departs, an ion-molecule complex
between the a₁ ion and the neutral amino acid is formed (6 in Scheme 2). This ion-molecule
complex, 6, can then fragment to give either the a₁ ion or the y₁ ion.

Their model can also be used to rationalize some interesting experimental results [13,14].
For example, Isa et al. have examined the competition between a₁ and y₁ ion formation in the
CID spectra of a series of protonated dipeptides: Xxx-Gly, Gly-Xxx, Xxx-Leu, Leu-Xxx
(where Xxx represents various amino acid residues) [13]. They found that for Gly-Xxx, both
the y₁ and a₁ ions were always observed, whereas the y₁ ion was rarely seen for Xxx-Gly, where
the a₁ ion usually dominated. When Leu was substituted for Gly, the abundances of the a₁ ion
increased for Leu-Xxx, as did the abundances of the y₁ ion for Xxx-Leu, suggesting that the
proton affinity differences between the conjugate bases within the ion-molecule complex 6 in
Scheme 2 have a role in determining their observed relative abundances. A study of the
protonated dipeptides Val-Xxx and Xxx-Phe found similar influences of the proton affinity
differences between the amino- and carboxy-terminal amino acids on the relative abundances of
y₁ and a₁ ions [14]. Once again, this is consistent with a competition for the proton within
the ion-molecule complex 6 in Scheme 2.


b₁ Ions


Why, when ion 5 fragments, are two bonds broken (at a and b in Scheme 3), forming an a₁
ion, as opposed to only one bond (at a) being broken to form a b₁ ion with an acylium
structure as shown? The answer comes from several experimental and theoretical studies, which
have shown that for simple aliphatic amino acids, the acylium ion structures of b₁ ions are
unstable with respect to CO loss [16].
Competition between bₙ Ions (n ≥ 2) and yₙ Ion Formation: Peptide Bonds as
Neighboring Groups


Given that the simple acylium ion structures of b₁ ions are unstable, the commonly
encountered bₙ ions (n ≥ 2) are unlikely to adopt a simple acylium ion structure. One of the first
studies to examine the structure of bₙ ions experimentally and theoretically is that of
Harrison et al. [17]. These authors found that a five-membered ring oxazolone structure is
favored. The role that this structure has in the fragmentation reactions of protonated GGG
has recently been examined via ab initio calculations [18]. Key features of this study are
shown in Scheme 4. Once again, intramolecular proton transfer is required to mobilize the
proton from the first amide bond of species 7 (it is interesting to note that this
O-protonated form is now favored over protonation at the amino terminus, indicating that internal
hydrogen bonding can influence the relative local proton affinities of various groups in
simple peptides) to the second amide bond to yield the N-protonated form, 8. The first amide
bond can then act as an internal nucleophile to induce cleavage of the amide bond to give an
ion-molecule complex between the oxazolone b₂ ion and the neutral amino acid, 9. This
intramolecular process is called a neighboring group process. Once again, this ion-molecule
complex 9 can fragment to form either a b₂ ion (path a of Scheme 4) or a y₁ ion (path b of
Scheme 4).

In Schemes 2 and 4, we have seen that y₁ ions can be formed in competition with other
a₁ and b₂ sequence ions. These y₁ ions are amino acids. Several studies have shown that
higher analogs (yₙ ions, where n ≥ 2) correspond to truncated peptides [19]. Thus, the mechanism
shown in Scheme 4 can be regarded as a general mechanism for the competition between y
and b ion formation in protonated peptides. More importantly, these neighboring group
processes have direct solution phase analogies for cleavage of peptide bonds under acidic
conditions [20].
Other Types of Processes Leading to bₙ and yₙ Ion Formation


The section above has shown how the adjacent peptide bond can facilitate peptide bond
cleavage via the formation of an oxazolone structure. The presence of certain residues
within peptides can enhance the effect of cleavage of adjacent peptide bonds. These types of
reactions can be further classified as follows:


• b₁ ion formation: Stabilized b₁ ions are observed for peptides that contain amino-terminal
lysine, histidine, and methionine residues [21]. These residues facilitate cleavage of the
adjacent peptide bond via a neighboring group pathway involving their nucleophilic side chains,
resulting in ring structures. Thus, these represent channels competing with those discussed
for a₁ ion formation.


• The proline effect: Enhanced amide bond cleavage occurs at the amino-terminal side of a
proline residue. This is known as the “proline effect” [22].


• The aspartic acid effect: The “aspartic acid effect” occurs when the amide bond
carboxy-terminal to an aspartic acid residue undergoes facile cleavage [23]. A possible mechanism is
shown in Scheme 5 and involves proton transfer from the adjacent acidic side chain to the
amide bond in 10 to form a salt bridge structure 11, which then undergoes cleavage to yield
the products shown. Indirect evidence for the salt bridge structure comes from experiments
in which the aspartic acid residues are converted to esters. This leads to a substantial
decrease in cleavage at the esterified aspartic acid residue [23]. Enhanced cleavage has also
been reported recently for the amide bond between Gln-Gly residues within peptides [24].
Nonsequence ions


Neighboring group processes similar to those described above for sequence ion formation
can also operate for the formation of nonsequence ions. For example, (1) a side-chain
nucleophile can attack a protonated backbone site of a peptide to liberate either H₂O or ammonia
or (2) the peptide backbone can attack a protonated side chain to induce fragmentation of
the side chain. The latter class of reaction can prove to be useful in identifying a
posttranslationally modified amino acid residue. For example, oxidized methionine loses CH₃SOH,
whereas phosphoserine loses H₃PO₄. Two recent publications discuss neighboring group
mechanisms associated with these losses [25,26].
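
The neutral losses named above are recognized in a spectrum by their characteristic mass differences. The Python sketch below computes the monoisotopic masses of H₂O, NH₃, CH₃SOH, and H₃PO₄ from standard atomic masses and shows the m/z shift expected for a precursor of a given charge. The precursor m/z of 1000.50 is a hypothetical value, and the function names are illustrative.

```python
# Standard monoisotopic atomic masses (Da).
ATOMIC_MASS = {
    "H": 1.00782503, "C": 12.0, "N": 14.0030740,
    "O": 15.9949146, "P": 30.9737620, "S": 31.9720707,
}

# Elemental compositions of the neutral losses named in the text.
NEUTRAL_LOSSES = {
    "H2O": {"H": 2, "O": 1},
    "NH3": {"N": 1, "H": 3},
    "CH3SOH (oxidized Met)": {"C": 1, "H": 4, "S": 1, "O": 1},
    "H3PO4 (phosphoserine)": {"H": 3, "P": 1, "O": 4},
}


def formula_mass(composition: dict) -> float:
    """Monoisotopic mass of a molecule given as {element: count}."""
    return sum(ATOMIC_MASS[element] * count for element, count in composition.items())


def mz_after_neutral_loss(precursor_mz: float, charge: int, loss_mass: float) -> float:
    """m/z of the product ion after a neutral loss; the charge is unchanged."""
    return precursor_mz - loss_mass / charge


loss_masses = {name: formula_mass(comp) for name, comp in NEUTRAL_LOSSES.items()}
for name, mass in loss_masses.items():
    print(f"{name}: {mass:.4f} Da")

# Hypothetical singly charged precursor, for illustration only.
product_mz = mz_after_neutral_loss(1000.50, 1, loss_masses["H3PO4 (phosphoserine)"])
print(f"product after H3PO4 loss: m/z {product_mz:.4f}")
```

A peak about 98 m/z units (for a 1+ ion) below the precursor therefore points to a phosphorylated residue, and one about 64 units below points to oxidized methionine.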
Summary of Important Concepts in Peptide Fragmentation Reactions


The preceding sections have illustrated how various tools can be used to reveal the key
concepts associated with the fragmentation mechanisms of protonated peptides. One of the most
useful tools, theoretical calculations, not only provides energetics but also allows the
visualization of reactants, intermediates, products, and the transition states connecting them.
From these studies, the following key concepts emerge:


• Under low-energy collision conditions, most fragmentations are directed by the site of the
charge.


• If cleavage occurs at multiple sites throughout the peptide, the charge must be allowed to
migrate from the initial site of protonation to the site of cleavage. This has been termed the
mobile proton model and is one of the central tenets in peptide fragmentation
mechanisms [7]. The energy required to mobilize the proton from the thermodynamically favored
site of protonation to less favored sites of protonation is provided by the collision energy
imparted to the ion.


• Once appropriate protonated precursors have been formed, cleavage of the amide bond can
be facilitated by a neighboring group pathway in which an adjacent nucleophile (amide
bond or nucleophilic side chain) facilitates cleavage.


• Initial cleavage yields an ion-neutral complex, and proton transfer within this complex can
account for the competition between formation of two types of ions (e.g., b and y ions).


• Arginine has the highest proton affinity of all the amino acids, and thus the proton of a
singly protonated peptide containing an arginine is sequestered to the arginine side chain.
As a consequence, the [M+H]⁺ ion of an arginine-containing peptide often fragments
poorly (i.e., the proton is not readily mobilized).


• Other side chains can introduce new fragmentation pathways that differ from those shown
in Schemes 1 and 2, facilitating peptide bond cleavage or yielding nonsequence ions due to
the loss of small molecules (such as NH₃ or H₂O) [4].
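
The arginine point in the list above can be expressed as a simple counting rule. The Python sketch below classifies a peptide ion by comparing the number of added protons with the number of arginine residues and with the total number of basic residues (Arg, Lys, His). This three-way classification is a commonly used rule of thumb and is an assumption of this sketch, not a scheme given in the chapter; the category names, function name, and test sequences are illustrative.

```python
def proton_mobility(sequence: str, charge: int) -> str:
    """Classify a protonated peptide by a simple proton-counting rule of thumb.

    nonmobile:        every added proton can be sequestered by an Arg side chain
    partially mobile: protons outnumber Arg but not the total basic residues
    mobile:           protons outnumber all basic residues (Arg, Lys, His)
    """
    if charge < 1:
        raise ValueError("charge must be a positive integer")
    seq = sequence.upper()
    n_arg = seq.count("R")
    n_basic = n_arg + seq.count("K") + seq.count("H")
    if charge <= n_arg:
        return "nonmobile"
    if charge <= n_basic:
        return "partially mobile"
    return "mobile"


# Hypothetical peptides used only to exercise the rule.
examples = [("GAVLR", 1), ("GAVLK", 1), ("GAVLA", 1), ("GAVLR", 2)]
for peptide, z in examples:
    print(f"{peptide} {z}+: {proton_mobility(peptide, z)}")
```

Under this rule a singly protonated arginine-containing peptide falls in the nonmobile class, matching the poor fragmentation described above, whereas adding a second proton to the same peptide leaves one proton free to migrate along the backbone.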


Future Developments 


With recent advances in mass spectrometry instrumentation, it is now becoming a realistic
proposition to sequence small proteins using a top-down approach. Some of the key concepts
introduced here are likely to have an important role in fragmentation pathways of protein
ions (see the information panel on “TOP-DOWN” PROTEIN SEQUENCE ANALYSIS USING MS/MS).
Other approaches for gaining structural information on peptides use CID on [M−H]⁻ ions
[27] or [M+Cat]⁺ (where Cat means a metal ion). The recently introduced electron capture
dissociation (ECD) method opens up new fragmentation pathways involving radical
chemistry [28].
WWW RESOURCES


For additional Web resources, see Tables 8.5 and 8.6 in this chapter. 


http://www.asms.org American Society for Mass Spectrometry. What is mass spectrometry? 

http://www.astbury.leeds.ac.uk/Facil/Mstut/mstutorial.htm An introduction to mass spectrometry, A.E.
Ashcroft, University of Leeds.

http://www.biotech.uiuc.edu/mass guide.htm Protein Analysis by MALDI-TOF Mass Spectrometry,
Guidelines for Sample Preparation, Protein Sciences Facility, University of Illinois at Urbana-Champaign

http://www.isotope.com/cil/index.html Cambridge Isotope Laboratories, Inc. homepage

http://www.srsmaldi.com Stanford Research Systems, Inc., MALDI

http://www.millipore.com/ziptip Millipore Corporation ZipTip pipette tips

http://www.srsmaldi.com/Maldi/Step/StepbyStep.html MALDI: Desalting and Purification

http://www.spectrastableisotopes.com Spectra Stable Isotopes homepage 

www.piercenet.com Pierce Biotechnology homepage 

www.ncbi.nlm.nih.gov/Genbank/index.html Submit to GenBank. National Center for Biotechnology 
Information (NCBI). 

www.expasy.ch ExPASy (Expert Protein Analysis System) Molecular Biology Server 
THE SYSTEMATIC ANALYSIS OF LARGE SETS OF PROTEINS isolated from tissues, cells, cellular
organelles, and protein complexes often includes studies of posttranslationally modified
proteins (Krishna and Wold 1993) (see Chapter 1). Posttranslational modification (PTM) may
encompass enzyme-catalyzed attachment of molecular entities onto the protein (e.g.,
phosphorylation, acylation, and glycosylation), formation of intramolecular cross-links (e.g.,
Cys-Cys disulfide bridges), as well as enzymatic processing of the protein (e.g., removal of
amino-terminal signal sequences). Such modifications maintain protein integrity, modulate protein
activity and interactions, or serve as targeting signals for delivery of proteins to specific
organelles inside the cell or for export across the cell membrane. The field of
“modification-specific proteomics,” for example, phosphoproteomics and glycoproteomics, will have a
major role in elucidating the highly complex molecular mechanisms in cells, tissues, and
whole organisms, because PTMs are not readily predicted from DNA or amino acid
sequences (Jensen 2000). Since covalent modifications of proteins lead in most cases to an
altered protein molecular mass, direct evidence for such modifications can be determined
accurately by mass spectrometry (MS) (Shou et al. 2002) (see Chapter 8).

Phosphorylation of proteins at specific amino acid residues is a key regulatory mecha- 
nism in cells. Phosphorylation of particular proteins by specific protein kinases controls basic 
cellular processes, such as cell cycle regulation, cell growth, and cell differentiation (Hunter 
2000; Schlessinger 2000; Blume-Jensen and Hunter 2001b). For a historical perspective of 
protein phosphorylation, see Krebs (1994) and Cohen (2002); for a discussion on the kinet- 
ic and catalytic mechanisms of protein kinases, see Francis and Corbin (1993), Adams (2001), 
and Johnson and Lewis (2001). Protein phosphatases catalyze the removal of phosphate 
groups from proteins, completing the regulatory cycle (Hunter 1995). Temporal and spatial 
regulation of cellular events and processes is achieved via reversible phosphorylation of pro- 
teins involved in signal transduction, gene regulation, and metabolism. 

Proteins are usually phosphorylated at specific serine, threonine, or tyrosine residues 
(Posada and Cooper 1992), but other phosphoamino acids exist (e.g., phosphoarginine,
phosphohistidine, and phospholysine) (see Figure 9.1) (Shrecker et al. 1975; Pas and 
Robillard 1988; Yan et al. 1998; Sickmann and Meyer 2001). The chemical stability of known 
phosphorylated amino acids is shown in Table 9.1. Of these, phosphoserine is by far the most 


TABLE 9.1. Chemical stability of phosphorylated amino acids

                                        Stability in
Nature of phosphoamino acid     Acid    Alkali    Hydroxylamine    Pyridine

O-Phosphates
  Phosphoserine                 +
  Phosphothreonine
  Phosphotyrosine               +

N-Phosphates
  Phosphoarginine               -       -         -                -
  Phosphohistidine              -       +         -                -
  Phospholysine                 -       +         -                -

Acylphosphates
  Phosphoaspartate              -       -
  Phosphoglutamate              -       -         -

S-Phosphates
  Phosphocysteine               (+)     +         +                +

Reprinted, with permission, from Sickmann and Meyer (2001), originally from Martensen (1971).
FIGURE 9.1. Chemical structures of phosphoamino acids.
common phosphoamino acid, followed by phosphothreonine and phosphotyrosine (Hunter
1998). Phosphorylation leads to a mass increase of 80 daltons (or more accurately, 79.9663
daltons) as a result of the addition of HPO3, usually donated by ATP or GTP in a process
catalyzed by protein kinases. Protein kinases exhibit sequence specificity, i.e., they recognize
serine, threonine, or tyrosine residues in the context of a particular sequence motif. For
example, protein kinase C (PKC) recognizes [ST]-x-[RK] (where S or T is the phosphorylation
site). Prediction of candidate phosphorylation sites is therefore possible to a certain extent
using bioinformatic tools, but this approach is insufficient to establish whether or not a
putative phosphorylation site is occupied in a given situation. However, phosphorylation site
prediction tools can be helpful in designing experiments aimed at direct determination of
phosphorylation sites in proteins.
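
The two quantitative ideas in this paragraph, the 79.9663-dalton mass increase per phosphate
group and motif-based prediction of candidate sites, can be illustrated with a minimal Python
sketch. The function names and the example sequence are hypothetical and serve only as an
illustration; the motif is the PKC consensus [ST]-x-[RK] quoted above. A match marks a
candidate site only and says nothing about whether the site is occupied.

```python
import re

# Mass increase per phosphate group (addition of HPO3), as quoted in the text.
PHOSPHO_SHIFT_DA = 79.9663

# PKC consensus motif [ST]-x-[RK]; the lookahead allows overlapping matches.
PKC_MOTIF = re.compile(r"(?=([ST].[RK]))")


def find_candidate_sites(sequence):
    """Return (1-based position, residue, matched motif) for each candidate site."""
    return [(m.start() + 1, sequence[m.start()], m.group(1))
            for m in PKC_MOTIF.finditer(sequence)]


def phosphorylated_mass(unmodified_mass, n_phosphates):
    """Expected mass after addition of n phosphate groups."""
    return unmodified_mass + n_phosphates * PHOSPHO_SHIFT_DA


if __name__ == "__main__":
    # Hypothetical sequence, for illustration only.
    for position, residue, motif in find_candidate_sites("AASPKLTTRG"):
        print(f"candidate site {residue}{position} in motif {motif}")
    print(phosphorylated_mass(1000.0, 2))
```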


Hundreds of protein kinases and phosphatases exist that differ in their substrate specifici- 
ties, kinetic properties, tissue distribution, and association with regulatory pathways. For 
details see the following Web Sites: 

http://www.cbs.dtu.dk/databases/PhosphoBase/

http://www.lecb.ncifcrf.gov/phosphoDB/

ProSite at http://ca.expasy.org/prosite/

NetPhos at http://www.cbs.dtu.dk/services/NetPhos/

ScanSite at http://scansite.mit.edu

For a compendium of information on protein kinases, computational and structural
resources, and links to other sites, see the Protein Kinase Resource
(http://pkr.sdsc.edu/html/index.shtml).


Labeling proteins with radioactive isotopes such as ³²P or ³³P is the most common
procedure for the analysis of phosphorylation sites. However, ineffective incorporation of label
due to the presence of endogenous ATP or GTP and the fact that radioactive labeling itself 
does not necessarily lead to the exact localization of the phosphorylation site warrant the 
need for alternative approaches. MS is increasingly becoming the method of choice for non- 
radioactive phosphorylation analysis (Aebersold and Goodlett 2001; Ahn and Resing 2001; 
Chen et al. 2002; Mann et al. 2002; Shou et al. 2002). 

Most MS-based strategies for identifying phosphorylation sites in proteins include the 
following stages: 


* Isolation and characterization of intact phosphoproteins.

* Generation of peptides by sequence-specific proteolytic cleavage of phosphoproteins.

* Characterization of the peptide mixtures and identification of the phosphopeptides by MS
and/or tandem MS (MS/MS).

* Phosphopeptide sequencing by MS/MS.


The remainder of this introduction describes these four stages in greater detail.
Techniques used in the first two stages are outlined next, followed by a more detailed
discussion of the latter two stages, emphasizing optimized protein chemistry techniques combined
with matrix-assisted laser desorption/ionization (MALDI) and electrospray ionization (ESI)
MS (for an overview of methods for enrichment, identification, and sequencing of
phosphopeptides, see Figure 9.2). Methods for purification and functional analysis of
phosphoproteins and protein kinases are beyond the scope of this chapter (see, e.g., Aebersold and
Goodlett 2001). The introduction closes with several examples of multidimensional
phosphoprotein characterization and a look at several emerging MS techniques for sequencing
phosphopeptides.
FIGURE 9.2. Summary of techniques for enrichment, identification, and sequencing of
phosphopeptides. (Flow chart panels: SAMPLE PREPARATION, complex mixture of phosphorylated and
nonphosphorylated proteins; IDENTIFICATION OF PHOSPHOPROTEINS/PEPTIDES AND SITES OF
PHOSPHORYLATION; identification and sequencing of phosphopeptide(s) by MALDI-TOF and/or
MS/MS.) For additional information on mass spectrometry methods and tandem mass spectrometry,
see Chapter 8.


DETECTION AND ANALYSIS OF INTACT PHOSPHOPROTEINS 


y a variety of tech- 
raphy, and detergent extraction/solubilization techniques. Typically, SDS-polyacrylamide gel
electrophoresis (SDS-PAGE) or two-dimensional (2D) PAGE (Ji et al. 1993; Gatti and Traugh
1999) (see Chapters 2 and 4) is used as a final purification step when the goal is to identify
and characterize the primary structure and natural modifications of small amounts of
phosphoprotein (low microgram or nanogram levels) (Powell et al. 2000; Aebersold and Goodlett
2001; Godovac-Zimmermann and Brown 2001; Larsen et al. 2001; Mann et al. 2001). A variety
of methods for detecting phosphorylated proteins will be discussed in turn.


Phosphoprotein Detection by ³²P Radiolabeling


Proteins can be labeled in vivo using [³²P]orthophosphate (Ji et al. 1993) or in vitro using
purified protein kinase and [γ-³²P]ATP (Zhao et al. 1989; Aebersold et al. 1991; Boyle et al.
1991; Wettenhall et al. 1991; Yan et al. 1998). Proteins metabolically labeled in vivo or in vitro
with ³²P can be detected on 2D gels or one-dimensional (1D) SDS-PAGE by autoradiography
or phosphorimaging. After incorporation, the protein is chemically or enzymatically
cleaved, and the phosphopeptides are separated by high-performance liquid chromatography
(HPLC) (Craig 2001; see also Chapter 5) or 2D thin-layer chromatography (TLC) and
detected by scintillation counting or autoradiography. Phosphopeptide-containing HPLC
fractions or excised TLC spots are subjected to phosphoamino acid analysis (Sickmann and
Meyer 2001; Blume-Jensen and Hunter 2001a) (see also Appendix 2) and analyzed by Edman
sequencing, monitoring for a loss of radioactivity at each cycle of the Edman degradation 
(MacDonald et al. 2002) (see also Chapter 6, Protocol 5). Because there is typically insuffi- 
cient sample to detect the unlabeled amino acids (even by ESI-MS or MALDI-MS technolo- 
gies), the phosphorylation site must be deduced by comparing the Edman cycle number at 
which the radioactivity is released with a list of predicted peptides to identify those that con- 
tain serine, threonine, or tyrosine at the same position in the peptide sequence. (Note that 
traditional methods such as TLC and Edman sequencing are limited in that they typically 
require radiolabeled samples and either prior knowledge of the protein sequence or the abil- 
ity to make assumptions about the peptide sequence. Incorporation of radioactive phosphate 
into the target protein may alter the physiological state of the cells, for example, by inducing 
expression of the tumor suppressor protein p53 [Bond et al. 1999]. If the protein is already 
fully phosphorylated in vivo, further addition of phosphate will not occur, and the results 
can therefore be misleading.) 


Phosphoprotein Detection by Western Blotting 


Phosphorylated proteins can be detected, following their electrophoretic separation, by
western blotting using antibodies against the phosphoamino acids. This is one of the most
sensitive and widely used techniques for detecting specific phosphorylation sites (Arad-Dann et
al. 1993; Abu-Lawi and Sultzer 1995). However, even when the phosphoamino acid is part of
the recognition sequence, in some cases, the antibody may not recognize the epitope due to 
steric hindrance within the recognition site. 

If sufficient amounts of a phosphoprotein are present in an acrylamide gel for MS analy- 
sis, then the protein can be proteolytically digested in situ in the gel (see Chapter 7, Protocol 
17) and the phosphopeptide (including the phosphorylation site) identified. (Typically, phos- 
phorylated peptides are of low abundance in a peptide mixture and need to be enriched prior 
to MS analysis.) It should be noted that detailed characterization of a phosphoprotein
currently requires significantly more material than is normally used for protein identification
alone (see Characterization of Phosphopeptides by MS or MS/MS below).
Intact Phosphoprotein Analysis


MS is very useful for accurate molecular-mass determination of solubilized, intact phospho- 
proteins. Such an analysis indicates whether a protein is actually modified and to what extent, 
as well as establishing the homogeneity of the sample. For mass analysis of intact proteins, 
the preparation should preferably contain only low millimolar amounts of salts and other
low-molecular-weight contaminants, because these interfere with sample preparation and
ionization in MS.

Detailed analysis of protein modifications is facilitated by having a homogeneous
phosphoprotein sample; i.e., little or no contaminating protein species should be present. If in
doubt, a fraction of the intact phosphoprotein sample should be checked for homogeneity by 
SDS-PAGE or MS. When relatively high amounts of sample are available (tens of micrograms), 
a solubilized sample of phosphoprotein is preferred, but buffers and detergents that are incom- 
patible with MS must be avoided (Jensen et al. 1997) (for methods to remove MS-interfering 
contaminants, see Chapter 8). Recovery of intact protein by elution from the polyacrylamide 
gel matrix after SDS-PAGE for subsequent MS analysis by MALDI or ESI is sometimes possi- 
ble, but usually requires microgram amounts of protein to be successful (Haebel et al. 1995). 

Both ESI (Fenn et al. 1989) and MALDI (Karas and Hillenkamp 1988) MS can be used 
for intact molecular-mass determination of phosphoproteins (see, e.g., Weijland et al. 1997; 
Garnier et al. 2001; Merrick et al. 2001). Analysis of phosphoproteins is preferably per- 
formed in a "differential" experiment where two identical phosphoprotein aliquots are 
investigated in parallel: One sample is treated with a phosphatase enzyme (e.g., alkaline 
phosphatase) and the other left intact (see Protocol 2) (Annan and Carr 1997). Both sam- 
ples are analyzed by MS, and on the basis of the mass difference (if any) between the two 
samples, it can be established whether the protein is phosphorylated and the number of 
phosphate groups on the protein can be quantitated. In a similar experiment, a protein sam- 
ple can be incubated in vitro with a kinase and ATP and another identical but untreated 
sample serves as the control. 
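
The arithmetic behind the differential experiment can be sketched as follows. This is a minimal
Python illustration, not a validated procedure: the function name, the example masses, and the
5-dalton tolerance are assumptions chosen for illustration, and a realistic tolerance depends on
the mass accuracy of the instrument at the mass of the intact protein.

```python
# Mass change per phosphate group (HPO3), in daltons.
PHOSPHO_SHIFT_DA = 79.9663


def count_phosphates(mass_untreated, mass_phosphatase_treated, tolerance_da=5.0):
    """Estimate the number of phosphate groups from two intact-mass measurements.

    Returns None when the mass difference is not close to a whole multiple of
    79.9663 Da, which suggests another modification or a measurement problem.
    """
    delta = mass_untreated - mass_phosphatase_treated
    n = round(delta / PHOSPHO_SHIFT_DA)
    residual = delta - n * PHOSPHO_SHIFT_DA
    if n < 0 or abs(residual) > tolerance_da:
        return None
    return n


if __name__ == "__main__":
    # Hypothetical masses: untreated aliquot versus phosphatase-treated aliquot.
    print(count_phosphates(25239.9, 25000.0))
```

A result of None is a prompt to check the sample for other modifications before interpreting
the shift as phosphorylation.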


GENERATION OF PEPTIDES FROM PHOSPHOPROTEINS BY PROTEOLYTIC CLEAVAGE 


Peptides are usually produced by cleavage of the protein with a sequence-specific protease,
but a range of chemical agents are also available. Trypsin is a favorite protease in most
laboratories due to its high activity and specificity, and because it generates peptides of a size
appropriate for mass analysis, i.e., in the range of 500-4000 daltons. Additionally, it is
advantageous for MS/MS analysis to have a highly basic amino acid residue at one of the termini,
because this greatly simplifies the fragmentation pattern in MS/MS experiments (see the information
panel on FRAGMENTATION MECHANISMS OF PROTONATED PEPTIDES IN THE GAS PHASE in Chapter 8).
Endoproteinases Lys-C, Asp-N, and Glu-C are also very useful proteases for the generation of
phosphopeptides. A number of protocols for enzymatically digesting solubilized proteins or
proteins isolated by gel electrophoresis have been established (see, e.g., Shevchenko et al.
1996; Jensen et al. 1999).
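
A theoretical tryptic digest, filtered to the 500-4000-dalton range mentioned above, can be
sketched in a few lines of Python. The sketch rests on assumptions that are not part of the text:
it applies the commonly used rule that trypsin cleaves carboxy-terminal to lysine or arginine
except when the next residue is proline, it ignores missed cleavages and modifications, and it
uses standard monoisotopic residue masses. The function names and the example sequence are
hypothetical.

```python
import re

# Standard monoisotopic residue masses (daltons).
MONOISOTOPIC_RESIDUE_MASS = {
    "G": 57.02146, "A": 71.03711, "S": 87.03203, "P": 97.05276, "V": 99.06841,
    "T": 101.04768, "C": 103.00919, "L": 113.08406, "I": 113.08406, "N": 114.04293,
    "D": 115.02694, "Q": 128.05858, "K": 128.09496, "E": 129.04259, "M": 131.04049,
    "H": 137.05891, "F": 147.06841, "R": 156.10111, "Y": 163.06333, "W": 186.07931,
}
WATER = 18.01056  # added once per peptide for the free termini


def tryptic_digest(sequence):
    """Split after K or R unless the next residue is P (assumed rule, no missed cleavages)."""
    return [p for p in re.split(r"(?<=[KR])(?!P)", sequence) if p]


def peptide_mass(peptide):
    """Monoisotopic mass of the neutral, unmodified peptide."""
    return sum(MONOISOTOPIC_RESIDUE_MASS[aa] for aa in peptide) + WATER


def peptides_in_window(sequence, low=500.0, high=4000.0):
    """Tryptic peptides whose mass falls within the useful range for mass analysis."""
    return [(p, round(peptide_mass(p), 4)) for p in tryptic_digest(sequence)
            if low <= peptide_mass(p) <= high]


if __name__ == "__main__":
    # Hypothetical sequence, for illustration only.
    print(tryptic_digest("AGKPSRTLK"))
    print(peptides_in_window("AGKPSRTLK"))
```

Splitting on a zero-width pattern with re.split requires Python 3.7 or later.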


CHARACTERIZATION OF PHOSPHOPEPTIDES BY MS OR MS/MS 


Analysis of protein phosphorylation typically involves: 
* Detection and identification of the phosphorylated peptides in a complex mixture.

* Unambiguous localization of phosphorylation site(s) by peptide sequencing.
Whether only the first step or both steps must be performed will depend on the level of
information that is desired. For certain biological questions, it is sufficient to detect and
identify the phosphopeptides, which can be accomplished using the peptide mass
fingerprinting (PMF) strategy outlined in Chapter 8. However, a comprehensive phosphorylation
analysis requires detection and exact localization of the phosphorylation sites, and MS/MS 
experiments must be conducted to accomplish this (for details on MS/MS, see Chapter 8). 
Knowledge of the amino acid(s) phosphorylated in vivo in a protein and the surrounding 
amino acid sequence (i.e., the sequence motif) can often be used to identify the kinase(s) and 
phosphatase(s) responsible for catalyzing the specific reaction. This information, in turn, is 
central to understanding the functional significance of any phosphorylation event in a cell. 

The most significant problem encountered in phosphorylation peptide mapping is hav- 
ing sufficient amounts of sample. More sample than is required for protein identification by 
MS is necessary for two reasons: (1) Many times, knowing the sequence of the protein is a 
prerequisite for modification mapping. (2) For modification mapping, the modified peptides 
must be present in the sample and observable in the mass spectrum to enable the phospho- 
rylation site analysis, whereas for identification purposes, any set of peptides can be sufficient 
as long as the criteria for an unambiguous database search are fulfilled. Phosphorylated pro- 
teins can be present in cells at very low concentrations. In such cases, the amount of phos- 
phoprotein available for analysis may challenge the limits of even the most sensitive MS sys- 
tems. This problem can be further compounded by a low stoichiometry of phosphorylation 
at a given site (Katze et al. 2000); i.e., only a small fraction of the population of molecules of 
the protein of interest is phosphorylated at a particular site. Often, multiple differentially 
phosphorylated forms of a protein exist, which also complicates site-specific analysis (Storm 
and Khawaja 1999). Thus, the analysis of the in vivo phosphorylation state of a protein often 
presents a formidable technical challenge. 

For MS analysis of phosphorylated samples, MALDI-time of flight (TOF)-MS has proven
to be very useful because it is one of the most sensitive methods for the detection of phos- 
phopeptides (after appropriate enrichment or treatment of the protein sample), and it is fast 
and relatively easy to perform. However, as MALDI-TOF-MS analysis of phosphopeptides 
does not readily provide sequence information, it relies completely on relating expected and 
observed m/z values of ion signals. Therefore, this approach has serious limitations if addi- 
tional modifications or nonspecific enzyme cleavages occurred, because both give rise to 
unexpected peptide masses. Under these circumstances, additional experiments, such as phos- 
phatase treatment, are required to confirm the presence of phosphopeptides (Protocol 2). 
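
Relating expected and observed masses, as described above, amounts to testing each observed
peptide mass against each expected unmodified mass plus a whole number of phosphate groups.
The following Python sketch illustrates the idea under stated assumptions: the function name,
the peptide labels, the masses, the limit of three sites, and the 0.2-dalton tolerance are all
hypothetical. As the text points out, a match on mass alone is only a candidate assignment and
needs confirmation, for example by phosphatase treatment.

```python
# Mass increase per phosphate group (HPO3), in daltons.
PHOSPHO_SHIFT_DA = 79.9663


def assign_phospho_candidates(observed_masses, expected_masses, max_sites=3, tol_da=0.2):
    """Match observed masses to expected peptide masses plus 0..max_sites phosphates.

    expected_masses maps a peptide label to its unmodified mass.
    Returns a list of (observed mass, peptide label, number of phosphates).
    """
    hits = []
    for observed in observed_masses:
        for label, mass in expected_masses.items():
            for n in range(max_sites + 1):
                if abs(observed - (mass + n * PHOSPHO_SHIFT_DA)) <= tol_da:
                    hits.append((observed, label, n))
    return hits


if __name__ == "__main__":
    expected = {"pepA": 1000.50, "pepB": 1500.75}  # hypothetical unmodified masses
    observed = [1000.52, 1160.43, 1700.00]         # hypothetical measured masses
    for obs, label, n in assign_phospho_candidates(observed, expected):
        print(f"{obs:.2f} matches {label} with {n} phosphate group(s)")
```

Observed masses that match nothing are left unassigned, which is the situation created by
additional modifications or nonspecific cleavages.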


Problems Associated with MS Analysis of Phosphopeptides 


Mass spectrometric analysis of phosphopeptides is complicated by the presence of phospho
groups in three ways.

* The ionization efficiency of phosphopeptides in the positive-ion mode is strongly reduced
when compared with their nonphosphorylated counterparts, which is thought to be the
result of increased hydrophilicity and acidity because of the phospho group (Liao et al.
1994). Even though MS analysis of phosphopeptides can suffer from suppression effects,
this method has still proven to be very useful in mapping posttranslational phosphorylation
of proteins (Annan and Carr 1997; Qin and Chait 1997). The advantages of the MS-based
approach include generally rapid analysis and the fact that it does not require
radiolabeling. Both MALDI-MS and ESI-MS have been used to determine the phosphorylation
sites on proteins (Liao et al. 1994; Annan and Carr 1996, 1997; Carr et al. 1996; Qin and
Chait 1997; Immler et al. 1998; Zhang et al. 1998; Asara and Allison 1999; Neubauer and
Mann 1999). Two recent studies have quantified the differences in ionization efficiencies
for MALDI and ESI in positive-ion mode. For MALDI, the response factor for a particular
nonphosphorylated peptide was ten times higher than that for the tyrosine-phosphorylated
counterpart, a result that is valid for the majority of phosphopeptides (Chen et al. 2001).
To compensate for this discrimination in ionization during MALDI, ammonium salts such
as ammonium citrate can be added (Asara and Allison 1999). The difference in ionization
efficiencies is not as great for ESI. Miliotis et al. (2001) determined an intensity ratio of
1:1.5:5 for equimolar amounts of the doubly, singly, and nontyrosine-phosphorylated
peptide ALGADDSYYTAR. Therefore, the ion signals of phosphopeptides are normally only of
low abundance when present in a crude mixture with nonphosphorylated peptides.
Recently, two groups independently showed that phosphopeptides are sometimes more
efficiently ionized by MALDI in negative-ion mode as compared with nonphosphorylated
peptides. This difference in ionization efficiencies in positive- and negative-ion modes can
be used to identify phosphopeptides in simple mixtures of peptides (Janek et al. 2001; Ma
et al. 2001).


* Phosphorylated amino acid residues can interfere with enzymatic cleavage behavior.
Studies have shown that phosphoserine or phosphothreonine residues in the +2 position
downstream from the cleavage site (R/K-X-pS/pT) significantly reduce the cleavage rate of
trypsin, the most widely used enzyme for protein characterization by MS (Benore-Parsons
et al. 1989; Schlosser et al. 2000). The reduced cleavage efficiency results in larger peptides,
which make MS analysis and MS/MS sequencing more complicated.


* Phosphopeptides easily lose the phospho moiety during mass spectrometric experiments.
This is especially true for serine- and threonine-phosphorylated peptides, which readily
lose phosphoric acid (H3PO4) upon low-energy collision-induced dissociation (CID). Even
laser irradiation for MALDI experiments can be sufficient to initiate the gas-phase loss of
phosphoric acid. Although this characteristic feature can sometimes be used for the
identification of phosphopeptides in mixtures (Annan and Carr 1996), it generally causes
problems for the mass spectrometric analysis, due to the additional decrease in sensitivity in
MALDI-MS and the complicated fragmentation pattern in MS/MS studies.
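
Two of the complications listed above lend themselves to a short numerical illustration: tryptic
sites that are likely to be cleaved slowly because a phosphoserine or phosphothreonine lies at
the +2 position, and the m/z at which a precursor ion appears after losing phosphoric acid. The
Python sketch below is illustrative only. The function names, example sequence, phosphosite
positions, and the 0.5 m/z tolerance are assumptions; 97.9769 daltons is the monoisotopic mass
of H3PO4, and the observed loss in m/z is that mass divided by the charge state.

```python
H3PO4_DA = 97.9769  # monoisotopic mass of phosphoric acid, in daltons


def hindered_tryptic_sites(sequence, phospho_positions):
    """1-based positions of K/R residues followed at +2 by a phosphorylated S or T.

    phospho_positions is a set of 1-based positions known or assumed to be phosphorylated.
    """
    hindered = []
    for pos, residue in enumerate(sequence, start=1):
        target = pos + 2
        if (residue in "KR" and target <= len(sequence)
                and target in phospho_positions
                and sequence[target - 1] in "ST"):
            hindered.append(pos)
    return hindered


def neutral_loss_mz(precursor_mz, charge, loss_da=H3PO4_DA):
    """m/z of the precursor ion after a neutral loss (charge state unchanged)."""
    return precursor_mz - loss_da / charge


def shows_neutral_loss(precursor_mz, charge, fragment_mzs, tol=0.5):
    """True if any fragment lies at the expected neutral-loss m/z within tol."""
    expected = neutral_loss_mz(precursor_mz, charge)
    return any(abs(mz - expected) <= tol for mz in fragment_mzs)


if __name__ == "__main__":
    # Hypothetical peptide with assumed phosphorylation at S5 and T10.
    print(hindered_tryptic_sites("AAKGSLLRATPK", {5, 10}))
    # Hypothetical doubly charged precursor at m/z 800.0.
    print(neutral_loss_mz(800.0, 2))
```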


Circumventing the Problem of Ionization Suppression of Phosphopeptides


There are several ways to circumvent the difficulties associated with ionization suppression 
of phosphopeptides and the lability of the phosphoester group. 


Derivatization of phosphopeptides 


Some of the derivatization strategies are based on methods devised in the early 1970s to avoid
problems caused by alkylphosphates during peptide/protein sequencing by Edman degradation
(for an early example, see Kolesnikova et al. 1974). The basic idea is to replace the labile,
highly acidic phosphate group by a more stable, less acidic moiety. In the case of
phosphoserine and phosphothreonine residues, this can be accomplished by performing a
β-elimination reaction under strong alkaline conditions. The resulting unsaturated amino acid
residue (dehydroalanine or 2-aminodehydrobutyric acid) can be analyzed either directly by MS
(Resing et al. 1995) or following a Michael addition with a nucleophile such as ethanethiol
performed prior to mass spectrometric analysis of the phosphopeptides (Meyer et al. 1986;
Lapko et al. 1997; Jaffe et al. 1998; Weckwerth et al. 2000; see also Appendix 2). The
derivatization strategies, including their advantages and disadvantages, are summarized in Table 9.2.
Immunological methods for enriching phosphopeptides


Another way to circumvent the problem of ionization suppression is to enrich the sample for 
phosphopeptides. This approach reduces the excess (or bulk) of unmodified peptides that 
can suppress ionization. One enrichment strategy utilizes immunological methods, such as 
antibodies against phosphoamino acids, in immunoprecipitation experiments. However, at
the moment, only anti-phosphotyrosine antibodies are of sufficient quality for this purpose,
which limits immunoprecipitation to phosphotyrosine-containing species. Several studies have
been published in which phosphoproteins have been immunoprecipitated prior to separation
of the proteins by PAGE, protein analysis, and phosphorylation mapping (Gold et al. 1994;
Godovac-Zimmermann et al. 1999; Pandey et al. 2000a,b). Although this approach should in
theory also work for immunoprecipitation of phosphopeptides, a recent study showed that the
yield of phosphopeptides was only ~2% (Marcus et al. 2000).


Enrichment of phosphopeptides by immobilized metal ion 
affinity chromatography 


Phosphopeptides can also be enriched by employing immobilized metal ion affinity chro- 
matography (IMAC). Suppression effects can be greatly reduced by selectively enriching for 
phosphopeptides on metal ion affinity media prior to MS analysis. This approach makes use 
of the high affinity of phosphate groups toward specific trivalent metal ions, in particular Fe³⁺
and Ga³⁺ (Andersson and Porath 1986; Nuwaysir and Stults 1993; Olcott et al. 1994; Posewitz
and Tempst 1999; Zhou et al. 2000; Stensballe et al. 2001). Two types of metal-chelating sta- 
tionary phases, iminodiacetic acid (IDA) and nitrilotriacetic acid (NTA), are coupled to a 
support, such as Sepharose, agarose, or macroporous silica, permitting affinity chromatogra- 
phy based on the noncovalent binding of phosphopeptides (Neville et al. 1997). IMAC can 
be highly efficient, selective, and sensitive, but it also shows limitations, including poor 
adsorption of certain phosphopeptide species and nonspecific adsorption of nonphosphory- 
lated peptides (Ahn and Resing 2001). This often necessitates efficient means of verification 
of phosphopeptide candidates such as enzymatic dephosphorylation or MS/MS (Larsen et al. 
2001; Stensballe et al. 2001). An example of IMAC enrichment in combination with alkaline 
phosphatase treatment, MS, and MS/MS analysis is shown in Figure 9.3. Several methods for 
using IMAC to enrich for phosphoproteins prior to protein analysis by MS are given in 
Protocols 1, 3, and 4. IMAC can be utilized both on-line and off-line followed by MS analy- 
sis directly or coupled to either capillary electrophoresis or LC systems. 

Off-line micro-IMAC. IMAC-μLC-ESI-MS/MS has been used successfully even though
optimal low-level analysis cannot be easily achieved due to the relative disparity between
IMAC and LC flow rates. This disparity can be linked in part to the necessity of using IMAC
columns of much higher binding capacity than the subsequent μLC step to avoid overloading
and subsequent loss of phosphopeptides. IMAC performed off-line from LC-ESI-MS
permits a wide range of sample volumes to be loaded (1-100 μl) using a sample pressure vessel
(Goodlett et al. 2000b) or an autosampling system equipped with a sample-trapping
cartridge (Hayashi et al. 2001) (see Protocols 4 and 5). The use of a pressure vessel (Figure 9.4)
is preferred because (1) the sample moves directly from the vial containing the sample onto
the IMAC column, avoiding nonspecific sample loss that occurs by pipetting a sample, and
(2) the sample does not come in contact with stainless steel fluidics tubing that will selectively
remove acidic peptides such as phosphopeptides.

Whether using a pressure cell or gel-loader tips (see Chapter 8, Protocol 2), the
IMAC-purified peptides can then be injected into an LC separation system and analyzed by
LC-ESI-MS/MS as described in Protocol 5 and Chapter 8 (see also Becker et al. 1998; Katze et al.
2000). Note that phosphopeptides behave and elute similarly to nonphosphorylated peptides
during RP-HPLC, but because of a decrease in hydrophobicity due to the addition of the
FIGURE 9.3. Phosphopeptide recovery by Fe(III)-IMAC from phosphoprotein isolated by gel electrophoresis. Mouse
platelet-derived growth factor receptor a/b (PDGF-R) was isolated by SDS-PAGE and in-gel-digested with trypsin. (a) MALDI
peptide mass map recorded from the crude tryptic peptide mixture using 2,5-DHB as the matrix. Signals from PDGF-R peptides
(open circles) were evident. (b) Purification of phosphopeptides by nanoscale Fe(III)-IMAC followed by off-line MALDI mass
analysis of the eluate using 2,5-DHB as the matrix. A number of peptides can be assigned as nonphosphopeptides (open circles),
several as putative phosphopeptides (triangles) based on mass accuracy alone, and some remain unassignable (question mark).
(c) Alkaline phosphatase treatment of the analyte/matrix deposit from which the spectrum in panel b was obtained. The
phosphatase assay was employed, and the resulting peptide mixture was analyzed by MALDI-MS using 2,5-DHB matrix. Mass shifts of
-80 daltons corresponding to removal of one phosphate group confirmed the modified peptides. (d) Recovery of the
analyte/matrix deposit from which the spectra in panels b and c were obtained followed by nanoscale desalting and
nanoelectrospray MS/MS confirmed the identity of the dephosphorylated peptide PDGF-R c (residues 736-748). (Courtesy of Hanno
Steen, Allan Stensballe, and Ole N. Jensen, University of Southern Denmark.)
FIGURE 9.4. Diagram of IMAC column and pressure cell. Photograph of an IMAC column (A) set up for
use with a pressure cell and diagram of pressure cell cross-section (B) with photograph of top view of pres- 
sure cell in use (C). (Courtesy of Garry L. Corthals, Valerie C. Wasinger, and David R. Goodlett.) 


phosphate moiety, they will generally elute before the nonphosphopeptide with the identical 
amino acid sequence. This difference in elution time between phosphopeptide and non- 
phosphopeptide of identical amino acid sequence is most pronounced for short peptides. 
Thus, when analyzing phosphopeptides on-line with ESI-MS/MS analysis, it is a good idea to 
acquire data during the wash step (i.e., prior to gradient elution) because some short
phosphopeptides may not even bind to the C18 resin in 100% aqueous solutions. Otherwise, these
very hydrophilic peptides will be missed.


The following are typical μLC conditions for the separation of IMAC-enriched
phosphopeptides on-line with ESI-MS/MS:

* Column: Use a 50-μm (I.D.) microcapillary column for optimum sensitivity or a
100-μm (I.D.) column for optimum ruggedness, pressure packed with, for instance,
ODS-AQ (Waters Corp.).

* Solvents: Solvent A is aqueous 0.4% acetic acid/0.005% heptafluorobutyric acid
(HFBA)/H2O. Solvent B is acetonitrile only, because addition of acid here is not
necessary for good chromatography (it is used with UV detection to avoid a rise in the
baseline with increasing acetonitrile) and adds substantially to chemical noise in the mass
spectrometer.

* Gradient: Use a linear gradient from 0% to 60% Solvent B over 50 minutes.

* Flow rate: The flow rate depends on the column used but, in general, is ~200 nl/min and
500 nl/min across 50- and 100-μm (I.D.) columns, respectively. This is achieved by
restrictive flow splitting using a length (empirically determined) of capillary tubing set
in a tee before the capillary column. This setup allows normal-flow HPLC pumps capable
of operating between 0.5 and 4.0 ml/min to be used for microcapillary column operation.
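
The gradient and flow-splitting figures in the list above imply some simple arithmetic, sketched
here in Python. The function names are hypothetical, and the sketch assumes an ideal linear
gradient with no delay volume and an ideal splitter; in practice the restrictor length is
determined empirically, as stated above.

```python
def percent_b(t_min, start=0.0, end=60.0, duration_min=50.0):
    """Percentage of Solvent B at time t for a linear gradient (ideal, no delay volume)."""
    t = min(max(t_min, 0.0), duration_min)  # clamp to the gradient window
    return start + (end - start) * t / duration_min


def split_ratio(pump_flow_ml_min, column_flow_nl_min):
    """Ratio of pump flow to column flow needed from the flow splitter."""
    return pump_flow_ml_min * 1e6 / column_flow_nl_min  # 1 ml = 1e6 nl


if __name__ == "__main__":
    # Midpoint of the 0-60% B gradient over 50 minutes.
    print(percent_b(25.0))
    # A pump at 0.5 ml/min feeding a 50-um column at ~200 nl/min.
    print(split_ratio(0.5, 200.0))
```

With a pump delivering 0.5 ml/min and ~200 nl/min through the column, almost all of the flow
goes to waste through the restrictor, which is why the splitter is placed before the column.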
FIGURE 9.5. Off-line IMAC and MS/MS. Flow diagram (a) of off-line IMAC followed by μLC-ESI-MS/MS sequencing of a
phosphopeptide. Tryptic digest of complex phosphoprotein sample was separated by IMAC using a pressure cell as in Figure 9.4.
Pooled phosphopeptides underwent further fractionation by μLC on-line to ESI-MS. The post-column split (a) sent 95% of the
sample to microfuge tubes for fractionation by time to produce a radioactive profile (b) and the remainder went to the mass
spectrometer for sequence identification by database search using the tandem mass spectrum (c). (Courtesy of Garry L. Corthals,
Valerie C. Wasinger, and David R. Goodlett.)


Peptides eluting from the column are introduced into the MS via a micro-ESI interface
that can be built in the laboratory (Goodlett et al. 2000b) or purchased (contact
garry.corthals@dim.hcuge.ch). Peptides are subsequently selected for CID using automated
data-dependent MS/MS protocols available from most MS manufacturers. Where phosphopeptides
are radiolabeled with ³²P, it is recommended that a post-column flow-splitter be integrated
into the micro-ESI device so that a portion of each ³²P-labeled peptide is captured into a
microtiter plate. The recovered ³²P-labeled peptides are used to create a radioactive profile
versus chromatographic time by counting each fraction without scintillant (Gallis et al. 1999;
Katze et al. 2000) as shown in Figure 9.5. The radioactivity profile generated this way is
useful for confirming the elution time of the phosphopeptides and, under certain circumstances,
for quantifying the amount of ³²P-labeled peptide present in specific fractions (Goodlett et
al. 2000a). As ESI-MS is essentially a concentration-dependent detection method (Goodlett et
al. 1993; Wahl et al. 1993), post-column flow-splitting only minimally reduces the detection
sensitivity. ³²P-labeled peptide samples collected off-line have also been successfully further
separated and analyzed by solid-phase extraction capillary electrophoresis (SPE-CE) connected
on-line with ESI-MS/MS. "Peak parking" during capillary electrophoresis (i.e., data-dependent
modulation of electrophoretic voltage) has been found to be extremely advantageous for the
detection of very low-abundance phosphopeptides by data-dependent MS/MS (Figeys et al. 1999).
This advantage is achieved because a reduction of the voltage slows electrophoretic migration
and therefore increases the time available for the mass spectrometer to detect and select
peptides for CID without adversely affecting sensitivity (Goodlett et al. 1993). This technique
has been applied to the analysis of in vivo phosphorylation sites of endothelial nitric oxide
synthase (Gallis et al. 1999). The utility of this method is highlighted in Figure 9.6. The
system consists of the following:


* Off-line IMAC/HPLC pre-enrichment setup for phosphopeptides as described above.

* SPE-CE device for peptide concentration and separation.

* ESI-MS/MS instrument (e.g., ThermoFinnigan TSQ) that can be controlled via software
protocols (e.g., ICL from ThermoFinnigan) such that capillary electrophoretic voltage is
modulated in a data-dependent fashion in response to ion selection for CID; i.e., when a
peptide is selected for CID, the ICL protocol decreases the capillary electrophoretic voltage
(for ICL protocols, see Geneva Proteomics Web Site www.expasy.org/people/gpc.html or the
companion Web Site to this book).


In laboratories where a tandem mass spectrometer is unavailable for sequence analysis of
phosphopeptides by CID, phosphatase treatment (Zhang et al. 1998) of IMAC-enriched peptide
samples has been used in conjunction with MALDI-TOF analysis to identify the phosphopeptides
in the sample (see Protocols 2 and 3). In these experiments, the collective masses of all
peptides enriched by micro-IMAC are first recorded in the single-stage MALDI-TOF-MS. Next,
the same sample is treated with a broad specificity phosphatase (e.g., calf intestinal
phosphatase) in solution, or the identical sample still on the MALDI probe is treated, and
masses are recorded after an incubation time appropriate to the phosphatase being used. The
two MALDI-TOF mass spectra for the original untreated and phosphatase-treated samples are
then compared to identify any mass that shows a decrease of 80 daltons (or multiples
thereof), implying loss of phosphate due to phosphatase treatment. This experiment is
technically simple, and the interpretation of the spectra is straightforward because
MALDI-TOF-MS produces almost exclusively singly protonated peptide ions.
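
The comparison of the untreated and phosphatase-treated spectra can be sketched in a few lines. This is a minimal illustration, not a published procedure: the peak lists are invented, the function name is hypothetical, and the 0.3-dalton tolerance and four-site limit are assumptions that would need adjusting to the instrument and sample.

```python
HPO3 = 79.9663  # monoisotopic mass (Da) removed per phosphate by phosphatase


def find_dephosphorylation_shifts(untreated, treated, tol=0.3, max_sites=4):
    """Return (mass_before, mass_after, n_phosphates) for each pair of singly
    protonated peaks whose difference matches n x HPO3 within tol daltons."""
    hits = []
    for m_before in untreated:
        for m_after in treated:
            for n in range(1, max_sites + 1):
                if abs((m_before - m_after) - n * HPO3) <= tol:
                    hits.append((m_before, m_after, n))
    return hits


# Hypothetical [M+H]+ peak lists before and after phosphatase treatment.
untreated = [1045.52, 1327.61, 2061.83]
treated = [1045.52, 1247.65, 1901.90]

for before, after, n in find_dephosphorylation_shifts(untreated, treated):
    print(f"{before:.2f} -> {after:.2f}: {n} phosphate group(s)")
```

Because MALDI produces almost exclusively singly protonated ions, the shift can be searched for directly in the m/z values without any charge-state correction.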

Online micro-IMAC. The analysis of phosphopeptides using IMAC on-line with an MS
or MS/MS system is desirable because the number of sample handling steps is decreased and
thus peptide loss is minimized. Although the advantages of using IMAC to selectively enrich
for phosphopeptides are evident, direct coupling of IMAC columns to an ESI-MS instrument
is undesirable, because the solvent conditions used to elute phosphopeptides from the IMAC
resin are not directly compatible with high-sensitivity ESI-MS. Successful use of on-line
IMAC with ESI-MS/MS has nevertheless been achieved with both capillary electrophoresis
(Cao and Stults 1999) and µLC (Affolter et al. 1994). In general, conditions for on-line IMAC
are very similar to those used for off-line except that the eluted peptides pass directly onto
the µLC column (Affolter et al. 1994). In a clever integration of the enrichment/separation
system by IMAC-CE-ESI-MS/MS, the selective enrichment of casein phosphopeptides and their
analysis was performed on-line using an ion trap that allowed low-picomole levels of
sensitivity. To accomplish this, the IMAC column (5 cm × 150 µm I.D., 360 µm O.D.) was fitted
over a smaller (75 cm × 75 µm I.D., 150 µm O.D.) capillary electrophoresis column containing
a low-binding PVDF frit, and a 1-cm bed of activated Fe(III) Poros beads was pulled into
the IMAC capillary by applying a vacuum.


Chemical derivatization and selective enrichment 


Three approaches that combine both derivatization and selective enrichment of phosphorylated
species were recently introduced by Oda et al. (2001), Zhou et al. (2001), Gosche et al.
(2001), and Steen and Mann (2002). The methods of Oda et al. (2001) and Gosche et al.
(2001) are limited to the isolation of phosphoserine and phosphothreonine, because they rely
on β-elimination of phosphate (see Figure 9.7) and the introduction of a biotin moiety by
Michael addition. The derivatized peptides can be selectively purified based on the high
affinity of avidin to biotin. The method of Steen and Mann (2002) also makes use of
β-elimination/Michael addition reactions to introduce a functional group at the original site
of phosphorylation, which gives rise to a dimethylamine-containing sulfenic acid derivative
with a characteristic fragment ion at m/z 122.06 upon low-energy CID. This enables detection
of the phosphorylated species within complex peptide mixtures by sensitive and specific
precursor ion-scanning mode (see Chapter 8 and Figure 9.8) (Steen and Mann 2002). The method
of Gosche et al. (2001) allows quantification of phosphopeptides between two different
mixtures of proteins using differential stable isotope labeling. The method and reagent name,
phosphoprotein isotope-coded affinity tags (PhIAT), are derived from a new class of
compound, isotope-coded affinity tags (ICAT), that allow the determination of relative
abundance of proteins from two different biological states (Gygi et al. 1999). Zhou et al.
(2001) made use of phosphoramidate chemistry to introduce free thiol groups, which facilitated
immobilization of the phosphopeptides on iodoacetyl-activated beads. For more details on
these methods, see Table 9.2. Of these three new methods, only the method of Zhou et al.
(2001) allows for the chemical isolation of phosphoserine and phosphothreonine, as well as
phosphotyrosine. Quantitation without isolation had been demonstrated previously using
this approach (Weckwerth et al. 2000). It is expected that these new phosphopeptide-selective
methods will eventually gain widespread use and perhaps eliminate the use of IMAC resins
for phosphopeptide isolation.

FIGURE 9.6. Modulation of capillary electrophoretic voltage in a data-dependent fashion. The
sequence of events involved in peak parking with solid-phase extraction capillary
electrophoresis (SPE-CE) on-line with a mass spectrometer capable of data-dependent ion
selection for collision-induced dissociation (CID). The base peak ion current trace (a) is
shown above a trace of the electrophoretic voltage that was modulated based on data-dependent
ion selection for CID; i.e., a decrease in the voltage accompanies data-dependent ion
selection to provide more time for CID. A full-scan mass spectrum (b) acquired prior to
selection of the ion at 844.2 m/z for CID and (c) the tandem mass spectrum of the
phosphorylated peptide selected in b. (Courtesy of Garry L. Corthals, Valerie C. Wasinger,
and David R. Goodlett.)

FIGURE 9.7. Phosphoamino acid structures before and after phosphate elimination. (Courtesy of
Garry L. Corthals, Valerie C. Wasinger, and David R. Goodlett.)

FIGURE 9.8. Derivatization strategy for serine-phosphorylated peptides. Serine-phosphorylated
peptides undergo β-elimination upon treatment with strong bases. The resulting Michael system
reacts with the nucleophilic thiol group of 2-dimethylaminoethanethiol, which introduces an
additional highly basic functional group at the former phosphorylation site. Controlled
oxidation of the thioether to the sulfoxide is accomplished by short incubation with 3% H₂O₂.
The generated 2-dimethylaminoethanesulfoxide derivative gives rise to a characteristic
fragment ion (sulfenic acid derivative) at m/z 122.06 upon low-energy CID. (Reprinted, with
permission, from Steen and Mann 2002 [©Elsevier Science].)


Mass Spectrometry of Phosphopeptides


There are a number of different mass spectrometric methods for detecting phosphopeptides
in complex peptide mixtures and for determining which amino acid residue(s) in a peptide
is phosphorylated. These fall into two general categories. The first relies on the propensity
of the phosphoester bonds of phosphoserine, phosphothreonine, and, to a lesser extent,
phosphotyrosine to undergo fragmentation in a mass spectrometer. This type of fragmentation
can be induced in the collision cell (Figure 9.9a) or in the ion source region (Figure 9.9b)
of ESI instruments or during post-source decay (PSD) in a MALDI-MS (for further details, see
Chapter 8). Phosphopeptides that lose phosphate can then be identified by implementing
one of several possible phosphate-specific diagnostic ion scans, which include precursor ion
scans, neutral loss scans, and in-source dissociation. The phosphate-specific diagnostic ions
(i.e., H₂PO₄⁻, 97 daltons; PO₃⁻, 79 daltons; and PO₂⁻, 63 daltons) generated by ESI in
negative-ion mode during in-source CID (Katta et al. 1991) can be monitored to identify
phosphopeptides (Huddleston et al. 1993; Hunter and Games 1994). In general, methods that
produce some sort of phosphate-specific ion (i.e., a diagnostic ion) are useful in cases where
incorporation of ³²P is not possible or in which the radiolabel has decayed past the point of
detection (Meyer et al. 1993).

FIGURE 9.9. Diagrammatic comparison of CID in a collision cell and in-source. (a) MS/MS where
a mass selection device (Q1) purifies a single m/z that is fragmented in a collision cell (Q2)
by the standard CID process to produce a product ion mass spectrum containing fragment ions
specific to the parent ion selected by Q1. (b) In-source CID at atmospheric pressure without
mass selection to produce a mass spectrum in Q1 that contains fragments from all ions that
entered the source at a given moment in time. Note that arrows with different lines represent
ions at a unique m/z and that the process in panel b is often used in diagnostic ion scanning
procedures described in the text. (Courtesy of Garry L. Corthals, Valerie C. Wasinger, and
David R. Goodlett.)
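
The nominal masses of the phosphate-specific diagnostic anions (97, 79, and 63 daltons) follow directly from their elemental compositions. The short sketch below recomputes them from standard monoisotopic atomic masses; the helper name is hypothetical and the snippet is meant only as a check of the arithmetic.

```python
# Monoisotopic atomic masses (Da) and the electron mass.
MASS = {"H": 1.007825, "O": 15.994915, "P": 30.973762}
ELECTRON = 0.000549

DIAGNOSTIC_IONS = {
    "H2PO4-": {"H": 2, "P": 1, "O": 4},
    "PO3-": {"P": 1, "O": 3},
    "PO2-": {"P": 1, "O": 2},
}


def anion_mz(composition):
    """m/z of a singly charged anion: summed atomic masses plus one electron."""
    return sum(MASS[element] * count for element, count in composition.items()) + ELECTRON


for name, composition in DIAGNOSTIC_IONS.items():
    print(f"{name}: m/z {anion_mz(composition):.3f}")
```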

The second type of detection method is based on the additional mass of a peptide that
has been phosphorylated. Typically, in protein phosphorylation studies, the amino acid
sequence of the protein investigated is known. Therefore, phosphopeptides derived from the
protein can, in principle, be detected by a net mass differential of 80 amu that occurs when
phosphate is added to the side chains of serine, threonine, or tyrosine. Thus, a peptide mass
map of the proteolysed phosphoprotein can potentially identify the phosphorylated peptide
by comparison to the theoretical peptide map. Neither method, however, identifies the
phosphorylated amino acid residue(s) within the peptide directly, except in cases where the
peptide sequence contains only a single possible phosphorylation site, in which case the
phosphorylated residue is effectively located by default. Generally, the precise amino acid
that is phosphorylated is most readily identified by CID in an ESI-MS/MS instrument. Often,
the information obtained by the specific scanning methods is designed to differentiate
phosphopeptides from nonphosphopeptides, rather than to provide sequence information. The
remainder of this section describes several types of MS-based approaches for phosphopeptide
detection and sequence analysis following IMAC enrichment. For optimum results, it is
recommended that the mass spectrometer be specifically tuned with phosphopeptide standards
before using any of the methods described below.


Direct MS analysis of phosphoproteins and peptides bound to solid supports 


Elution of phosphopeptides from IMAC columns before MS analysis can result in significant
sample loss. However, affinity-bound analytes, including phosphopeptides, can be directly
analyzed by MALDI-MS without a separate elution step from the affinity media (Papac et al.
1994; Raska et al. 2002). For example, affinity-bound phosphoproteins and phosphopeptides,
like human apotransferrin and phosphokemptide, could be analyzed directly from Fe³⁺-loaded
Sepharose using MALDI-MS. (It has recently been reported that MALDI-TOF-MS
analysis of IMAC-retained phosphopeptides is in fact due to ligand displacement of the
monophosphopeptide-NTA chelate, rather than laser-induced direct desorption from the
chelate [Hart et al. 2002], and that IMAC-retained phosphopeptides can be solubilized by
solutions of CHCA and DHB, with 2,5-DHB being more effective in obtaining a larger subset
of monophosphopeptides.) The utility of this method has been extended by the observation
that consecutive enzymatic reactions, such as phosphatase or carboxypeptidase Y digestion,
can be carried out on affinity-bound peptides (Papac et al. 1994; Qian et al. 1999; Li and Dass
1999; Zhou et al. 2000; Merrick et al. 2001) (see Protocol 3). When the affinity-bound
phosphopeptides are treated with phosphatase, the number of phosphorylation sites can be
determined based on the observation of 80-dalton (or multiples of 80-dalton) mass shifts in
the MALDI-MS of the reaction mixture. Carboxypeptidase Y treatment of the affinity-bound
phosphopeptides can also be used to cleave the amino acids from the carboxyl terminus, with
subsequent direct analysis of the enzymatic products by MALDI-MS to locate the
phosphorylation sites on the bound phosphopeptides. Cleavage of amino acids from the amino
terminus with aminopeptidase does not work with IMAC, because metal ions are required for this
enzyme's activity, and the IMAC media removes the metals from the solution. High sensitivity,
elimination of the need for radiolabeling or HPLC separation, ease of use, and the ability
to analyze extremely complicated phosphopeptide mixtures make this method attractive.


Lability of phospho-moiety in phosphopeptides: A blessing or a curse? 


Removal of the phospho-moiety and the selective enrichment of the phosphorylated species
are strategies devised to circumvent the problems caused by the labile nature of the
phosphate esters, which is often considered a curse. However, the lability of the
phospho-moiety can also be considered a blessing, because the loss of HPO₃ or H₃PO₄ provides
a characteristic "signature" for phosphopeptides, which can be utilized in MS/MS-based
approaches for selective detection of phosphopeptides present in crude peptide mixtures. The
neutral loss of H₃PO₄ (98 daltons) or HPO₃ (80 daltons) can be used for
phosphopeptide-specific constant neutral loss experiments (Schlosser et al. 2001). This
approach can also be applied to on-line LC-MS/MS experiments, where the data are screened "on
the fly" for ion pairs with mass differences corresponding to the loss of the phospho-moiety.
All peptide ion spectra can be screened after their acquisition for this characteristic peak
pair discrepancy, to identify those product ion spectra that are derived from
phosphopeptides. A distinct disadvantage of this method, however, is the bias against
tyrosine-phosphorylated species resulting from the relatively high stability of
arylphosphates with respect to the loss of phosphoric acid (-98 daltons) as compared to
alkylphosphoesters.
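
Screening acquired product ion spectra for the phospho-specific loss amounts to looking for a prominent peak at the precursor m/z minus 98/z or 80/z. The following is a minimal sketch under stated assumptions: the function name, the 0.5 m/z tolerance, and the 20% relative-intensity cutoff are all hypothetical choices, and the example spectrum is invented.

```python
H3PO4 = 97.9769  # monoisotopic mass of phosphoric acid (Da)
HPO3 = 79.9663   # monoisotopic mass of metaphosphoric acid (Da)


def neutral_loss_hits(precursor_mz, charge, peaks, tol=0.5, min_rel_intensity=0.2):
    """Return the names of phospho-specific neutral losses seen as prominent peaks.

    peaks is a list of (mz, intensity) pairs from one product ion spectrum.
    A loss counts only if its peak reaches min_rel_intensity of the base peak.
    """
    if not peaks:
        return []
    base_intensity = max(intensity for _, intensity in peaks)
    hits = []
    for name, loss in (("H3PO4", H3PO4), ("HPO3", HPO3)):
        target = precursor_mz - loss / charge  # the loss is divided by the charge state
        for mz, intensity in peaks:
            if abs(mz - target) <= tol and intensity >= min_rel_intensity * base_intensity:
                hits.append(name)
                break
    return hits


# Hypothetical spectrum of a doubly charged precursor at m/z 700.30.
spectrum = [(420.20, 150.0), (651.32, 1000.0), (660.30, 90.0)]
print(neutral_loss_hits(700.30, 2, spectrum))
```

In this invented spectrum only the peak at 651.32 qualifies; the peak near the HPO₃ loss position is present but falls below the intensity cutoff. Tyrosine-phosphorylated peptides would often be missed by such a screen, for the reason given in the text.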

Less discriminating is the use of the characteristic anion at 79 daltons (PO₃⁻) observed
in negative-ion mode, which is generated by all phosphopeptides. Hence, this characteristic
"reporter" ion can be used in precursor ion experiments to identify phosphopeptides in crude
peptide mixtures. This approach is reliable, highly sensitive, and very specific. It must be
kept in mind, however, that peptide sequencing is not efficient in negative-ion mode;
therefore, the instrument must be switched to the positive-ion mode and the pH must be
adjusted before sequencing the phosphopeptide by MS/MS (Carr et al. 1996; Wilm et al. 1996).

In the special case of tyrosine-phosphorylated peptides, a characteristic "reporter" cation
at 216.043 daltons can be used for precursor ion experiments in positive-ion mode. The
highest selectivity is achieved on high-accuracy, high-resolution tandem mass spectrometers,
such as the quadrupole TOF-type instrument (Steen et al. 2001a,b). An example of a
phosphotyrosine-specific precursor ion experiment is shown in Figure 9.10.

FIGURE 9.10. Mass spectrometric analysis of a protein digest for the detection of
tyrosine-phosphorylated peptides. (a) Mass spectrum of the tryptic in-gel digest of Bcr/Abl
protein, eluted from an Oligo R3 microcolumn with 40% methanol/5% formic acid. (b) The
specific precursor of m/z 216.045 ± 0.020 experiment. (c) Product ion spectrum of the doubly
charged tryptic peptide Tc (pY)SLTVAVK) at m/z 480.7 from which the "reporter" cation (m/z
216.045) was derived. (Courtesy of Hanna Steen, Allan Stensballe, and Ole N. Jensen,
University of Southern Denmark.)
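
The value of the phosphotyrosine "reporter" cation can be checked from its elemental composition, and the narrow acceptance window illustrates why mass accuracy matters. The sketch below assumes the reporter is the phosphotyrosine immonium ion with composition C8H11NO4P+; the helper names are hypothetical, and the ±0.020 window is the one quoted in the legend to Figure 9.10.

```python
# Monoisotopic atomic masses (Da) and the electron mass.
MASS = {"C": 12.0, "H": 1.007825, "N": 14.003074, "O": 15.994915, "P": 30.973762}
ELECTRON = 0.000549

# Assumed composition of the phosphotyrosine immonium ion, C8H11NO4P+.
PTYR_IMMONIUM = {"C": 8, "H": 11, "N": 1, "O": 4, "P": 1}


def cation_mz(composition):
    """m/z of a singly charged cation: summed atomic masses minus one electron."""
    return sum(MASS[element] * count for element, count in composition.items()) - ELECTRON


def in_reporter_window(mz, center=216.043, tol=0.020):
    """True if a fragment m/z falls inside the precursor ion selection window."""
    return abs(mz - center) <= tol


reporter = cation_mz(PTYR_IMMONIUM)
print(f"calculated reporter m/z: {reporter:.4f}")
print("accepted" if in_reporter_window(reporter) else "rejected")
```

A window this narrow only helps if the instrument can actually resolve fragments a few hundredths of a dalton apart, which is why a quadrupole TOF-type analyzer is preferred for this experiment.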

In-source CID


Several MS methods for detecting phosphopeptides in complex mixtures rely on MS/MS (see
Protocols 7 and 8; for further details on MS/MS and instrument configuration, see also
Chapter 8). One of the early successful methods used fragmentation in the ion source region
of ESI instruments. These methods used a higher than normal difference in voltage across the
lenses (see Figure 9.9b) or other devices used for desolvation/focusing of ions, together with 
the higher than normal gas pressure present in the ion source region of an atmospheric pres- 
sure ionization device. By these means, peptide fragmentation is effected similar to that pro- 
duced in a collision cell. However, as depicted in Figure 9.9b, no ion selection is available, and 
thus all ions that enter simultaneously are fragmented together. It was thus important to cor- 
relate loss of phosphate in the source region with a chromatographic trace. If carried out as 
done by Carr and co-workers (1996) in a triple-quadrupole (TQ) mass spectrometer with a 
nozzle-skimmer-type interface design in the ion source region (e.g., SCIEX TQ), then both 
the chromatographic marker and the phosphopeptide molecular weight can be determined 
in the same scan. The generation of fragment ions was accomplished by use of a high orifice
potential across the nozzle-skimmer prior to Q1 while the low m/z range was scanned for
the low-mass diagnostic ions. When the TQ began to scan across higher m/z, then the noz- 
zle-skimmer potential was returned to a normal voltage that did not induce fragmentation. 
This allowed both the diagnostic ion scan and the scan to determine peptide molecular 
weight to be done in a continuously alternating fashion through the chromatographic analysis.
A similar experiment can also be done on instruments in which a heated capillary-skimmer
design (e.g., ThermoFinnigan TQ) is used in place of the nozzle-skimmer (Aebersold et
al. 1998). Here, an alternating scan approach was used, where selected ion monitoring of
appropriate diagnostic ions at a high octapole offset voltage was followed by two full scans. 
The first full scan was conducted at the same high offset voltage as the single-ion monitoring 
(SIM) experiment, providing signals for the deprotonated phosphopeptide molecular ion 
and the phosphopeptide molecular ion minus phosphate. The second full scan was conduct- 
ed at a normal octapole offset voltage to provide a reference to the full scan at high octapole 
offset. This series of MS scans was repeated continuously throughout the LC separation. Such 
an experiment provided the same information as the method of Carr and co-workers (1996), 
but because SIM was used for the diagnostic ion detection, rather than simply scanning 
across the low m/z range, the protocol had a lower duty cycle and was potentially more sen- 
sitive. The first full scan at high octapole offset was compared to the full scan at low octapole 
offset to provide an indication as to which peptide ion was phosphorylated. This was
particularly useful if several peptides coelute, a situation that is common if complex peptide
mixtures are separated by µLC. Such techniques generally achieved a sensitivity limit of a few
fmoles for phosphopeptide standards loaded on the column. However, the sensitivity of
detection of “real” peptide samples generated by the digestion of in vitro or in vivo phos- 
phorylated proteins usually was in the picomole range. 

It would be advantageous to detect phosphopeptides by the release of phosphate-spe- 
cific diagnostic ions in negative-ion mode and then switch to positive-ion mode for a CID 
of the detected phosphopeptide. This switch between negative- and positive-ion mode is 
necessary because the phosphate-specific fragment ions are difficult to detect in positive- 
ion mode and because negative-ion CID spectra generally produce insufficient fragment 
ions for sequence elucidation. With current instruments, rapid switching between negative- 
and positive-ion mode is technically difficult to achieve. It can be anticipated that this prob- 
lem will be overcome in the future as better electronics are developed. 

Neutral loss scanning


Neutral loss scanning for phosphopeptide detection and analysis was first described by Covey
et al. (1991a,b) and further developed by Carr and co-workers (Huddleston et al. 1993). It was
carried out in positive-ion mode with ESI in a triple-quadrupole MS. Instead of using Q1 to
select specific ions for fragmentation in Q2, Q1 and Q3 were scanned such that Q1 and Q3
m/z scan ranges were offset by the neutral mass of interest (e.g., 49 amu for loss of phosphate
from a [M+2H]²⁺ parent ion) (see also Chapter 8). The two quadrupoles were then continuously
scanned to monitor for loss of the neutral fragment. As shown in Figure 9.7, only
phosphoserine and phosphothreonine may undergo neutral loss of 98 amu by β-elimination (Gibson
and Cohen 1990; Bateman et al. 2002). Phosphotyrosine remains intact under these conditions
because the α-carbon proton that must be abstracted by a lone pair of electrons from the
phosphate moiety is now too far removed. The method has not been as popular as the
aforementioned in-source CID methods because of false positives and the need to know the
charge state of the phosphate-containing peptide ion. An advantage of the method is that it is
carried out in positive-ion mode and can be used with data-dependent scanning to acquire CID
spectra for phosphopeptide sequencing and determination of the site of phosphorylation in the
same experiment as a neutral loss scan for the detection of phosphorylated peptides.
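
The dependence of the Q1/Q3 offset on charge state, which is the reason the charge of the phosphopeptide ion must be known, is simple arithmetic. A minimal sketch (the function name is hypothetical):

```python
H3PO4 = 97.9769  # monoisotopic mass of phosphoric acid (Da)


def neutral_loss_offset(charge, neutral_mass=H3PO4):
    """m/z offset between Q1 and Q3 for loss of a neutral from an ion of given charge."""
    if charge < 1:
        raise ValueError("charge must be a positive integer")
    return neutral_mass / charge


for z in (1, 2, 3):
    print(f"[M+{z}H]{z}+: Q3 trails Q1 by {neutral_loss_offset(z):.2f} m/z units")
```

A scan set up with the offset for doubly charged ions (about 49) will therefore miss singly and triply charged phosphopeptides, and any unrelated fragment that happens to lie 49 m/z units below its precursor appears as a false positive.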


Precursor ion scanning 


In this method, negative-ion ESI is carried out with continuous scanning of Q1. All ions are
fragmented in Q2, and Q3 passes only one ion, which for phosphopeptides is usually m/z 79
(i.e., loss of PO₃⁻) (see also Chapter 8). Consequently, the resultant full-scan mass spectrum
shows only ions that lost m/z 79 (Wilm et al. 1996; Neubauer and Mann 1999). This greatly
simplifies mixture analysis and is best done during direct infusion with a nanospray source.
As described for the phosphate diagnostic ion scans, there is a problem associated with
sequencing in positive-ion mode immediately after detecting the loss of phosphate in
negative-ion mode. This method requires a mass spectrometer with a collision cell positioned
between two mass separation devices such as a TQ or a quadrupole TOF (qTOF) arrangement
(see Chapter 8).

A novel precursor ion discovery method using an ESI hybrid quadrupole orthogonal 
acceleration TOF (QTOF) mass spectrometer has been described recently (Bateman et al. 
2002). In this study, the observed phosphotyrosine immonium ion at m/z 216.043 was used 
to discover phosphopeptides containing phosphotyrosine. (A similar approach for identifying
phosphotyrosine-containing phosphopeptides has been described using a MALDI-qTOF
mass spectrometer [Bennett et al. 2002].)


PHOSPHOPEPTIDE SEQUENCING BY MS/MS 


During low-energy CID, the amide bonds of peptides fragment to yield y- and b-type ion 
series (see the information panel on FRAGMENTATION MECHANISMS OF PROTONATED PEPTIDES IN 
THE GAS PHASE in Chapter 8). The b fragments contain the original amino terminus of the 
peptide, whereas y-fragment ions include the original carboxyl terminus (for the nomencla- 
ture of peptide fragment ions, see Chapter 8) (Roepstorff and Fohlmann 1984; Biemann 
1988). The mass difference between two consecutive ions of the same peptide fragment ion 
series corresponds to the mass of an amino acid residue, which allows the sequence of the 
parent peptide to be determined. Phosphopeptides are sequenced in the same manner, tak- 
ing into account the mass alteration of serine, threonine, and tyrosine residues due to the 
phosphorylation (+80 daltons). However, as serine- and threonine-phosphorylated peptides
normally show a predominant loss of phosphoric acid (H₃PO₄, 98 daltons) under low-energy
CID conditions, resulting from gas-phase β-elimination of the labile alkylphosphoester,
the unsaturated amino acid residues must be considered as well. Therefore, the position of
phosphoserine residues can be identified based on the mass difference of two successive
fragments (of the same fragment ion series) of 167 daltons (phosphoserine) or 69 daltons
(dehydroalanine) if a β-elimination reaction occurred.

Phosphothreonine residues can be localized in the same manner, i.e., based on a fragment
mass difference of 181 daltons (phosphothreonine) or 83 daltons (2-aminodehydrobutyric
acid) after β-elimination. Because phosphotyrosine cannot undergo β-elimination reactions,
tyrosine-phosphorylated peptides are stable under low-energy CID conditions. The
location of phosphotyrosine residues can be determined by the mass difference of two
successive peptide fragment ions of 243 daltons. Precursor ion scans, in conjunction with a
nanoelectrospray source, provide a sensitive MS/MS tool for the detection of phosphorylated
peptides and the determination of phosphorylation sites (Neubauer and Mann 1999;
Craig 2001; Bateman et al. 2002; Bennett et al. 2002) (see Protocols 7 and 8).
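
The residue mass differences quoted above (167/69, 181/83, and 243 daltons) can be derived from standard monoisotopic residue masses and then used to read a fragment ion ladder. The sketch below is illustrative only: the labels (pS, dhA, pT, dhB, pY), the function name, the 0.02-dalton tolerance, and the y-ion series are assumptions made for the example, and only a handful of unmodified residues are included.

```python
# Monoisotopic residue masses (Da) for a small subset of amino acids.
RESIDUE = {
    "G": 57.02146, "A": 71.03711, "S": 87.03203, "P": 97.05276, "V": 99.06841,
    "T": 101.04768, "L": 113.08406, "K": 128.09496, "Y": 163.06333,
}
HPO3 = 79.96633  # added on phosphorylation
H2O = 18.01056   # net change of Ser/Thr after loss of H3PO4 from the phosphorylated residue

MODIFIED = {
    "pS": RESIDUE["S"] + HPO3,   # phosphoserine, ~167
    "dhA": RESIDUE["S"] - H2O,   # dehydroalanine, ~69
    "pT": RESIDUE["T"] + HPO3,   # phosphothreonine, ~181
    "dhB": RESIDUE["T"] - H2O,   # 2-aminodehydrobutyric acid, ~83
    "pY": RESIDUE["Y"] + HPO3,   # phosphotyrosine, ~243
}


def annotate_ladder(fragment_mz, tol=0.02):
    """Assign a residue to each gap between consecutive singly charged fragment
    ions of one series; unmatched gaps are reported as '?'."""
    table = {**RESIDUE, **MODIFIED}
    calls = []
    for low, high in zip(fragment_mz, fragment_mz[1:]):
        delta = high - low
        matches = [name for name, mass in table.items() if abs(mass - delta) <= tol]
        calls.append(matches[0] if matches else "?")
    return calls


# Hypothetical y-ion series (y1 to y4) of a peptide ending in ...A-pS-L-K.
print(annotate_ladder([147.1128, 260.1969, 427.1952, 498.2323]))
# The same series where the fragment has lost H3PO4 (dehydroalanine at the site).
print(annotate_ladder([147.1128, 260.1969, 329.2184]))
```

Leucine and isoleucine have identical masses, so a real annotation table would need to report them together; the subset here sidesteps that by listing only leucine.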


MULTIDIMENSIONAL STRATEGIES FOR PHOSPHOPROTEIN CHARACTERIZATION 


For comprehensive phosphorylation analysis, one method alone is often insufficient. This
was demonstrated by Vihinen and Saarinen in their study of the phosphorylation of Semliki
Forest virus nonstructural protein 3 (Vihinen and Saarinen 2000). In addition to traditional
biochemical methods such as ³²P-labeling, TLC separation, and Edman degradation for
sequencing, they used IMAC purification and several mass spectrometric methods including
precursor-ion scanning (m/z 79) and MALDI analysis before and after alkaline phosphatase
treatment to confirm 8 phosphorylation sites and to localize up to 12 additional
phosphorylation sites, depending on the degree of phosphorylation. Other studies in which a
combination of techniques was used for phosphorylation analysis include the following:


* Zhang et al. (1998) used MALDI-TOF analysis before and after alkaline phosphatase
treatment of an in-gel protein digest to identify the phosphorylated peptides. Subsequently,
LC-MS and LC-MS/MS with preset m/z values corresponding to the expected m/z values of the
identified phosphopeptides were performed for an unambiguous localization of the
phosphorylation site.

* Cao and Stults (1999, 2000) used a plug of IMAC material to retain the phosphopeptides
generated by in-gel digestion of phosphoproteins. All peptides bound to the IMAC plug
were eluted into a capillary electrophoresis column for separation. This setup was
combined on-line with a mass spectrometer, allowing for the unambiguous identification of the
phosphopeptides including the site of phosphorylation.

* Chen et al. (2002) used a variety of methods including IMAC, alkaline phosphatase
treatment, MALDI PSD, and multidimensional electrospray-based approaches for the
phosphorylation site mapping of hyperphosphorylated proteins associated with Net1, a
regulator of exit from mitosis in yeast. These authors concluded that with the use of existing
technologies, no single method was able to identify all sites in highly phosphorylated Net1.


For additional examples of different multidimensional protein phosphorylation studies, 
see Amankwa et al. (1995), Annan et al. (2001), Steen et al. (2001b), and Stensballe et al. 
(2001). 

EMERGING MS AND MS/MS TECHNIQUES FOR PHOSPHOPEPTIDE SEQUENCING


Several new types of MS/MS methods for peptide sequencing have been developed during
the past few years. McLafferty and co-workers observed that multiply charged peptide and
protein ions generated by ESI can dissociate upon electron capture in the cell of a Fourier
transform-ion cyclotron resonance (FT-ICR) mass spectrometer (see Chapter 8) (Zubarev et
al. 1998). Electron capture dissociation (ECD) leads to extensive fragmentation of the
polypeptide backbone generating c- and z-type ions, providing good amino acid sequence
coverage for the fragmented peptides (Kelleher et al. 1999). ECD is a very gentle dissociation
method that enables detailed analysis of posttranslational modifications, including
identification of phosphorylated residues (Stensballe et al. 2000; Shi et al. 2001). In
contrast to other MS dissociation methods, ECD does not induce the neutral loss
(β-elimination) of phosphoric acid from phosphoserine and phosphothreonine even when multiple
phosphorylation sites are present in a single peptide (Stensballe et al. 2000).

Although ECD seems to be the method of choice for the fragmentation of peptides con- 
taining labile modifications such as phosphopeptides, problems arise when the phosphopep- 
tide is present in a mixture such that the phosphorylated species must first be identified 
before further analysis can be performed. Flora and Muddiman (2001) recently introduced a 
new method for the selective detection of phosphopeptides in FT-ICR MS based on the loss 
of the phospho-moiety. Performing ESI FT-ICR experiments in negative-ion mode, mass 
spectra of the peptide mixture were acquired before and after low-intensity infrared irradia- 
tion. Although this level of irradiation did not induce cleavage of the peptide backbone, it was 
sufficient to induce the loss of the phospho-moiety. Comparing the two spectra and search- 
ing for peak pairs differing by 98 daltons permitted the identification of phosphorylated 
species in the mixture. 
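
The peak-pair search described above can be sketched in a few lines of Python. This is only an illustration, not the software used by Flora and Muddiman: it assumes that both spectra have already been reduced to lists of deconvoluted monoisotopic masses, and the function name find_loss_pairs, the tolerance, and the example masses are hypothetical. The nominal 98-dalton difference is taken here as the monoisotopic mass of H3PO4 (97.977 daltons).

```python
from bisect import bisect_left, bisect_right

H3PO4_LOSS = 97.9769  # monoisotopic mass of H3PO4 in daltons


def find_loss_pairs(before, after, delta=H3PO4_LOSS, tol=0.02):
    """Return (parent, product) pairs whose masses differ by delta within tol.

    before: masses recorded without irradiation (hypothetical input format)
    after:  masses recorded after low-intensity infrared irradiation
    """
    after_sorted = sorted(after)
    pairs = []
    for parent in sorted(before):
        target = parent - delta
        # Binary search for every product mass inside the tolerance window.
        lo = bisect_left(after_sorted, target - tol)
        hi = bisect_right(after_sorted, target + tol)
        for product in after_sorted[lo:hi]:
            pairs.append((parent, product))
    return pairs


# Illustrative masses only, not measured data.
before_irradiation = [1000.50, 1500.75, 2061.83]
after_irradiation = [1000.50, 1500.75, 1963.853, 2061.83]
print(find_loss_pairs(before_irradiation, after_irradiation))
```

For multiply charged ions that have not been deconvoluted, the spacing on the m/z axis would be the loss divided by the charge, so delta would have to be scaled accordingly.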

External MALDI sources mounted onto qTOF and ion-trap tandem mass spectrometers 
provide the ability to generate amino acid sequence information from individual peptide 
species present in complex mixtures while still providing the advantages of MALDI (Qin and 
Chait 1997; Loboda et al. 2000; Shevchenko et al. 2000). MALDI tandem MS/MS analysis of 
singly protonated peptide ions produces somewhat complex mass spectra that nevertheless 
allow automatic protein identification via database searching (Qin et al. 1997; Shevchenko et
al. 2002). MALDI-MS/MS is a very attractive method for characterization of posttransla-
tionally modified proteins. Several researchers have demonstrated that MALDI-qTOF-
MS/MS enables localization of phosphorylation sites in peptides up to 3.2 kD (Bennett et al.
2000; Baldwin et al. 2001; Lee et al. 2001), and Chait and co-workers have demonstrated the 
utility of MALDI ion-trap MS/MS for the characterization of several types of modified pep- 
tides (Qin and Chait 1997). 

To avoid the problem of ionization suppression of phosphorylated peptides observed for 
MALDI as well as for ESI methods, Wind et al. (2001) utilized inductively coupled plasma 
mass spectrometry (ICPMS) in combination with liquid chromatography for the selective 
detection of phosphorylated species in a protein digest. Although no information about the 
phosphopeptides apart from the retention time is obtained, ICPMS has the significant advan- 
tage that the signal intensity is proportional to the molar amount of incorporated phospho- 
rus, i.e., the degree of phosphorylation can reliably be measured and quantified.

As phosphorylation is a reversible process, there is significant interest in comparing the 
degree of phosphorylation between two different sample sets, such as different cell states, 
before and after drug treatment, or with and without exposure to certain external conditions. 
To address this question, preliminary studies providing proof of principle have been pub-
lished recently (Oda et al. 1999; Weckwerth et al. 2000; Goshe et al. 2001). The first method
is based on metabolic labeling with stable isotopes, whereas the latter two approaches are
based on β-elimination followed by the incorporation of nucleophilic groups that are either
unlabeled or labeled with stable isotopes. For a discussion of new tools for quantitative phos-
phoproteome analysis, see Conrads et al. (2002) and Mann et al. (2002).


SUMMARY


A wide range of MS-based techniques for detailed analysis of phosphoproteins are now avail- 
able, but there is presently no "best method" for this purpose. The main challenge is the devel- 
opment of robust and efficient sample preparation methods for phosphopeptide analysis by 
MS. The most successful approaches utilize a combination of complementary methods for 
the analysis of phosphopeptides. Analytical strategies for global investigation and quantita- 
tion of phosphoproteins, i.e., phosphoproteome analysis, are in high demand. The research 
activity in the protein mass spectrometry community aimed at phosphoprotein characteri- 
zation holds promise for development of much improved methods in the near future. 


PROTOCOL 1


Phosphopeptide Purification by IMAC with Fe(III) and Ga(III)


IMMOBILIZED METAL ION AFFINITY CHROMATOGRAPHY (IMAC), first introduced by Porath
(1975), makes use of matrix-bound metals to affinity-purify phosphoproteins and phospho-
peptides. Commonly used metals in early studies such as Ni²⁺, Co²⁺, Zn²⁺, and Mn²⁺ were
shown to bind strongly to proteins with a high density of histidines (Porath et al. 1975). More
recently, immobilized Fe³⁺, Ga³⁺, and Al³⁺ metal ions have been used for the selective enrich-
ment of phosphopeptides from complex proteolytic digest mixtures containing both phos-
phorylated and nonphosphorylated components (Posewitz and Tempst 1999; Stensballe et al.
2001). The use of a nitrilotriacetic acid (NTA) matrix over iminodiacetic-acid-modified
matrices has been reported to provide an advantage in selectivity (Neville et al. 1997). The
development of elution conditions that are directly compatible with MS analysis of the
enriched phosphopeptide samples provides the option to interface IMAC and MS on-line. In
general, the strength and the selectivity of the interaction between the immobilized metal ion
and the phosphopeptide depend on numerous factors, including the degree of phosphoryla-
tion, pH, salt concentration and composition of the sample solution, peptide concentration,
type of chelated metal ions, temperature, and degree of exposure of chelated ions interacting
with the peptide side chains. This protocol, kindly provided by Hanno Steen, Allan
Stensballe, and Ole N. Jensen (University of Southern Denmark), describes the enrichment
of phosphopeptides by IMAC using Fe³⁺- and Ga³⁺-NTA resin.


MATERIALS


CAUTION: See Appendix 3 for appropriate handling of materials marked with <!>.
IMPORTANT: Use the purest chemicals and H₂O available.


» Reagents


Acetic acid (0.2 M)
Add 0.12 ml of glacial acetic acid <!> to 9.88 ml of H₂O to prepare a stock solution. The working
solution throughout the protocol is 0.1 M acetic acid.
Acetic acid (0.1 M) and acetonitrile (3:1 v/v) <!>
EDTA (0.05 M) in 1 M NaCl
Add 0.19 g of EDTA and 0.58 g of NaCl to 10 ml of H₂O.
Ferric chloride (FeCl₃) <!> (0.2 M) or 0.2 M gallium chloride (GaCl₃) <!>
Add either 0.54 g of FeCl₃ or 0.35 g of GaCl₃ to 10 ml of H₂O.
WARNING: GaCl₃ reacts violently with H₂O. Before use (Step 6), combine equal parts 0.2 M FeCl₃
or 0.2 M GaCl₃ with 0.2 M acetic acid.
Formic acid (5%) <!>
Add 0.5 ml of formic acid to 9.5 ml of H₂O.
MALDI matrix (semisaturated DHB <!> in 50% acetonitrile and 2.5% formic acid, or CHCA
in 70% acetonitrile and 0.1% trifluoroacetic acid [TFA] <!>)
If using ESI-MS instead of MALDI-MS, see Step 20 for alternative elution reagents.

Methanol (50%) <!>

Proteolytic enzyme (e.g., trypsin, Lys-C, or Asp-N)

Solvent (pH 10.5) (~0.1% ammonia solution at pH 10.5 <!>)


» Equipment


Centrifuge (benchtop)

Desalting resin, Poros 10R2; Poros Oligo R3; Poros 20MC (Applied Biosystems)

IMAC resin, QIAGEN nitrilotriacetic acid (NTA)-silica <!> (16-24-μm particle size, QIAGEN)
or Poros 20MC (Applied Biosystems)

MALDI probe (sample plate), polished stainless steel or AnchorChip type 

Mass spectrometer, preferably a MALDI-TOF with delayed extraction and reflectron, or an 
ESI-qTOF (e.g., Bruker, Micromass, and Applied Biosystems) 

Pipette tips (long narrow tips) (e.g., GELoader tips, Eppendorf) 

Software: GPMAW (http://welcome.to/gpmaw) 

Syringe (plastic, 1 ml)


» Biological Sample 


Proteins of interest separated either in-gel or in-solution 
The protocol is intended for low- or subpicomole sample amounts per experiment; however, the
method can be scaled up by increasing volumes of resins and reagents appropriately.


» Additional Reagents


Step 11 of this protocol requires the reagents listed in Chapter 7, Protocol 5.


METHOD


Preparation of the IMAC Resin 


In Steps 1-10, the slurry is gently mixed with 1 ml of each reagent, in turn, in a microfuge 
tube for 30 seconds at ambient temperature. At the end of each step, centrifuge the slurry and
discard the supernatant before proceeding with the next step. 


1. In a microfuge tube, resuspend the IMAC resin in H₂O to a concentration of ~1 mg/ml.

2. Wash the resin with H₂O.

3. Incubate the resin twice with 50 mM EDTA in 1 M NaCl at room temperature.

4. Wash the resin with H₂O.

5. Wash the resin with 0.1 M acetic acid.

6. Incubate the resin twice with 0.1 M FeCl₃ or GaCl₃ in 0.1 M acetic acid at room tempera-
ture.

7. Wash the resin with 0.1 M acetic acid.

8. Wash the resin with 0.1 M acetic acid and acetonitrile (3:1 v/v).

9. Wash the resin with 0.1 M acetic acid.

10. Resuspend the resin in 0.1 M acetic acid.


The ready-to-use slurry of IMAC resin can be stored for at least 40 days at 4°C without
reducing the performance of the resin.


Preparation of the Peptide Sample 


11. Perform in-gel or in-solution digestion of the sample protein with the desired enzyme
(trypsin, Lys-C, Asp-N, etc.) as detailed in Chapter 7, Protocol 5.

12. Dilute an aliquot of the digest corresponding to 0.1-10 pmoles of the sample into 30-50
μl of 0.1 M acetic acid.


Phosphopeptide Purification by Nanoscale IMAC 


13. Prepare an IMAC column by loading enough IMAC resin (from Step 10) into a long, nar-
row pipette tip to pack a 15-20-mm column of IMAC resin (for the preparation and use
of this column, see the panel on ADDITIONAL PROTOCOL: PREPARATION AND USE OF MICRO-
COLUMNS FOR SAMPLE DESALTING OR NANOSCALE IMAC).

14. Load the protein sample (30-50 μl) slowly onto the IMAC column (1-3 μl/min) using air
pressure from a syringe or a micropipettor.

15. Collect the flowthrough in the upper end of a nanoscale desalting column (Poros R2 or
Oligo R3 prepared according to the additional protocol at the end of this protocol) prior
to desalting and MALDI-TOF-MS peptide mass mapping.

This fraction contains all of the peptides that were not retained on the IMAC column.

16. Wash the IMAC column with 20 μl of 0.1 M acetic acid.

Here, and in the following wash steps, use gentle air pressure from an attached syringe to
push the solutions through the column. Discard the eluents from each wash step.

17. Wash the IMAC column with 20 μl of 0.1 M acetic acid and acetonitrile (3:1 v/v).

18. Wash the IMAC column with 20 μl of 0.1 M acetic acid.

19. Elute the retained phosphopeptides with two 5-μl volumes of pH 10.5 solvent
directly into 20 μl of 5% formic acid placed in the upper end of a fresh nanoscale desalt-
ing column (Poros R2 or Oligo R3).

Alternative eluting reagents are MALDI matrices such as DHB in 50% acetonitrile/2.5%
formic acid or CHCA in 70% acetonitrile/0.1% TFA. If these are used, the eluent should be
spotted directly onto the MALDI probe.

20. Elute the desalted IMAC eluate from the desalting column with 1 μl of MALDI matrix
solution (see Step 19).

For the preparation of samples for MALDI-MS analysis, see Chapter 8, Protocol 1. The eluate
is spotted as a series of 5-8 droplets onto the MALDI probe. The major part of the analyte will
be present in droplets 1 and 2. Alternatively, the peptides can be eluted for nano-ESI analysis
using either 1-2 μl of 50% methanol/1% formic acid or 1-2 μl of 5% NH₃/50% methanol plus
acidification for negative-ion-mode analysis (acidification by 50% formic acid/50% methanol).

ADDITIONAL PROTOCOL: PREPARATION AND USE OF MICROCOLUMNS FOR SAMPLE DESALTING OR NANOSCALE IMAC


This method, which is very similar to the preparation of microcolumns detailed in Chapter 8, 
Protocols 2 and 3, is based on the principles of Gobom et al. (1999) and Stensballe et al. (2001). 
Microcolumns can be prepared using any kind of chromatographic resin. The resin is held in place 
by making a constriction at the end of a GELoader pipette tip. Sample loading, washing, and elu-
tion are performed by loading liquid on top of the resin and applying air pressure to generate a low
flow through the column. No frits are necessary. 


Additional Materials 

CAUTION: See Appendix 3 for appropriate handling of materials marked with <!>. 

Chromatographic resin (e.g., Poros R1, R2, Oligo R3, or IMAC) 

GELoader tips (Eppendorf) 

Methanol <!> 

Plastic syringe (1 ml) 

Syringe adaptor 
The syringe is attached to the GELoader tips by using the top part of a 200-μl pipette tip modified to
connect the syringe and the GELoader tip.


Method 


1. Prepare a slurry of 100-200 μl of chromatographic resin in 1 ml of methanol.


2. Partially constrict a GELoader pipette tip by gently squeezing or twisting the end of the tip. This 
allows liquid to flow through the tip while retaining the chromatographic resin. 


3. Load 5 μl of slurry into the GELoader tip from the top and pack it by applying air pressure with the
1-ml syringe adapted to fit the GELoader tip microcolumn. The column height should be 2-4 mm.


In the case of the IMAC column prepared for the main protocol, use enough resin to prepare a col-
umn of 15-20 mm in height.


4. Equilibrate the resin by flushing 10-20 μl of 5% formic acid through the
GELoader tip using air pressure from the syringe.
TFA or acetic acid can be used instead of formic acid in the mobile phases, whereas acetonitrile can
substitute for methanol. Formic acid and methanol are recommended for nano-electrospray mass
spectrometry (see Chapter 8).


5. Dissolve the peptide or protein sample in 20-40 μl of 5% formic acid.

6. Load 5-20 μl of sample onto the microcolumn and gently press it through the column by air
pressure using the syringe.

7. Wash the resin by flushing 10-20 μl of 5% formic acid through the packed GELoader tip by air
pressure (syringe).


8. Elute the sample using a small volume of 5% formic acid/50% methanol. 


The eluate can be collected in a microfuge tube, deposited directly onto the MALDI probe tip, or
eluted directly into a nano-electrospray needle. In the latter two cases, the elution volume should
be 1-2 μl only.

Peptide separation may be improved by eluting the sample from the microcolumn by a step gradi-
ent using a series of mobile phases containing 5% formic acid in 15% methanol, 30% methanol,
and 50% methanol, respectively. For MALDI-MS analysis, elute the sample directly onto the
MALDI probe using matrix solution, e.g., HCCA, SA, or DHB dissolved in 30-50% methanol or
acetonitrile. Deposit the eluate in a series of tiny droplets rather than one large drop (see Chapter
8, Protocol 11).

PROTOCOL 2


Alkaline Phosphatase Treatment of Phosphopeptides 
Either Before or After MALDI Analysis 


THE USE OF THE ENZYME ALKALINE PHOSPHATASE allows identification of phosphopeptides in a
mixture of predominantly nonphosphopeptides. Using a MALDI-MS instrument, the masses
of peptides are acquired both before and after alkaline phosphatase treatment, which removes
phospho-moieties from serine, threonine, and/or tyrosine. Any peptide whose mass decreases
by 80 daltons, or a multiple thereof, is a phosphopeptide. An advantage of using MALDI-
MS for these experiments is that the peptide ions produced tend to be singly charged rather
than multiply charged (as with ESI), thus making the interpretation easier (see Chapter 8). In
this protocol, provided by Hanno Steen, Allan Stensballe, and Ole N. Jensen (University of
Southern Denmark), three methods are given:


• Method A: In-solution dephosphorylation prior to MALDI-MS analysis.


• Method B: In-solution dephosphorylation after MALDI-MS analysis based on the protocol
of Stensballe et al. (2001).


• Method C: On-probe dephosphorylation after MALDI-MS analysis based on the protocol
of Larsen et al. (2001).
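

The comparison of peak lists acquired before and after alkaline phosphatase treatment can be sketched as follows. This is a hedged illustration rather than part of the protocol: it assumes singly charged monoisotopic masses (as is typical for MALDI), uses 79.966 daltons (HPO3) for the nominal 80-dalton shift, and the function name, tolerance, upper limit on sites, and example masses are all hypothetical.

```python
HPO3 = 79.96633  # monoisotopic mass removed per phosphate group, in daltons


def phospho_shifts(untreated, treated, max_sites=4, tol=0.05):
    """Return (mass_before, mass_after, n_phosphates) for every match.

    A peak in the untreated spectrum is reported as a phosphopeptide candidate
    when the treated spectrum contains a peak lighter by n * HPO3 (n >= 1).
    """
    hits = []
    for before in untreated:
        for n in range(1, max_sites + 1):
            expected = before - n * HPO3
            for after in treated:
                if abs(after - expected) <= tol:
                    hits.append((before, after, n))
    return hits


# Illustrative masses only, not measured data.
untreated = [1100.55, 1500.70, 2200.90]
treated = [1100.55, 1420.73, 2040.97]
for before, after, n in phospho_shifts(untreated, treated):
    print(f"{before:.2f} -> {after:.2f}: {n} phosphate group(s)")
```

Peaks that appear unchanged in both lists (1100.55 in this example) are treated as nonphosphorylated. In practice the control reaction without phosphatase would be inspected in the same way to rule out shifts unrelated to the enzyme.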


MATERIALS 


CAUTION: See Appendix 3 for appropriate handling of materials marked with <!>. 
IMPORTANT: Use the purest chemicals and H,O available. 


» Reagents 
Acetonitrile (70%) <!> in 50 mM NH₄HCO₃


Alkaline phosphatase (AP) (0.05 unit AP/μl in 50 mM NH₄HCO₃)
A stock concentration of calf intestinal alkaline phosphatase (Roche) of 20 units/μl rather than 1
unit/μl is recommended because it carries less storage buffer into the working solution.


Formic acid (5%) <!>
Add 0.5 ml of formic acid to 9.5 ml of H₂O.

MALDI matrix (semisaturated DHB <!> in 50% acetonitrile and 2.5% formic acid or CHCA
in 70% acetonitrile and 0.1% trifluoroacetic acid <!>)
For choice of MALDI matrix, see Table 8.8 in Chapter 8.


NH₄HCO₃ (50 mM, pH 7.8) <!>
Add 0.040 g of NH₄HCO₃ to 10 ml of H₂O.


Trifluoroacetic acid (5%) (TFA) 
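

The weighed amounts given in the reagent lists of Protocols 1 and 2 can be cross-checked with a short calculation. The sketch below is illustrative only. The text does not state which salt or hydrate forms are used, so the molar masses are assumptions: disodium EDTA dihydrate and FeCl3 hexahydrate are assumed because they reproduce the stated weights. The function names are hypothetical.

```python
# Molar masses in g/mol. The hydrate forms are assumptions (see lead-in).
MOLAR_MASS = {
    "EDTA disodium dihydrate": 372.24,
    "NaCl": 58.44,
    "FeCl3 hexahydrate": 270.30,
    "GaCl3": 176.08,
    "NH4HCO3": 79.06,
}


def grams_needed(compound, molarity, volume_ml):
    """Mass (g) of compound required for volume_ml of solution at molarity (mol/l)."""
    return MOLAR_MASS[compound] * molarity * volume_ml / 1000.0


def stock_volume(stock_conc, final_conc, final_volume):
    """Volume of stock to dilute to final_volume (same units), from C1*V1 = C2*V2."""
    return final_conc * final_volume / stock_conc


# Recipes from the reagent lists (10 ml each).
print(round(grams_needed("EDTA disodium dihydrate", 0.05, 10), 2))  # EDTA, 0.05 M
print(round(grams_needed("NaCl", 1.0, 10), 2))                      # NaCl, 1 M
print(round(grams_needed("FeCl3 hexahydrate", 0.2, 10), 2))         # FeCl3, 0.2 M
print(round(grams_needed("GaCl3", 0.2, 10), 2))                     # GaCl3, 0.2 M
print(round(grams_needed("NH4HCO3", 0.05, 10), 3))                  # NH4HCO3, 50 mM

# Volume (in μl) of a 20 units/μl phosphatase stock needed for 1000 μl at 0.05 unit/μl.
print(stock_volume(20, 0.05, 1000))
```

The same dilution function covers the acetic acid working solution: mixing equal parts of the 0.2 M stock with another solution halves it to the 0.1 M used throughout Protocol 1.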


» Equipment 
Centrifuge (benchtop) 
Desalting resin, Poros 10R2; Poros Oligo R3; Poros 20MC (Applied Biosystems)
MALDI probe
Mass spectrometer


The spectrometer should be capable of generating MALDI reflector-TOF and MALDI linear-TOF
spectra.


Plastic box


The box should have a lid able to keep the MALDI metal probe in a humid environment. See
Method C, Step 2.


Pipette tips (long narrow tips) (e.g., GELoader tips, Eppendorf) 
Syringe (disposable plastic, 10 ml) 


» Biological Sample
Phosphopeptide sample either in solution (Method A) or bound to a MALDI probe (Methods 
B and C) 


The protocol is intended for low- or subpicomole sample amounts per experiment. 


METHODS 


Method A: In-solution Dephosphorylation Prior to MALDI-MS Analysis 


1. Prepare two nanoscale desalting columns (Poros R2 or R3) according to the additional 
protocol in Protocol 1. Make sure that the resin remains wet with 5% formic acid to keep 
it humid during the incubation (Step 3). The upper end of the GELoader tip serves as a 
reaction chamber. 


2. Place 20 μl of alkaline phosphatase in 50 mM NH₄HCO₃ in one of the reaction chambers.
Dilute the peptide sample (typically 1-2 μl) into the alkaline phosphatase solution.


The final pH of the reaction must be ~8; if necessary, adjust it with 50 mM NH₄HCO₃
(the pH may be checked with pH paper).


IMPORTANT: Prepare a control reaction within a GELoader tip containing 50 mM
NH₄HCO₃. Add an equivalent amount of peptide, but no alkaline phosphatase.


3. Incubate the GELoader tips for 45 minutes at 37°C to dephosphorylate the peptides. 


4. Acidify the samples by addition of 20 μl of 5% formic acid and load each sample onto a
Poros resin by application of air pressure from a syringe.


5. Wash the columns with 20 μl of 5% formic acid and then elute with 1 μl of saturated
MALDI matrix. Spot the eluates as a series of 5-8 droplets onto the MALDI probe for
optimal sensitivity. The major part of the analytes will be present in droplets 1 and 2.


6. Record MALDI reflector-TOF and MALDI linear-TOF spectra on a portion of the phos-
phopeptides, on the material from Step 5, and the control sample.


Method B: In-solution Dephosphorylation After MALDI-MS Analysis 


This method is applicable for both DHB and CHCA MALDI-MS matrices. 


1. Prepare two nanoscale desalting columns (Poros R2 or R3) according to the additional
protocol in Protocol 1. Make sure that the resin remains wet with 5% formic acid to keep
it humid during the incubation (Step 5).

2. Place 20 μl of alkaline phosphatase in 50 mM NH₄HCO₃ in the upper end of the
GELoader tip, which serves as a reaction chamber.

IMPORTANT: Prepare a control reaction within a GELoader tip containing 50 mM
NH₄HCO₃.

3. Record MALDI reflector-TOF and MALDI linear-TOF spectra on the peptide mixture.

4. Use a micropipettor to carefully dissolve the analyte/matrix deposits using a maximum
volume of 0.5-1.5 μl of 70% acetonitrile in 0.05 M NH₄HCO₃. Transfer equal volumes of
dissolved analyte/matrix deposits directly to each reaction chamber.

The stainless steel surface of the MALDI probe must be of a "polished" type or AnchorChip
for the recovery of analyte to be successful.

5. Incubate the GELoader tips for 45 minutes at 37°C to dephosphorylate the peptides.

6. Acidify the samples by addition of 2 volumes of 5% formic acid and load each sample
onto a Poros resin by application of air pressure from a syringe.

7. Wash the columns with 20 μl of 5% formic acid and then elute with 1 μl of saturated
MALDI matrix. Spot the eluates as a series of 5-8 droplets onto the MALDI probe for
optimal sensitivity. The major part of the analytes will be present in droplets 1 and 2.

8. Record MALDI reflector-TOF and MALDI linear-TOF spectra on the alkaline phos-
phatase-treated peptides.


Method C: On-probe Dephosphorylation After MALDI-MS Analysis


This method is applicable only for CHCA MALDI-MS matrix.


1. After obtaining MALDI reflector-TOF and MALDI linear-TOF spectra from an ana-
lyte/matrix deposit, dissolve the matrix on the target using 1.5 μl of 0.05 M NH₄HCO₃
containing alkaline phosphatase (0.05 unit/μl).

2. Place the target in a closed plastic box containing a wet tissue to prevent the sample from
drying. Incubate the box for 30 minutes at 37°C.

3. Acidify the sample with 0.5 μl of 5% TFA, thus allowing the matrix to recrystallize.

4. Prior to MALDI analysis of the dephosphorylated peptides, wash the surface of the
matrix microcrystals gently with 0.1% TFA to wash away salts and glycerol originating
from the dephosphorylation buffer. Apply 10 μl of 0.1% TFA as a droplet onto the
matrix deposit and remove it immediately using the edge of a tissue.


PROTOCOL 3 


Characterization of Phosphopeptides Using a 
Combination of Immobilized Metal Ion Affinity
Media and Direct Analysis by MALDI-TOF-MS 


PHOSPHOPROTEINS AND PEPTIDES CAN BE BOUND with high specificity to immobilized metal
ions, such as Fe³⁺ and Ni²⁺ (Andersson and Porath 1986; Muszynska et al. 1992; Olcott et al.
1994). Recently, Tempst reported that Ga³⁺ has better selectivity than Fe³⁺ for phosphopep-
tides (Posewitz and Tempst 1999). This technique can be used with either on-line or off-line
MS analysis (Michel et al. 1988; Nuwaysir and Stults 1993; Neville et al. 1997; Cleverley et al.
1998; Hanger et al. 1998; Li and Dass 1999; Posewitz and Tempst 1999). Elution of the phos-
phopeptides from the metal ion column prior to MS analysis can, however, result in sample
loss. Affinity-bound analytes, including phosphopeptides, can be directly analyzed by
MALDI-MS without prior elution from the affinity media (Papac et al. 1994). The utility of
this method has been extended by the observation that consecutive enzymatic reactions, such
as phosphatase or carboxypeptidase Y digestion, can be carried out on affinity-bound pep-
tides (Papac et al. 1994; Qian et al. 1999; Zhou et al. 2000; Merrick et al. 2001). When the
affinity-bound phosphopeptides are treated with phosphatase, the number of phosphoryla-
tion sites can be determined based on the observation of 80-dalton (or multiples of 80 dal-
tons) mass shifts in the MALDI-MS of the reaction mixture. Carboxypeptidase Y treatment
of the affinity-bound phosphopeptides can also be used to cleave the amino acids from the
carboxyl terminus, with subsequent direct analysis of the enzymatic products by MALDI-MS
to locate the phosphorylation sites on the bound phosphopeptides. This protocol details the
preparation and use of Fe³⁺ or Ga³⁺ metal IMAC with the on-bead analysis of phosphopep-
tides by MALDI-MS (see Figure 9.11). Enzymatic digestion of affinity-bound peptides is also
described. This protocol was provided by Leesa J. Deterding, Jenny M. Cutalo, and Kenneth
B. Tomer (National Institute of Environmental Health Sciences, Research Triangle Park,
North Carolina).
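

The idea behind locating residues from a carboxypeptidase Y time course can be sketched as a ladder-reading calculation: each successive truncation product is lighter than the previous one by the mass of the residue removed from the carboxyl terminus. The code below is a minimal sketch under stated assumptions, not the authors' analysis procedure. It assumes a clean list of monoisotopic masses for the intact peptide and its truncation products; the function name, tolerance, and example masses are hypothetical. Leucine and isoleucine are isobaric and are reported together.

```python
# Monoisotopic residue masses in daltons; pS, pT, and pY are the phosphorylated
# forms (residue mass plus HPO3, 79.966).
RESIDUE_MASS = {
    "G": 57.02146, "A": 71.03711, "S": 87.03203, "P": 97.05276,
    "V": 99.06841, "T": 101.04768, "C": 103.00919, "L/I": 113.08406,
    "N": 114.04293, "D": 115.02694, "Q": 128.05858, "K": 128.09496,
    "E": 129.04259, "M": 131.04049, "H": 137.05891, "F": 147.06841,
    "R": 156.10111, "Y": 163.06333, "W": 186.07931,
    "pS": 166.99836, "pT": 181.01401, "pY": 243.02966,
}


def read_ladder(masses, tol=0.02):
    """Return residues in order of removal from the carboxyl terminus.

    masses: intact peptide plus its C-terminal truncation products, any order.
    A mass difference that matches no residue within tol is reported as '?'.
    """
    ordered = sorted(masses, reverse=True)
    residues = []
    for heavier, lighter in zip(ordered, ordered[1:]):
        diff = heavier - lighter
        matches = [aa for aa, m in RESIDUE_MASS.items() if abs(m - diff) <= tol]
        residues.append(matches[0] if matches else "?")
    return residues


# Hypothetical ladder: a peptide of mass 2000.00 losing R, T, I, then S.
ladder = [2000.0, 1843.89889, 1742.85121, 1629.76715, 1542.73512]
print(read_ladder(ladder))
```

Reading the removed residues in reverse gives the carboxy-terminal sequence in the conventional direction. A step of about 167 daltons in such a ladder would indicate a phosphoserine, which is how the ladder can localize a phosphorylation site.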

FIGURE 9.11. Schematic of affinity binding of phosphopeptides to immobilized metal ion affinity columns.


MATERIALS 


CAUTION: See Appendix 3 for appropriate handling of materials marked with <!>. 


» Reagents
Acetic acid (100 mM) <!>
Ammonium bicarbonate (50 mM, pH 8.0) <!>
EDTA (100 mM in H₂O, pH 8.0)
Gallium chloride (GaCl₃) (60 mM) <!> or 100 mM ferric chloride (FeCl₃) <!>
MALDI matrix


The MALDI matrix is a saturated solution of α-cyano-4-hydroxy-cinnamic acid <!> in 45:45:10
ethanol:H₂O:formic acid <!> (v/v/v). Store it in the dark at room temperature.


Nickel-nitrilotriacetic acid (Ni-NTA) resin (QIAGEN) 
Sodium citrate (50 mM, pH 6.0) <!> 


» Equipment 
Compact reaction columns (CRC) and filters (35-μm pore size) (USB Corporation)
Incubator (slow rotation, i.e., rotating wheel) 
MALDI mass spectrometer (e.g., the Voyager DE-STR) or equivalent
The Voyager DE-STR mass spectrometer (Applied Biosystems) is equipped with a nitrogen laser
(337 nm) to desorb and ionize the samples. A close external calibration, using two points to brack-
et the mass range of interest, should be performed prior to analyzing the protein sample (see
Chapter 8, Protocol 1).


MALDI target 
Microfuge tubes (0.7 ml)


» Biological Molecules
Calf intestinal alkaline phosphatase (Roche)
Carboxypeptidase Y (1 μg/μl in H₂O)
EXPERIMENTAL TIP: Because MALDI-MS sensitivity is significantly reduced in the presence of 
glycerol, it is highly recommended that enzymes not be in solutions containing glycerol. 
Phosphoprotein of interest 


TPCK-modified trypsin (sequence-grade, e.g., Promega) 


METHOD 


Preparation of the Immobilized Metal Ion Affinity Column


1. Insert the CRC filter (35 μm) into the CRC, and place the CRC in a standard microfuge tube.


2. Add ~30 μl of the Ni-NTA resin slurry to the bottom of the CRC tube and drain the col-
umn by centrifugation at 120g for ~1 minute at room temperature.


These centrifugation conditions are the same for all washes unless otherwise stated.


3. Wash the Ni-NTA resin three times with 30 μl of 100 mM EDTA (pH 8.0) to remove
the bound Ni²⁺ metal.


4. Wash the column three times with 30 μl of H₂O.

5. Wash the column three times with 30 μl of 100 mM acetic acid.


6. To regenerate the columns with either Fe³⁺ or Ga³⁺ metal, wash the column three times
with 30 μl of either 60 mM GaCl₃ or 100 mM FeCl₃.


7. To remove any unbound metal ions, wash the column three times with 30 μl of H₂O
followed by three times with 30 μl of 100 mM acetic acid.
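

The centrifugation in Step 2 is specified as a relative centrifugal force (120g), whereas many benchtop microcentrifuges are set in revolutions per minute. The conversion can be sketched as below, using the standard relation RCF = 1.118 × 10⁻⁵ × r × rpm², with r in centimeters. The rotor radius used in the example (7 cm) is an assumption for illustration; the actual value must be taken from the rotor documentation. The function names are hypothetical.

```python
import math

RCF_CONSTANT = 1.118e-5  # standard factor for radius in cm and speed in rpm


def rpm_for_rcf(rcf, radius_cm):
    """Rotor speed (rpm) that produces the requested RCF (x g) at radius_cm."""
    return math.sqrt(rcf / (RCF_CONSTANT * radius_cm))


def rcf_for_rpm(rpm, radius_cm):
    """Relative centrifugal force (x g) produced at rpm for radius_cm."""
    return RCF_CONSTANT * radius_cm * rpm ** 2


# Example: 120g with an assumed 7-cm rotor radius.
speed = rpm_for_rcf(120, 7.0)
print(f"Set the centrifuge to about {speed:.0f} rpm")
```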


Affinity Binding of Phosphopeptides to IMAC Column


8. Digest the phosphoprotein with trypsin (protein:enzyme ratio of 20:1 to 100:1 [w/w]) in
50 mM NH₄HCO₃ buffer (pH 8) for 2 hours at 37°C.


For these experiments, the protein concentration was 0.1 μg/μl.


9. Load 20 μl (~20 μg) of the trypsin-digested phosphorylated protein with 30 μl of 100 mM
acetic acid onto the IMAC column. Incubate the column on a slowly rotating wheel for
~30 minutes at 37°C.


The column should be checked periodically for leakage from the bottom of the CRC. Leakage 
can be alleviated by sealing the bottom of the CRC with Parafilm. 




10. To remove unbound peptides from the IMAC column, wash the column with 30 μl of
H₂O followed by 30 μl of 100 mM acetic acid. Repeat this step three times.


The IMAC column (with bound phosphopeptides) can be stored in 100 mM acetic acid for
at least 2-3 days at 4°C. Even bound phosphopeptides from a complex mixture, such as a
cell extract, should be stable, because endogenous phosphatases are digested by the trypsin
and/or elute from the column.


To verify that nonspecific peptides are washed off the column or if the unbound peptides
are of interest, analyze the washes by MALDI-MS.


11. Spot a 0.5-μl aliquot of settled beads with 0.5 μl of MALDI matrix on the MALDI target.


Alkaline Phosphatase Digestion of Affinity-bound Phosphopeptides


12. Mix a 5-μl aliquot of the phosphopeptides affinity-bound to the IMAC media (from Step
10) with 5 μl of 50 mM NH₄HCO₃ (pH 7.8) and 1 unit of calf intestine alkaline phos-
phatase in a 0.7-ml microfuge tube.


13. Incubate the tube on a slowly rotating wheel at 37°C.


14. Monitor the time course of the digestion periodically by spotting a 0.5-μl aliquot of the reac-
tion mixture (supernatant and beads) with 0.5 μl of MALDI matrix onto a MALDI target.
Take the first aliquot 30 minutes after beginning Step 13.


Initial aliquots are typically taken at 30-minute intervals. Depending on the protein, incu-
bation times for the digestion can range from 1 hour to overnight.


Carboxypeptidase Y Digestion of Affinity-bound Phosphopeptides 


15. Transfer a 25-μl aliquot of the phosphopeptides affinity-bound to the IMAC media (from
Step 10) into another CRC.


16. Wash the column three times with 30 μl of 50 mM sodium citrate (pH 6).

17. Add 40 μl of 50 mM sodium citrate (pH 6) to the column.

18. Add 1 μl of carboxypeptidase Y solution (1 μg/μl in H₂O) to the column.

19. Incubate the column on a slowly rotating wheel at 37°C.


Depending on the protein, the carboxypeptidase Y digests can take from a few minutes to
overnight to complete. Thus, the time course of the digest (Step 20) should be monitored
initially at 30-60-second intervals.


20. To monitor the time course of the reaction:
a. Wash the beads three times with 50 μl of 100 mM acetic acid.
b. Add 30 μl of 100 mM acetic acid.
c. Spot 0.5 μl of beads with 0.5 μl of MALDI matrix onto the MALDI target for MALDI-
MS analysis.
d. Resume the carboxypeptidase Y reaction by repeating Steps 16-19.


Mass Spectrometry


21. Perform MALDI analyses on the samples from Steps 11, 14, and 20c using a delayed-
extraction TOF mass spectrometer. 

CASE STUDY


A tryptic digest of a solution containing two α-casein variants (S1 and S2) and trace amounts
of β-casein was analyzed by MALDI-MS (see Figure 9.12a). Nine major peaks were observed
in the mass range shown. The α-casein tryptic digest was then loaded onto a Fe³⁺ IMAC col-
umn. Direct MALDI-MS analysis of the affinity-bound peptides (Figure 9.12b) showed only
two major ions, both of which correspond in mass to monophosphorylated tryptic peptides
T15 and T14-15. It should be noted that tryptic peptides that contain multiple phospho-
rylation sites were not observed under these experimental conditions, presumably because
the Fe³⁺-loaded IMAC support binds these phosphopeptides more tightly than does the cor-
responding Ga³⁺-loaded IMAC support (Zhou et al. 2000), and thus they are not released during
the MALDI process. After phosphatase treatment (1 hour at 37°C) of the α-casein phospho-
peptides bound to the Fe³⁺-IMAC, ions corresponding in mass to dephosphorylated T15 and
T14-15 were observed in addition to the phosphorylated T15 and T14-15 (Figure 9.12c).
Further digestion with alkaline phosphatase (overnight at 37°C) resulted in nearly complete
dephosphorylation of the affinity-bound phosphopeptides (Figure 9.12d). From these data,
the number of phosphorylation sites in the affinity-bound peptides can be determined.

To gain sequence information about the affinity-bound phosphopeptides, the immobi-
lized metal ion affinity columns can be subjected to carboxypeptidase Y digestion. The car-
boxypeptidase Y digest of the tetraphosphorylated peptide T1-2 of β-casein resulted in
cleavage of the carboxy-terminal amino acid residues SITR (Zhou et al. 2000). From these
data, the location of the four phosphates could be assigned to the remaining four serines
on T1-2. It should be noted that carboxypeptidase Y digestion of immobilized metal ion
affinity media containing multiple affinity-bound peptides results in complex MALDI
spectra, making interpretation of the sequence information difficult. In these cases,
sequence information may be obtained by elution of the peptides from the IMAC followed
by ESI-MS/MS analysis.

With the development of hybrid quadrupole time-of-flight (QTOF) and TOF-TOF
instrumentation with MALDI sources, MS/MS sequence data can be obtained directly
from MALDI-desorbed ions. The MALDI-QTOF-MS/MS spectrum of m/z 1952,
monophosphorylated T14-15 from α-casein, was acquired directly from the Fe³⁺ IMAC
resin and is shown in Figure 9.13. The most abundant ion observed, m/z 1855, is due to
loss of 98 daltons, H3PO4, from the protonated molecule. In addition, a nearly complete
series of ions containing the carboxyl terminus, y ions, and y ions plus loss of 98, are
observed. The y ions 1 through 4 are not shifted in mass, indicating that the phospho-group
is not located on these residues (A116 through R119). In contrast, the y ions from y5
upward all show an increase in mass of 80 daltons. There is also a series of ions arising
from loss of 98 daltons from the y ions y5 (S115) through y14 (V106). These data show that
the location of the phospho-group is the serine residue S115.
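
The localization argument above (unshifted low y ions, shifted high y ions) can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the peptide `VLGESLAKR` is hypothetical (it is not the T14-15 sequence, which is not given here), the function names are invented for this sketch, and the residue masses are standard monoisotopic values rather than values taken from this chapter.

```python
# Monoisotopic residue masses (Da) for the residues used in this illustration.
RESIDUE = {"G": 57.02146, "A": 71.03711, "S": 87.03203, "V": 99.06841,
           "L": 113.08406, "E": 129.04259, "K": 128.09496, "R": 156.10111}
WATER, PROTON, HPO3 = 18.01056, 1.00728, 79.96633


def y_ions(seq, phospho_index=None):
    """Return [y1, y2, ...] as singly protonated masses.

    phospho_index is the 0-based position (from the amino terminus) of a
    phosphorylated residue, or None for the unmodified peptide.
    """
    masses = []
    total = WATER + PROTON
    for i in range(len(seq) - 1, -1, -1):  # walk from the carboxyl terminus
        total += RESIDUE[seq[i]]
        if i == phospho_index:
            total += HPO3
        masses.append(round(total, 4))
    return masses


def shifted_y_ions(seq, phospho_index):
    """Return the y-ion numbers whose mass increases when the site is phosphorylated."""
    plain = y_ions(seq)
    phos = y_ions(seq, phospho_index)
    return [n + 1 for n, (a, b) in enumerate(zip(plain, phos)) if b - a > 1.0]


peptide = "VLGESLAKR"        # hypothetical sequence, for illustration only
site = peptide.index("S")    # the single serine in this toy peptide
print(shifted_y_ions(peptide, site))
```

In this toy peptide the serine is the fifth residue from the carboxyl terminus, so y1 through y4 are unshifted and every y ion from y5 upward carries the extra 80 daltons, which mirrors the pattern described for S115.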

The combination of immobilized metal ion affinity and direct MALDI-MS analysis
allows the selective enrichment and detection of phosphopeptides from complex reaction
mixtures. The detection of phosphorylation sites in low-abundance phosphoproteins, where
typically only a small fraction of the protein is modified, should be enhanced. Subsequent
enzymatic digestion or MS/MS of the immobilized phosphopeptides can provide additional
information on the number and location of phosphorylation sites in the peptides.


FIGURE 9.12. MALDI-MS spectra (performed in linear mode) of α-casein tryptic digest in solution (a);
Fe³⁺-NTA-bound phosphopeptides from α-casein tryptic digest (b); 1-hour phosphatase treatment of
Fe³⁺-NTA-bound phosphopeptides from α-casein tryptic digest (c); and overnight phosphatase treatment of
Fe³⁺-NTA-bound phosphopeptides from α-casein tryptic digest (d). The ion labeled with an asterisk (*) is
a background ion. S1 and S2 are α-casein variants. T is an abbreviation for tryptic peptide.

FIGURE 9.13. MALDI-QTOF-MS/MS spectrum of m/z 1952, T14-15, from the α-casein tryptic digest acquired
directly from the Fe³⁺-NTA resin. Structurally significant ions are indicated on the spectrum. Matrix was a
saturated solution of 2,5-dihydroxybenzoic acid in 70:30 acetonitrile/0.1% TFA.


PROTOCOL 4 


Off-line Micro-IMAC Enrichment of Phosphoproteins 


IMMOBILIZED METAL AFFINITY CHROMATOGRAPHY (IMAC) can be performed off-line or on-line
with μLC-ESI-MS/MS. The on-line configuration has been used successfully despite suboptimal
low-level analysis due to the relative disparity between IMAC and μLC flow rates (see the
earlier sections on On-line micro-IMAC and Off-line micro-IMAC). With IMAC performed
off-line from μLC-ESI-MS, sample volumes from 1 to 100 μl can be loaded using a sample
pressure vessel (Goodlett et al. 2000b) or an autosampling system equipped with a
sample-trapping cartridge (Hayashi et al. 2001). This protocol describes the construction and use of a
micro-IMAC column for the enrichment of phosphoproteins (additional details are available
in Corthals et al. [1999] and Figure 9.4). It is similar to the procedure used to generate a
microcapillary HPLC column (Karlsson and Novotny 1988; Kennedy and Jorgenson 1989) (see
Chapter 5, Protocol 2 and Chapter 8, Protocol 4). Protocol 5 describes the next step: the
analysis of the enriched sample of phosphoproteins by μLC-ESI-MS/MS. This protocol was
provided by Garry L. Corthals, Valerie C. Wasinger, and David R. Goodlett (Geneva Proteomics
Centre, Geneva University Hospital, Geneva, Switzerland; University of New South Wales,
Kensington, Australia; and Institute for Systems Biology, Seattle, Washington, respectively).


MATERIALS 

CAUTION: See Appendix 3 for appropriate handling of materials marked with <!>.

» Reagents

Ammonium acetate (0.1%) <!> in 50 mM Na2HPO4 (pH 8.0) <!>

EDTA (0.1 M)

Ferric chloride (FeCl3) (0.1 M) <!> in 0.1 M acetic acid <!>
See Step 10.

» Equipment

Fittings and unions (e.g., Microtight Adapters, Upchurch, Washington)

Helium, pressurized gas <!>

IMAC resin (e.g., Poros-MC, PerSeptive Biosystems)

Polyimide-coated fused-silica capillaries (360 μm O.D. x 50-100 μm I.D.) (e.g., Polymicro
Technologies, Tucson, Arizona)

Pressure vessel (contact garry.corthals@dim-hcuge.ch)

Teflon tubing (1/16 inch O.D. x 0.0001 inch I.D.)
See Step 1.


» Biological Sample

Peptides from phosphoprotein sample of interest
The phosphoprotein should be digested into peptides according to Chapter 7, Protocol 5.


METHOD


Construction and Preparation of an IMAC Microcolumn 


In addition to the method described in this protocol, IMAC columns can also be prepared in
constricted gel-loader pipette tips, or commercially prepared tips can be purchased (e.g.,
MC-ZipTip, Millipore). For details on packing and using an IMAC "tip column," see Protocol 1
and Chapter 8, Protocol 2.


1. To one end of a 10-cm-long piece of Teflon tubing (1/16 inch O.D. x 0.0001 inch I.D.),
   insert a piece of polyimide-coated fused-silica capillary and hold it in place with a union.

2. Before fixing the second of the two polyimide capillaries in place, place the open Teflon
   end in a slurry of IMAC resin inside a vessel pressurized by helium.

3. Pack the column to a length of ~5 cm at 500 psi pressure.

4. Insert the second piece of fused-silica capillary into the Teflon tubing, and fix it in place
   with a second union.

5. Place one of the two polyimide capillaries in the pressure vessel. The other capillary serves
   as an outlet.

   This configuration allows loading of the sample, washing away of contaminants, and eluting
   of the phosphopeptides from the IMAC column rapidly and at a relatively low pressure.

6. Place a microfuge tube containing H2O in the pressure vessel, and wash the column with
   H2O for 5 minutes at 500 psi. Monitor the solution that elutes from the column with pH
   paper.

   Monitor the pH during Steps 6-11.

7. Replace the microfuge tube in the pressure vessel with one containing 0.1 M EDTA, and
   wash the column for ~2.5 minutes (until there is a change in the pH) at 500 psi.

8. Replace the microfuge tube with one containing H2O and wash the column for 5 minutes
   at 500 psi.

9. Replace the microfuge tube with one containing 0.1 M acetic acid and wash the column
   for 5 minutes at 500 psi. Wash until there is a change in the pH.

10. Replace the microfuge tube with one containing 0.1 M FeCl3 in 0.1 M acetic acid and
    activate the column for 5 minutes at 500 psi.

    Other metals can also be used, such as cupric sulfate, nickel chloride, or gallium nitrate.

11. Replace the microfuge tube with one containing 0.1 M acetic acid and wash the column
    for 10 minutes at 500 psi to remove any unbound metal ions.


Enrichment of Phosphopeptides 


12. Load ≥1 pmole of the protein digest (<5 μl) onto the IMAC column at 200 psi. The
    peptide mixture should be loaded onto the column in either H2O or 0.05 M acetic acid.

13. Wash the column with 0.1 M acetic acid for 2.5 minutes at 200 psi.

14. Wash the column with H2O for 2.5 minutes at 200 psi.

15. Elute the phosphopeptides from the column with 0.1% ammonium acetate in 50 mM
    Na2HPO4 (pH 8.0) at 200 psi. Collect the eluted peptides in microfuge tubes and use
    directly for MS/MS analysis in Protocol 5.


REGENERATING THE IMAC MEDIA

The IMAC column can be regenerated as follows:

Flush the column with H2O for 2.5 minutes at 500 psi.

Flush the column with 0.1 M EDTA for 2.5 minutes at 500 psi.

Flush the column with H2O for 2.5 minutes at 500 psi.

Flush the column with 0.1 M acetic acid for 2.5 minutes at 500 psi.

The column can be stored in 0.1 M acetic acid and reused three times before it should be


PROTOCOL 5


Analysis of Phosphopeptides by μLC-ESI-MS/MS


WHETHER USING A PRESSURE CELL (Figure 9.4) or gel-loader tips, IMAC-purified peptides
(from Protocol 4) can be injected into a μLC separation system and analyzed by μLC-ESI-MS/MS
(Becker et al. 1998; Katze et al. 2000). Phosphopeptides behave and elute similarly to
nonphosphorylated peptides during RP-HPLC, but because of a decrease in hydrophobicity
due to the addition of the phosphate moiety, a phosphopeptide will generally elute before the
nonphosphopeptide with the identical amino acid sequence. This difference in elution time
is most pronounced for short peptides. Thus, when analyzing phosphopeptides by μLC
on-line with ESI-MS/MS, it is wise to acquire data during the wash step (i.e., prior to
gradient elution), because some short phosphopeptides may not even bind to the C18 resin in
100% aqueous solutions. Otherwise, these very hydrophilic peptides will be missed (however,
see Figure 8.24 and accompanying text in Chapter 8).

The following protocol, supplied by Garry L. Corthals, Valerie C. Wasinger, and David R.
Goodlett (Geneva Proteomics Centre, Geneva University Hospital, Geneva, Switzerland;
University of New South Wales, Kensington, Australia; and Institute for Systems Biology,
Seattle, Washington, respectively), provides typical LC conditions for the separation of
IMAC-enriched phosphopeptides when the μLC system is on-line with the ESI-MS/MS.


MATERIALS


CAUTION: See Appendix 3 for appropriate handling of materials marked with <!>.


» Reagents

Solvent A
0.4% acetic acid <!>
0.005% aqueous heptafluorobutyric acid (HFBA) <!>
H2O

Solvent B
80% aqueous acetonitrile <!>
Only acetonitrile is used for Solvent B, as the addition of acid here is unnecessary for good
chromatographic separation of peptides (acid is used with systems that monitor separation via UV
detection to avoid a rise in the baseline with increasing acetonitrile). The addition of acid adds
substantially to chemical noise in the mass spectrometer.


» Equipment

HPLC system

Mass spectrometer capable of tandem MS analysis of phosphopeptides
Suitable mass spectrometers include a triple-quadrupole MS, an ion-trap MS, or a quadrupole-TOF-MS
(e.g., ThermoFinnigan, Micromass, Applied Biosystems, or Bruker Daltonics) (see Chapter 8).

Micro-ESI interface
A micro-ESI interface can be homemade (Goodlett et al. 2000b) or purchased (New Objective, Inc.,
Woburn, Massachusetts).

Micro-LC column
Choose a 50-μm (I.D.) microcapillary column for optimum sensitivity or a 100-μm (I.D.)
microcapillary column for optimum ruggedness, pressure packed, in either case, with, e.g., ODS-AQ
(Waters Corp.). Columns can be purchased from various vendors (e.g., LC Packings, The
Netherlands, or Michrom Bioresources or New Objective) or homemade (see Chapter 8, Protocols 4 and 5).


» Biological Sample

Peptides eluted from an IMAC column (Protocol 4, Step 15)


METHOD

1. Set up, wash, and equilibrate the μLC column according to the manufacturer's instructions.

   The column flow rate depends on the column used, but in general, ~200 nl/min and 500
   nl/min across 50 and 100 μm (I.D.), respectively, is achieved by restrictive flow splitting
   using a length (empirically determined) of capillary tubing set in a splitting tee before the
   capillary column. This setup allows normal-flow HPLC pumps, capable of operating
   between 0.5 and 4.0 ml/min, to be used for microcapillary column operation.

2. Load the peptide sample (from Protocol 4, Step 15) onto the μLC column.

3. Elute the peptides using a linear gradient from 0% to 60% Solvent B over 50 minutes. The
   eluting peptides from the column are introduced directly into the MS via a micro-ESI
   interface.

4. Peptides for CID are subsequently selected for fragmentation by the MS using automated
   data-dependent MS/MS procedures, optional in most MS control software packages.
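
The flow splitting in Step 1 and the linear gradient in Step 3 both reduce to simple arithmetic. The following Python sketch is illustrative only: the function names are invented here, and the example inputs (0.5 ml/min pump flow, 200 nl/min column flow, 0% to 60% Solvent B over 50 minutes) are the figures quoted in the steps above. In practice the split is set empirically by the length of the restrictor capillary.

```python
def split_ratio(pump_flow_ml_min, column_flow_nl_min):
    """Ratio of total pump flow to the flow that actually reaches the column."""
    pump_nl_min = pump_flow_ml_min * 1_000_000  # 1 ml = 1,000,000 nl
    return pump_nl_min / column_flow_nl_min


def percent_b(t_min, start=0.0, end=60.0, duration=50.0):
    """Percent Solvent B at time t for a linear gradient, clamped to the gradient window."""
    t = min(max(t_min, 0.0), duration)
    return start + (end - start) * t / duration


# Pump at its 0.5 ml/min lower limit feeding a 50-um column at ~200 nl/min
print(split_ratio(0.5, 200))
# Solvent B composition halfway through the 50-minute gradient
print(percent_b(25))
```

A split of this size means that nearly all of the pump output goes to waste, which is the price paid for using a normal-flow pump with a microcapillary column.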


In laboratories where a tandem mass spectrometer is not available for sequence analysis of
phosphopeptides by CID, phosphatase treatment (Zhang et al. 1998) of IMAC-enriched
peptide samples has been used in conjunction with MALDI-TOF analysis to identify the
phosphopeptides in the sample (see Protocol 2). The general steps are:

• Record the collective masses of all peptides enriched by micro-IMAC in the single-stage
  MALDI-TOF-MS.

• Treat another aliquot of the peptide sample with a broad-specificity phosphatase (e.g.,
  calf intestinal phosphatase) in solution. Alternatively, treat the sample still on the
  MALDI probe with the phosphatase.

• After the incubation time (which is determined by the properties of the
  phosphatase), record the masses of the treated protein sample.

• Compare MALDI-TOF mass spectra for the original untreated and phosphatase-treated
  samples to identify any mass that shows a decrease of 80 amu (or multiples thereof),
  implying loss of phosphate due to phosphatase treatment.

This experiment is technically simple, and the interpretation of the spectra is
straightforward due to the fact that MALDI-TOF-MS produces almost exclusively singly
protonated peptide ions.
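
The comparison in the last bullet can be automated with a short script. The sketch below is a minimal illustration, not part of the original protocol: the peak lists are hypothetical, the function name and the 0.3-dalton tolerance are assumptions, and the shift used is the monoisotopic mass of HPO3 (nominally 80).

```python
PHOSPHATE_SHIFT = 79.966  # HPO3, monoisotopic (Da); nominal value 80


def find_dephosphorylated(untreated, treated, max_sites=4, tol=0.3):
    """Pair peaks that drop by n x 80 Da after phosphatase treatment.

    Returns a list of (untreated_mass, treated_mass, n_phosphates).
    """
    hits = []
    for m in untreated:
        for n in range(1, max_sites + 1):
            expected = m - n * PHOSPHATE_SHIFT
            for t in treated:
                if abs(t - expected) <= tol:
                    hits.append((m, t, n))
    return hits


# Hypothetical singly protonated peak lists (m/z), for illustration only
untreated = [1200.6, 1660.8, 1951.9]
treated = [1200.6, 1580.8, 1872.0]

for before, after, n in find_dephosphorylated(untreated, treated):
    print(f"{before} -> {after}: {n} phosphate(s)")
```

Because MALDI produces mostly singly protonated ions, the mass shift can be read directly from m/z without any charge-state correction, which is why a simple subtraction suffices here.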


PROTOCOL 6 


Tyrosine Phosphorylation Site Identification by MALDI-MS 


THREE DIFFERENT STRATEGIES UTILIZING MS FOR THE IDENTIFICATION of protein phosphorylation
sites are employed in Protocols 6-8. In this first protocol, radiolabeled proteins are separated
using SDS-PAGE and digested in-gel. The phosphopeptides are subjected to two-dimensional
LC and identified by MALDI-MS and MALDI-PSD. The identification of eight tyrosine
phosphorylation sites in the human Gab-1 protein is provided as an example of the method's
utility. The expression and phosphorylation of human Gab-1 with the insulin receptor kinase
in vitro as described by Lehr et al. (2000) are summarized as an example of this approach to
identifying phosphorylated tyrosine residues.

This protocol was provided by Albert Sickmann and Helmut E. Meyer (Medical Proteom
Center, University of Bochum, Germany).


MATERIALS 


CAUTION: See Appendix 3 for appropriate handling of materials marked with <!>.


» Reagents
Acetonitrile <!>
Digestion buffer A (10 mM NH4HCO3 at pH 7.8) <!>
Digestion buffer B (10 mM NH4HCO3, 50% acetonitrile at pH 7.8)
Ion exchange buffer A (20 mM H3CCOONH4 at pH 7.0) <!>
Ion exchange buffer B (0.5 M KH2PO4 at pH 4.0) <!>


Matrix solution
The matrix solution is a saturated solution of α-cyano-4-hydroxy-cinnamic acid <!> in 0.1% TFA
<!>, 50% acetonitrile (v/v).


Protease solution (0.05 μg/μl trypsin in Digestion buffer A)
Use sequencing grade, modified trypsin (e.g., from Promega).


μRP-HPLC buffer A (0.1% TFA)
μRP-HPLC buffer B (0.08% TFA, 84% acetonitrile)

» Equipment
Bath sonicator, cooled to 4°C
Centrifugal evaporator (e.g., Speed Vac, Thermo Savant, or equivalent)
HPLC column (Nucleogel SAX 1000-8/46, 4.6 x 50 mm) (Macherey & Nagel, Germany)
HPLC system (e.g., Beckman Gold system)
MALDI-MS (e.g., Ultraflex, Bruker-Daltonic, Bremen, Germany)
MALDI target, stainless steel


Micro-HPLC pump (e.g., ABI 140D or Dionex Ultimate)
The micropump must be capable of generating a reproducible gradient at ≥20 μl/min.


Test tubes (borosilicate, Na+- and K+-free)
UV detector (e.g., Shimadzu SPD 10 A, Tokyo, Japan)
Zero dead volume T-split


» Biological Sample

Radiolabeled phosphorylated (as described by Lehr et al. 2000) protein(s) <!> of interest
within a polyacrylamide gel
Separate the phosphorylated protein by 10% SDS-PAGE (see Chapter 2, Protocol 1).


METHOD 


Digestion of the SDS-PAGE Separated Gab-1


1. Dissect gel pieces from the polyacrylamide gel that contain phosphorylated protein bands
   as revealed by autoradiography.

2. Dice the gel pieces into 1-mm³ pieces, and place them in test tubes.

   Use specific Na+- and K+-free test tubes for digestion; otherwise Na+ and K+ adducts will be
   found during mass spectrometry.

3. Wash the gel pieces in Digestion buffer A for 10 minutes at 37°C.

4. Wash the gel pieces in Digestion buffer B for 10 minutes at 37°C.

5. Repeat Steps 3 and 4 twice more.

6. Shrink the gel pieces at room temperature by addition of acetonitrile until they are white
   (~3 minutes).

7. Swell the gel pieces with protease solution for 20 minutes at 37°C.

   The temperature depends on the protease (e.g., Glu-C requires 25°C and trypsin requires
   37°C).

8. Cover the gel pieces with additional Digestion buffer A to prevent drying, and incubate
   the gel pieces overnight at 37°C.


Extraction of the Gel Pieces for Anion-exchange Chromatography


9. Add 20 μl of Digestion buffer A to the gel piece and incubate in a sonication bath for 20
   minutes at 4°C.

10. Collect the supernatant.

11. Repeat Steps 9 and 10 twice more.


Anion Exchange Chromatography 


12. Adjust the flow rate through the anion exchange column to 0.5 ml/min. 


13. Inject the combined supernatants onto the column. 


FIGURE 9.14. Anion exchange chromatography of phosphorylated human Gab-1 after tryptic digestion.
Fractions of 0.5 ml are collected and the radioactivity is measured and plotted against the retention time.
Fractions a-j are pooled for further chromatography.


14. Wash the column with Ion exchange buffer A for 20 minutes. 


15. Elute peptides from the column by beginning a gradient of Ion exchange buffer B from
    0% to 10% over the course of 40 minutes. Collect a 0.5-ml fraction every minute during
    the elution.


16. Increase the Ion exchange buffer B from 10% to 50% over the course of 75 minutes.


17. Measure the radioactivity in every fraction either using Cerenkov counting or by mea- 
suring the radioactivity of fraction aliquots amplified with scintillant. 


18. Plot the radioactivity of each fraction versus the fraction number (see Figure 9.14 for an 
example). 


RP-HPLC Chromatography of Radioactive Fractions 


19. Select a radioactive fraction from the anion exchange chromatography eluent. 

20. Reduce the fraction volume from 500 μl to 80 μl in a centrifugal evaporator.

21. Adjust the μ-HPLC pump flow rate to 70 μl/min.

22. Adjust the μ-HPLC column flow rate to 16 μl/min using a zero dead volume T-split.

23. Inject the sample into the 80-μl sample loop.

24. Wash the column with 95% RP-HPLC buffer A for 30 minutes. 


25. Adjust the UV detector to zero. Begin UV detection of the RP-HPLC eluent at 215 nm 
and 295 nm with a photometer sampling rate of 2 Hz. 


26. Begin a gradient of RP-HPLC buffer B from 5% to 50% over the course of 90 minutes.

27. Collect 8-16-μl fractions from the column eluate.

    Fraction volume will depend on the size of the peak.

28. Transfer 0.5 μl of each fraction to a vial for measurement of radioactivity.

29. Freeze the fractions immediately at -70°C. A typical RP-HPLC chromatogram is shown
    in Figure 9.15.


MALDI-MS Analysis of Fractions Containing Radioactivity 


30. Select a radioactive fraction from the RP-HPLC, and apply 0.3 μl of the fraction to a
    stainless steel MALDI target.

31. Apply 0.3 μl of matrix solution to the droplet. Allow the sample/matrix to air dry
    (~1 minute).

32. Record a MALDI-MS spectrum in linear and reflector mode (see Figure 9.16).

33. Compare the experimentally determined peptide masses obtained from a MALDI-MS
    spectrum with in silico masses obtained from a theoretical digest (e.g., GPMAW,
    Protein-Prospector) of the phosphorylated protein. Take into consideration:

    • methionine oxidation
    • amino-terminal acetylation
    • phosphorylation of S, T, and Y (in this example, Y phosphorylation)
    • generation of pyroglutamic acid at XXRQXX and XXKQXX
    • incomplete digest

34. Select probable phosphopeptides to be analyzed by MALDI-PSD. Four different
    MALDI-PSD spectra of phosphotyrosine peptides are shown in Figures 9.17 and 9.18. The
    common fragmentation pattern of peptides is shown in Figure 9.19 (Biemann 1992).
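
The theoretical digest in Step 33 is normally produced with a dedicated program such as those named there, but the underlying calculation is simple enough to sketch. The Python below is a minimal illustration under stated assumptions: the test protein is hypothetical (only the middle peptide, QVEYLDLDLDSGK, is taken from Table 9.3; the flanking residues are invented), the function names are made up for this sketch, and the residue and modification masses are standard monoisotopic values rather than values from this chapter.

```python
import re

# Monoisotopic residue masses (Da); standard values.
MONO = {
    "G": 57.02146, "A": 71.03711, "S": 87.03203, "P": 97.05276, "V": 99.06841,
    "T": 101.04768, "C": 103.00919, "L": 113.08406, "I": 113.08406, "N": 114.04293,
    "D": 115.02694, "Q": 128.05858, "K": 128.09496, "E": 129.04259, "M": 131.04049,
    "H": 137.05891, "F": 147.06841, "R": 156.10111, "Y": 163.06333, "W": 186.07931,
}
WATER, PROTON = 18.01056, 1.00728
# Mass shifts for the modifications listed in Step 33.
MODS = {"phospho": 79.96633, "oxidation": 15.99491, "acetyl": 42.01057, "pyroglu": -17.02655}


def tryptic_peptides(protein, missed=1):
    """Cleave after K or R (but not before P), allowing up to `missed` missed cleavages."""
    pieces = [p for p in re.split(r"(?<=[KR])(?!P)", protein) if p]
    peptides = []
    for i in range(len(pieces)):
        last = min(i + missed, len(pieces) - 1)
        for j in range(i, last + 1):
            peptides.append("".join(pieces[i:j + 1]))
    return peptides


def mh(peptide, **mods):
    """Monoisotopic [M+H]+ of a peptide; mods maps a MODS key to a count."""
    mass = sum(MONO[aa] for aa in peptide) + WATER + PROTON
    return mass + sum(MODS[name] * count for name, count in mods.items())


# Hypothetical test sequence: flanking residues are invented for illustration.
protein = "SAMPLERQVEYLDLDLDSGKTESTK"
for pep in tryptic_peptides(protein):
    print(pep, round(mh(pep), 2), round(mh(pep, phospho=1), 2))
```

With one phosphate, the calculated [M+H]+ for QVEYLDLDLDSGK agrees with the 1574.7 listed for that peptide in Table 9.3. Incomplete digestion is covered by the `missed` argument, and the other modifications in Step 33 can be tried by passing, for example, `oxidation=1` or `pyroglu=1`. The split on a zero-width pattern requires Python 3.7 or later.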


PROTEOMIC METHODS FOR PHOSPHORYLATION SITE MAPPING m 647 



FIGURE 9.15. Reversed-phase chromatography of radioactive fractions derived from anion exchange chro- 
matography (see Figure 9.14). The UV trace at 215 nm (peptide backbone) is plotted against the retention 
time. Fractions containing radioactivity are marked with an asterisk and subjected to mass spectrometry. 


FIGURE 9.16. MALDI-MS spectra of four different phosphopeptide-containing fractions (see Figure 9.15). In fractions a66 and
b26, additional measurements in the linear mode are necessary because no potential phosphopeptides are detected in the reflector
mode. Especially the large peptides of >4000 daltons can only be detected in the linear mode due to several proline residues in the
primary structure of the peptides. Fragment ion spectra of additional peptides are shown in Figures 9.17 and 9.18.


TABLE 9.3. Sequence analysis results of hGab-1 phosphopeptides


Amino acid sequence of                         Amino acids  Amino
identified phosphopeptides         [M+H]+ (a)  in hGab-1    acids  Fraction (b)         32P (%) (c)
HGMNGFFQQQMIYpDSPPSR               2320.0      230-248      Y 242  b26, d18             1.8
VSPSSTEADGELYpVENTPSGTSSVETQMR     3156.4      273-301      Y 285  f29                  1.7
TASDTDSSYpCIPTAGMSPSR              2197.8      365-384      Y 373  b6                   2.7
NVITVGSVSSEELDENYpPMNPNSPPR        3024.4      431-457      Y 447  h12, g19, c59        27.6
QHSSSETEPIQEANYpVPMTPGTEDFSSFGM    4367.0      458-498      Y 472  a63, a66-67,         30.7
  QVPPPAHMGFR                                                      i35-38
FPMSPRPDSVHSTTSSSDSHDSEENYpVPM     6186.6      594-648      Y 619  b26, e25-26,         17.8
  NPNLSSEDPNLFGSNSLDGGSSPMIKR                                      h24-26, i24
QVEYpLDLDLDSGK                     1574.7      654-666      Y 657  j17-18               6.1
SSGSGSSVADERVDYpVVVDQQK            2392.1      675-696      Y 689  d18                  11.6


(a) Monoisotopic mass of the singly charged ion.
(b) Fraction after anion exchange chromatography (letters) and RP-HPLC (numbers) (see Figures 9.15 and 9.16).
(c) Total amount of radioactivity incorporated into the phosphopeptides corresponds to 100%.
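
The 32P (%) column of Table 9.3 expresses each site's radioactivity as a share of the total incorporated label. A minimal Python sketch of that normalization follows; the site labels and percentages are transcribed from the table, while the function name and the raw counts in the usage line are hypothetical and serve only to illustrate the calculation.

```python
# 32P (%) values for each phosphorylation site, transcribed from Table 9.3
p32_percent = {"Y242": 1.8, "Y285": 1.7, "Y373": 2.7, "Y447": 27.6,
               "Y472": 30.7, "Y619": 17.8, "Y657": 6.1, "Y689": 11.6}


def percent_of_total(counts):
    """Convert raw counts per site into percent of total incorporated radioactivity."""
    total = sum(counts.values())
    return {site: round(100.0 * value / total, 1) for site, value in counts.items()}


# The tabulated percentages should account for all of the incorporated label.
total = round(sum(p32_percent.values()), 1)
ranked = sorted(p32_percent, key=p32_percent.get, reverse=True)
print(total, ranked[:3])

# Hypothetical raw counts (cpm), for illustration only
print(percent_of_total({"site A": 250, "site B": 750}))
```

Summing the column is a quick consistency check on a transcribed table, and ranking the sites shows at a glance that Y472, Y447, and Y619 carry most of the label.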


FIGURE 9.17. Three different MALDI-PSD spectra of phosphotyrosine-containing peptides. In each of these spectra, the
phosphorylated tyrosine residue is unambiguously identified by b and/or y ions. In the case of the large peptide in the middle of the
figure, peptide signals can be detected only in the linear mode. The ion selector was set on the mass 4367.0 (see Table 9.3). The
MALDI-PSD spectrum is recorded in the reflector mode. This shows that PSD spectra can be recorded even when no signal is
achieved in the reflector mode.

FIGURE 9.18. MALDI-PSD spectrum of a very large phosphopeptide. All major signals in this spectrum
are derived from internal cleavages and only some a and b ions are generated. Together with Edman
degradation (Edman 1949), the phosphorylation can be localized at the tyrosine residue (see Table 9.3).


PROTOCOL 7


Serine/Threonine Phosphorylation Site
Identification by ESI-MS


MOST OF THE PROTEIN PHOSPHORYLATION WITHIN CELLS is mediated by the activity of
independent kinases. However, autophosphorylation of proteins is another way signal
transduction pathways are regulated. This protocol is designed to distinguish between
autophosphorylation and endogenous phosphorylation. For the localization of all endogenous
phosphorylation sites, the protein is digested with trypsin and the peptides are analyzed by on-line
coupling of nano-HPLC and ESI-MS. To determine the autophosphorylation sites, the
protein is incubated with radioactive ATP prior to digestion with trypsin. Only 20 pmoles of
protein is used for each experiment. The stoichiometry of phosphorylated protein is below 5%
of the total protein. Prior to analysis, the protein is separated by 10% SDS-PAGE and the
protein band is excised. Before digestion, the protein is reduced and alkylated in both cases. This
protocol was provided by Albert Sickmann and Helmut E. Meyer (Medical Proteom Center,
University of Bochum, Germany).


MATERIALS 


CAUTION: See Appendix 3 for appropriate handling of materials marked with <!>. 


» Reagents 
Acetonitrile <!> 
Alkylation solution 
40 mM iodoacetamide <!> 
0.1 M Tris-Cl (pH 8.2) 
Digestion buffer A (10 mM NH4HCO3 at pH 7.8) <!>
Digestion buffer B (10 mM NH4HCO3, 50% acetonitrile at pH 7.8)
Formic acid (5%) <!>
β-Mercaptoethanol <!>
Nano-RP-HPLC buffer A (0.1% formic acid)
Nano-RP-HPLC buffer B (0.1% formic acid, 84% acetonitrile)


Protease solution (0.05 μg/μl trypsin in Digestion buffer A)
Use sequencing grade, modified trypsin (e.g., from Promega).


Reduction solution
80 mM dithiothreitol (DTT) <!>
0.5 mM guanidinium-HCl <!>
0.8 mM EDTA
0.1 M Tris-Cl (pH 8.2)
Trifluoroacetic acid (TFA) (0.1%) <!>


652 a CHAPTER 9 


» Equipment

Bath sonicator, cooled to 4°C
HPLC pump (e.g., ABI 140D, Perkin Elmer)
HPLC system (e.g., Beckman Gold system)
Mass spectrometer, ESI-MS with a nanospray ion source (e.g., LCQ, ThermoFinnigan MAT)
Nano-HPLC column (NAN 75-25-05-PM 75 μm x 250 mm) (LC-Packings, Amsterdam, The
Netherlands)

Precolumn (0.5 x 5 mm)
The precolumn and separation column are packed with the same reversed-phase resin (see Tables
5.1 and 5.3).

Precolumn splitter
SEQUEST software
Test tubes (borosilicate, Na+- and K+-free)


» Biological Molecules

Radiolabeled phosphorylated protein(s) <!> of interest contained within polyacrylamide gel
piece(s)
Radiolabel the proteins as described by Suer et al. (2001). Separate the phosphorylated proteins by
10% SDS-PAGE (see Chapter 2, Protocol 1). Excise the gel piece containing the protein of interest
as revealed by autoradiography.


METHOD


Reduction, Alkylation, and Digestion of the Phosphoprotein 


1. Wash the gel piece in Digestion buffer A in Na+/K+-free test tubes for 10 minutes at 37°C.

2. Cover the gel piece with reduction solution for 15 minutes at 37°C.

3. Remove the reduction solution, and cover the gel piece with alkylation solution for 15
   minutes at 37°C in the dark.

   Avoid longer incubation times, because methionine is alkylated by iodoacetamide
   (Sickmann et al. 2000).

4. Add 5 μl of β-mercaptoethanol to the alkylation solution covering the gel piece. Incubate
   for 5 minutes more at 37°C.

5. Remove the liquid from the container, and wash the gel piece in Digestion buffer A for 10
   minutes at 37°C.

6. Discard Digestion buffer A, and wash the gel piece in Digestion buffer B for 10 minutes
   at 37°C.

7. Repeat Steps 5 and 6 twice more.

8. Shrink the gel piece by addition of acetonitrile until it is white (~3 minutes).

9. Re-swell the gel piece with protease solution for 20 minutes at 37°C.

10. Cover the gel piece with additional Digestion buffer A to prevent drying.

11. Incubate the digest overnight at 37°C.



FIGURE 9.20. On-line sample concentration prior to nano-LC-MS. (Left) Sample concentration. The sample is injected into the
sample loop of valve A and concentrated on the precolumn. Valve B is set to the 2-3 position (waste). Concentration of the
sample is performed at a flow rate of 60 μl/min for 10 minutes. The volume of all capillaries and the precolumn is <2 μl (sample loop:
additional 30 μl). (Right) Separation of the sample. Valve A is in position 2-3 (no loop) and valve B in position 1-4 (nano-column
in the flow). The flow rate is reduced from 60 μl/min to 100-200 nl/min.


Extraction of the Gel Piece for Nano-HPLC Separation 
12. Discard the digest buffer surrounding the gel piece. Cover the gel piece with 15 μl of 5%
formic acid, and incubate it for 20 minutes in a cooled sonication bath.
13. Collect the liquid from around the gel. This contains the digested peptides. 
14. Repeat Steps 12 and 13. 


Precolumn Concentration of Peptides during Nano-HPLC 


Commonly, nano-HPLC is done with flow rates <0.3 μl/min. The peptide extraction
yields a volume of ~30 μl, which is much too large for direct injection in nano-HPLC. Therefore,
a special peptide concentration step as shown in Figure 9.20 is necessary. The major advantage of
this method is the on-line concentration of the sample, resulting in a low injection volume.
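
The need for the precolumn step follows from simple arithmetic on volume and flow rate. The Python sketch below is illustrative only: the function name is invented here, and the inputs are the ~30-μl extract volume, the 0.3 μl/min nano-flow limit, and the 60 μl/min loading flow quoted in this protocol.

```python
def loading_time_min(volume_ul, flow_ul_min):
    """Minutes needed to push a given sample volume through at a given flow rate."""
    return volume_ul / flow_ul_min


extract_ul = 30.0  # approximate volume recovered from the gel extraction

direct = loading_time_min(extract_ul, 0.3)      # loading at nano-HPLC flow
precolumn = loading_time_min(extract_ul, 60.0)  # loading onto the precolumn

print(f"direct: {direct:.0f} min, via precolumn: {precolumn:.1f} min")
```

Loading the whole extract at nano-flow would take well over an hour before the gradient could even start, whereas the precolumn captures it in under a minute and leaves the rest of the 10-minute concentration step for washing.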


15. Inject the sample into the sample loop. Concentrate the sample onto the μC18 precolumn
using 0.1% TFA with a flow rate of 60 μl/min for 10 minutes.


16. After concentration, move valve B into the 1-4 position into the pump flow.
17. Elute the peptides as follows:
a. Adjust the pump flow to 70 μl/min (if using the ABI 140D system).
b. Adjust the column flow to 100-200 nl/min with a precolumn splitter.
c. Equilibrate the 75-μm I.D. column with Nano-RP-HPLC buffer A for 10 minutes.
d. Elute the peptides with a gradient of Nano-RP-HPLC buffer B from 5% to 50% over
90 minutes.


LC-MS/MS Analysis
18. Load the peptides onto an ESI mass spectrometer using a nanospray ion source and a
micro-ESI interface.


19. Record the MS spectra with a triple-scan event (full scan, first dependent MS/MS scan, 
second dependent MS/MS scan) with a scan duration of 2 seconds for each event. 


Automatic Data Interpretation with the SEQUEST Algorithm
20. Automatically process the LC-MS/MS raw data file using the SEQUEST algorithm.


Using this protocol to study the phosphorylation of human PI4K92 protein, the first step
generated nearly 800 DTA files from the raw data file (settings: grouped scans: 2; minimal
ion intensity: 1 E-5; mass range: 300-3500 daltons; charge: single-, double-, or
triple-charged ion). The search is done against the current NCBI database (settings: enzyme:
trypsin KR/P; missed cleavage sites: 1; modification: C+57 daltons; differential
modification: M+16; STY+80 daltons).

In Figure 9.21, the total ion current (TIC, i.e., the intensity of all ions in each scan
plotted against the retention time) and the BASE PEAK chromatogram (BASE, i.e., the
intensity of the base peak in each scan plotted against the retention time) are shown. The base
peak chromatogram is comparable to the UV trace of a photometer. Some automatically
interpreted MS/MS spectra of the phosphopeptides RRLSpEQLAHTPTAFK and
TASpNPKVENEDEPVR are shown in Figure 9.22.
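
The search settings above (a fixed +57 on cysteine, with +16 on methionine and +80 on S, T, or Y as differential modifications) imply a small set of candidate masses and candidate sites per peptide. The Python sketch below illustrates that bookkeeping only; it is not the SEQUEST algorithm and makes no use of its interface. The function names are invented, and the residue and modification masses are standard monoisotopic values rather than values from this chapter.

```python
from itertools import combinations

# Monoisotopic residue masses (Da); standard values.
MONO = {
    "G": 57.02146, "A": 71.03711, "S": 87.03203, "P": 97.05276, "V": 99.06841,
    "T": 101.04768, "C": 103.00919, "L": 113.08406, "I": 113.08406, "N": 114.04293,
    "D": 115.02694, "Q": 128.05858, "K": 128.09496, "E": 129.04259, "M": 131.04049,
    "H": 137.05891, "F": 147.06841, "R": 156.10111, "Y": 163.06333, "W": 186.07931,
}
WATER, PROTON = 18.01056, 1.00728
PHOSPHO, CARBAMIDOMETHYL, OXIDATION = 79.96633, 57.02146, 15.99491


def neutral_mass(peptide, n_phospho=0, n_ox=0):
    """Neutral monoisotopic mass with Cys +57 fixed and optional +80 / +16 shifts."""
    mass = sum(MONO[aa] for aa in peptide) + WATER
    mass += CARBAMIDOMETHYL * peptide.count("C")  # static modification on every Cys
    return mass + PHOSPHO * n_phospho + OXIDATION * n_ox


def mz(peptide, charge, n_phospho=0, n_ox=0):
    """m/z of the peptide at the given positive charge state."""
    return (neutral_mass(peptide, n_phospho, n_ox) + charge * PROTON) / charge


def phospho_site_candidates(peptide, n_phospho):
    """All ways of placing n_phospho phosphates on S/T/Y (1-based positions)."""
    sites = [i + 1 for i, aa in enumerate(peptide) if aa in "STY"]
    return list(combinations(sites, n_phospho))


peptide = "RRLSEQLAHTPTAFK"  # unmodified sequence of RRLSpEQLAHTPTAFK
print(phospho_site_candidates(peptide, 1))
for z in (1, 2, 3):
    print(z, round(mz(peptide, z, n_phospho=1), 2))
```

For this peptide a single phosphate could sit on any of three residues, and it is the fragment ions in the MS/MS spectrum that distinguish among them. The three charge states correspond to the single-, double-, and triple-charged ions allowed in the settings above.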


FIGURE 9.21. Nano-LC-MS/MS analysis of PI4K92 phosphopeptides. (Top) Total ion chromatogram
(TIC). The intensity of all ions in every mass spectrum is plotted against the retention time. (Bottom) Base
peak chromatogram. The intensity of the base peak in every mass spectrum is plotted against the retention
time. This chromatogram is comparable to a UV trace of an HPLC separation, and signals of several peptides
with different retention times can be observed.

FIGURE 9.22. ESI-MS/MS spectra of two different phosphopeptides derived from human PI4K92 (see
Figure 9.21). Surprisingly, no remarkable loss of 80 or 98 daltons is observed in the spectrum of the peptide
TASpNPKVENEDEPVR. The phosphorylation site can be localized to the serine residue. The common
behavior of phosphopeptides is demonstrated by the peptide RRLSpEQLAHTPTAFK. Several cleavage
products are indicated by asterisks (*).
ADDITIONAL PROTOCOL: IDENTIFICATION OF AUTOPHOSPHORYLATION SITES USING RADIOACTIVE PHOSPHATE


To localize an autophosphorylation site (in this case, of the human PI4K92 protein), incubate the 
protein with radioactive ATP <!>. 


Additional Materials 
CAUTION: See Appendix 3 for appropriate handling of materials marked with <!>. 


µ-HPLC system (e.g., ABI 140D pump and Dionex Ultimate HPLC system)
MALDI mass spectrometer

µRP-HPLC buffer A (0.1% TFA) <!>

RP-HPLC buffer B (0.08% TFA, 84% acetonitrile) <!>

RP-HPLC column (0.18 x 150 mm; C18; 3- or 5-µm particle size)

Step 1 requires the reagents and equipment listed in Chapter 2, Protocol 1. 


Method
1. Separate the reaction mixture by 10% SDS-PAGE.
2. Reduce, alkylate, and then digest the protein within the gel piece as described in the main protocol.


3. Extract the peptides twice with 10 µl of 1% TFA solution in a sonication bath for 20 minutes at 4°C.


4. Inject the digested peptides into the sample loop of a µ-HPLC system.
5. Separate the peptides as follows:
a. Adjust the µ-HPLC pump flow to 70 µl/min.
b. Adjust the µ-HPLC column flow to 1.5 µl/min using a zero-dead-volume T-split.
c. Inject the sample into the 20-µl sample loop.
d. Wash the column with 95% µRP-HPLC buffer A for 30 minutes.


a photometer sampling rate of 2 Hz. 


f. Elute the peptides with a gradient of RP-HPLC buffer B from 5% to 50% over 90 minutes.


6. Collect 1-1.5-µl fractions of the column eluate, and immediately freeze them at -70°C.
7. Assay for radioactivity in each of the fractions, using the entire sample. 


Prompt freezing of the samples reduces peptide loss by adsorption to the surface of the 
collection tubes. 


8. Analyze the radioactive fractions by MALDI-MS as described in Protocol 6, Steps 31-34. 
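
The flow and gradient settings in Step 5 imply a split ratio and a gradient slope that are easy to check before a run. The following Python lines are a minimal sketch and not part of the original protocol; the function names are hypothetical, and a linear gradient is assumed.

```python
def split_ratio(pump_flow_ul_min, column_flow_ul_min):
    """Pump-to-column flow ratio produced by the T-split."""
    return pump_flow_ul_min / column_flow_ul_min


def percent_b(t_min, start=5.0, end=50.0, duration_min=90.0):
    """Percentage of buffer B at time t_min, assuming a linear gradient."""
    t = min(max(t_min, 0.0), duration_min)  # clamp to the gradient window
    return start + (end - start) * t / duration_min


if __name__ == "__main__":
    print(round(split_ratio(70.0, 1.5), 1))   # pump 70 ul/min, column 1.5 ul/min
    print(percent_b(45.0))                    # buffer B halfway through the gradient
```

With the values given in Step 5, the pump-to-column split ratio is roughly 47:1, and buffer B rises by 0.5% per minute.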


Figure 9.23 shows the linear and reflector mode spectra together with the recorded PSD spectra. Typical for
serine and threonine phosphate is the side-chain cleavage of the phosphate group during PSD.
A summary of all identified phosphorylation sites in the human PI4K92 protein is given in Table 9.4.
TABLE 9.4. Sequence analysis results of PI4K92 phosphopeptides

Amino acid sequence of the                               Kind of
identified phosphopeptides    [M+H]+    Amino acid       phosphorylation
ELPSLSPAPDTGLSPSKR            1931.9    T 263            autophosphorylation
ELPSLSPAPDTGLSPSKR            1931.9    S 266            endogenous
ELPSLSPAPDTGLSPSKR            1931.9    S 258            endogenous
SKSDATASISLSSNLKR             1844.9    S 227            endogenous
TASNPKVENEDEPVR               1764.7    S 294            endogenous
SVENLPECGITHEQR               1848.8    T 423            autophosphorylation
RRLSEQLHTPTAFK                1763.8    S 496            endogenous
LSEQLHTPTAFK                  1451.6    T 504            endogenous
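
The [M+H]+ values in Table 9.4 can be cross-checked by summing monoisotopic residue masses. The sketch below is an illustration rather than part of the protocol. It assumes monoisotopic masses, one phosphate group (+79.966 daltons) per peptide, and carbamidomethylated cysteine (+57.021 daltons, corresponding to the C+57 search setting); the function name mh_plus is hypothetical.

```python
# Monoisotopic residue masses (daltons) of the 20 standard amino acids.
MONO = {
    "G": 57.02146, "A": 71.03711, "S": 87.03203, "P": 97.05276,
    "V": 99.06841, "T": 101.04768, "C": 103.00919, "L": 113.08406,
    "I": 113.08406, "N": 114.04293, "D": 115.02694, "Q": 128.05858,
    "K": 128.09496, "E": 129.04259, "M": 131.04049, "H": 137.05891,
    "F": 147.06841, "R": 156.10111, "Y": 163.06333, "W": 186.07931,
}
WATER = 18.01056            # added once per peptide (free termini)
PROTON = 1.00728            # charge carrier for [M+H]+
PHOSPHO = 79.96633          # HPO3 added per phosphorylated residue
CARBAMIDOMETHYL = 57.02146  # alkylated cysteine (assumed here)


def mh_plus(sequence, n_phospho=0, alkylated_cys=True):
    """Monoisotopic [M+H]+ of a peptide with optional modifications."""
    mass = sum(MONO[aa] for aa in sequence) + WATER + PROTON
    mass += n_phospho * PHOSPHO
    if alkylated_cys:
        mass += sequence.count("C") * CARBAMIDOMETHYL
    return mass


if __name__ == "__main__":
    for seq in ("ELPSLSPAPDTGLSPSKR", "TASNPKVENEDEPVR", "SVENLPECGITHEQR"):
        print(seq, round(mh_plus(seq, n_phospho=1), 2))
```

Under these assumptions, the calculated values agree with the tabulated masses to within about 0.1 dalton.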
FIGURE 9.23. MALDI-MS analysis of the PI4K92 autophosphorylation sites. After radioactive labeling, digestion, and µLC separation
of the peptides, these spectra are recorded using radioactive HPLC fractions. (Top) MALDI-MS spectra in reflector and linear
mode. Phosphoserine and phosphothreonine undergo specific cleavage in the side chain of the phosphorylated amino acid,
which can only be detected in the reflector mode. The linear spectrum does not show these products because they are generated
after the ion acceleration. Therefore, the intact peptide and all products reach the ion detector at the same time. (Bottom) MALDI-
PSD spectra of the ions with 1848.8 daltons and 1931.9 daltons. The specific cleavage products are marked with an asterisk (*).


PROTOCOL 8 


Histidine Phosphorylation Site Identification by ESI-MS 


HISTIDINE PHOSPHATE IS MODERATELY STABLE under alkaline conditions; however, the half-life
of this phospho-amino acid under acidic conditions (pH <5) is less than 20 minutes. Thus,
it is better to process proteins containing phosphohistidine rapidly and under alkaline conditions.
The use of acidic matrices, such as α-cyano-4-hydroxycinnamic acid, for MALDI-MS
prevents the detection of phosphohistidine. An alkaline MALDI matrix is a poor alternative
because it often decreases the sensitivity of detection of phosphorylated amino acids.
Surprisingly, phosphohistidine is stable under acidic conditions when stored on C18 column
resins. Therefore, HPLC separation on C18 resin with an acidic solvent system combined with
ESI-MS is possible. This protocol was provided by Albert Sickmann and Helmut E. Meyer
(Institut für Physiologische Chemie der Ruhr-Universität Bochum, Germany).
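
To see why rapid processing matters, the quoted half-life can be turned into an estimate of how much phosphohistidine survives a given exposure to acid. This is a rough sketch that assumes simple first-order decay, which the text does not state explicitly; because the half-life is given only as "less than 20 minutes," the result is an upper bound. The function name is hypothetical.

```python
def fraction_remaining(t_min, half_life_min=20.0):
    """Fraction of phosphohistidine left after t_min minutes,
    assuming first-order decay with the given half-life."""
    return 0.5 ** (t_min / half_life_min)


if __name__ == "__main__":
    for minutes in (10, 20, 60, 120):
        print(minutes, round(fraction_remaining(minutes), 3))
```

On this assumption, no more than one-eighth of the phosphohistidine would remain after 1 hour at pH <5.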


MATERIALS 
CAUTION: See Appendix 3 for appropriate handling of materials marked with <!>.
Reagents
Nano-RP-HPLC buffer A (0.1% formic acid) <!>
Nano-RP-HPLC buffer B (0.1% formic acid, 84% acetonitrile) <!>
NH4HCO3 (50 mM, pH 7.8) <!> with 5% trypsin
Trifluoroacetic acid (TFA) (0.1%) <!>
Equipment


µC18 precolumn

HPLC system (e.g., ABI 140D, Dionex Ultimate) 

Mass spectrometer, ESI-MS with a nanospray ion source (e.g., LCQ, ThermoFinnigan MAT)

Nano-HPLC column (NAN 75-25-05-PM 75 µm x 250 mm) (LC-Packings, Amsterdam, The
Netherlands) 

Precolumn splitter 

SEQUEST software 

UV detector (e.g., Shimadzu SPD 10 A, Tokyo, Japan) 


Biological Sample
Phosphoprotein mixture of interest 




METHOD 
Digestion and Concentration of Peptides


1. Digest the protein mixture in 50 mM NH4HCO3 (pH 7.8) with 5% trypsin for 4 hours at 37°C.

2. Inject the digested peptides into the sample loop (see Figure 9.20).

3. Concentrate the sample onto the µC18 precolumn using 0.1% TFA with a flow rate of 60 µl/min for 10 minutes.

4. After concentration, move valve B into the 1-4 position into the pump flow.

5. Elute the peptides as follows:
a. Adjust the pump flow to 70 µl/min (if using the ABI 140D system).
b. Adjust the column flow to 100-200 nl/min with a precolumn splitter.
c. Equilibrate the 75-µm I.D. column for 10 minutes with Nano-RP-HPLC buffer A.
d. Elute the peptides with a gradient of Nano-RP-HPLC buffer B from 5% to 50% over 90 minutes.

6. Load the peptides onto an ESI mass spectrometer using a nanospray ion source.

7. Record the MS spectra with a triple scan event (full scan, first dependent MS/MS scan, second dependent MS/MS scan) with a scan duration of 2 seconds for each event.


Automatic Data Interpretation with the SEQUEST Algorithm


8. Automatically process the LC-MS/MS raw data file using the SEQUEST algorithm. 


The search parameters are explained in the SEQUEST user's guide.
Figure 9.24 shows the mass spectra of HPr before digestion. The raw ESI mass spectrum in
panel a and the deconvoluted ESI mass spectrum of HPr in panel b show a mass shift of 80
daltons, which indicates monophosphorylated HPr. Finally, Figure 9.25 shows the MS/MS
spectrum of the phosphohistidine peptide MEQNSYVHDETGIHpAR. The database search
is done against the NCBI database and leads to this histidine-phosphorylated peptide.
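
The relationship between the raw multiply charged ESI spectrum and the deconvoluted masses can be sketched in a few lines. The code below is an illustration only and does not reproduce the instrument software. It assumes that every charge is carried by a proton, and it uses the HPr masses reported for Figure 9.24 (9494 and 9574 daltons); the function names are hypothetical.

```python
PROTON = 1.00728  # mass of a proton in daltons


def mz(mass, z):
    """m/z of a species of neutral mass `mass` carrying z protons."""
    return (mass + z * PROTON) / z


def charge_of_higher_peak(mz_low, mz_high):
    """Charge z of the higher-m/z peak, given two adjacent peaks of the
    same species (the lower-m/z peak then carries z + 1 charges)."""
    return round((mz_low - PROTON) / (mz_high - mz_low))


def neutral_mass(mz_value, z):
    """Neutral mass recovered from one peak and its charge state."""
    return z * (mz_value - PROTON)


if __name__ == "__main__":
    # Expected peak positions for charge states 4 to 11.
    for z in range(4, 12):
        print(z, round(mz(9494.0, z), 2), round(mz(9574.0, z), 2))
```

In the raw spectrum, the 80-dalton phosphate shift therefore appears as a shift of 80/z in m/z, for example 10 m/z units at charge state 8.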
FIGURE 9.24. Nanospray-ESI spectrum of HPr before digestion. Approximately 5 pmoles of freshly phosphorylated HPr is used
for low-flow nano-ESI-MS. The charge states 4 to 11 can be observed in the spectra. After deconvolution of the spectrum in a, the
mass (9494 daltons) for HPr and the mass (9574 daltons) for phosphorylated HPr were calculated. Approximately 25% of the HPr
is not phosphorylated.
FIGURE 9.25. ESI-MS/MS spectrum of the phosphopeptide MEQNSYVHDETGIHpAR. Localization of the
phosphorylated histidine residue is easily done. The spectrum is recorded ~5 hours later than the spectrum
in Figure 9.24, panel a. Due to the unstable phosphohistidine residue, ~5% of the HPr is phosphorylated at
this point in time.
ONE OF THE REVELATIONS OF THE HUMAN GENOME PROJECT, thus far, is that the genome harbors
only ~30,000 genes (Lander et al. 2001; Venter et al. 2001). However, it should be noted
that researchers disagree on the precise number of protein-encoding genes (Hogenesch et al.
2001), with estimates being as high as 60,000 genes. Regardless of the final number of genes
found in the human genome, the number of proteins encoded is likely to be substantially 
larger than its set of genes due to posttranslational modifications (see Chapter 9) and alter- 
nate splicing, which is reported to be highly frequent in humans (Mironov et al. 1999; Black 
2000). It is believed that alternative splicing, which occurs in nearly all metazoans and is espe- 
cially common in vertebrates, provides an economical means for producing functionally 
diverse polypeptides from a single gene (Lopez 1998; Black 2000; Smith and Valcarcel 2000). 
By aligning expressed sequence tag (EST) sequences and mapping the resultant mRNA fam- 
ilies, it has been estimated that ~35% of human genes yield variably spliced protein products 
(Croft et al. 2000). Because the published ESTs derive from a limited number of tissues, this 
value may be an underestimate, and the 30,000 genes could easily produce several hundred 
thousand different proteins. 


ALTERNATIVE MRNA SPLICING HAS SPECIFIC EFFECTS ON PROTEIN PRODUCT ACTIVITY 


In the human genome, there are many examples of multiple mRNA transcripts being pro- 
duced from a single gene (e.g., the n-cadherins, calcium-activated potassium channels, and 
neurexins). Altered mRNA splicing has been shown to have very specific effects on the activity
of a protein product. For example, alternative splicing has been shown to influence the ligand
binding of growth factors and cell adhesion molecules, determine the subcellular localization
of the encoded protein, determine the phosphorylation status of the encoded protein,
and affect the activation domains of transcription factors (Lopez 1998; Black 2000; Smith and 
Valcarcel 2000). 

At present, the rules governing splice site choice are poorly understood, which makes it 
difficult to recognize exons and predict splicing patterns within the genome sequence with 
any degree of confidence. For example, functional splice sites often do not fully match the 
recognized consensus sequences. Conversely, there are many splice-site sequence motifs in
the genome that are not recognized by the splicing apparatus. 


PHYSIOLOGICAL COMPLEXITY IS REPRESENTED BY THE INTERACTOME 


For the reasons discussed above, it is not surprising that the physiological complexity of
organisms is not merely a consequence of gene numbers (e.g., humans have fewer than twice as
many genes as the 959-cell nematode Caenorhabditis elegans). Rather, evolution of the
increased physiological complexity of higher-order organisms is undoubtedly due to a number
of other mechanisms that include alternative splicing, diversification of gene regulatory
networks, and the ability of intracellular signaling pathways to interact with one another. 
Indeed, it is becoming increasingly apparent that biological signaling pathways can interact 
with one another to form complex networks comprising a large number of components. Such 
complexity arises from the overlapping functions of components, from the connections 
among components, and from the spatial relationship between components (Weng et al. 
1999). Additionally, many cellular processes are performed and regulated not by individual 
proteins, but by proteins acting in large protein assemblies or macromolecular complexes. 
For instance, the eukaryotic ribosome, which translates RNA into protein, consists of ~80 
unique proteins (Wool et al. 1995), and the RNA polymerase II transcription complexes in
eukaryotic cells, which transcribe protein-coding genes, comprise at least 50 different proteins
(Pugh 1996).

Only a global view of a cell’s mRNA and protein complement, and a detailed knowledge 
of how these complements change with development and the environment (especially in disease),
will allow a full understanding of how a complex organism works. This is the real impetus
of proteomics, especially cell-mapping proteomics, which aims to describe all protein-protein
interactions (both spatially and temporally) within a given cell. The full complement of
protein-protein interactions of a cell's proteome is now referred to as the “interactome.” 
Uncovering protein-protein interaction schemes will have a pivotal role in the post-genome 
era by shedding light on the molecular mechanisms underlying biological processes (see Weng 
et al. 1999; Aravind 2000; Chen and Han 2000; Mayer and Hieter 2000; Gerstein et al. 2002).

A number of diverse combinatorial biology methods have proved to be invaluable both 
in identifying natural protein-protein interactions and also in mapping the specificities and 
energetics of these interactions (for a review, see Pelletier and Sidhu 2001). In these methods,
DNA-encoded libraries of peptides and proteins have been constructed that include a cova- 
lent linkage between each peptide/protein and the encoding DNA. Members of the libraries 
can be selected by virtue of a specific binding interaction, and their protein sequence can be 
deduced from the sequence of the cognate DNA. These combinatorial biology methods can 
be categorized into two groups depending on the method for generating DNA-encoded 
polypeptide libraries. In vitro methods include phage display (for review, see Rodi and 
Makowski 1999; Sidhu et al. 2000) and ribosome display technologies (Hanes and Plückthun
1997). (The principal application of phage display and ribosome display, thus far, has been 
for the selection and evolution of antibodies.) In contrast, in vivo combinatorial biology 
methods use polypeptide libraries that are expressed inside living cells along with a target 
protein of interest. The most widely used in vivo method for the detection of protein-protein 
interactions is the yeast two-hybrid system (Fields and Song 1989), which is discussed in the 
next section. 

In this chapter, various proteomic approaches for identifying members of protein com- 
plexes are discussed. Historically, the biochemical purification of protein complexes in suffi- 
cient amounts and purity to enable their characterization has been difficult. Invariably, a dif- 
ferent strategy had to be formulated for each complex under consideration, and mild separa- 
tion techniques that do not destroy protein-protein interactions had to be developed. The 
following are some of the physical techniques used to detect and purify interacting proteins: 


* Blue native polyacrylamide gel electrophoresis (Protocol 4).

* Native agarose gels (Protocol 3).

* Immunoprecipitation.

* Affinity capture (Protocols 1 and 2).


Protein interactions identified using some of these physical techniques are fraught with 
ambiguities and may not necessarily be a true reflection of biological complexes that occur 
within the cell. For instance, if the target “bait” used in affinity capture approaches exists in 
more than one posttranslationally modified form (e.g., phosphorylation state), and each of
these forms is involved in a multiprotein complex, then the complex captured by the affinity
tag (which is common to all posttranslationally modified forms of the target protein) will represent
a composite of a number of complexes. Such posttranslationally modified forms of the
target bait may be located in different parts of the cell (in different multiprotein complexes)
and may never be in contact in a physiological sense. In addition, to add to the complexity of
intracellular networking, such multiprotein complexes will vary temporally (e.g., different
stages of the cell cycle). It should also be borne in mind that biological macromolecules have 
evolved to function in crowded environments such as the inside of cells (where the total con- 
centration of protein and RNA is in the range of 300—400 g/liter [Zimmerman and Minton 
1991]) and that protein-protein interactions are protein-concentration-dependent (for 
reviews on “molecular crowding,” see Ellis 2001; Verkman 2002). Hence, if the cellular con- 
centration of expressed recombinant target protein (bait) varies markedly from one experi- 
ment to another, the nature of the corresponding multiprotein complexes derived from these 
experiments may bear no resemblance to each other. For example, the concentration of 
recombinant bait protein is more likely to resemble the intracellular concentration of the 
mature protein if the expression is driven off the natural promoter compared to a situation 
where it is overexpressed constitutively. The latter situation may result in the formation of 
multiprotein complexes that differ markedly in their protein components. For these reasons, 
it is important to use more than one physical method to verify any putative protein-protein 
interaction and, importantly, to verify the biological nature of such interactions using in vivo 
methods such as fluorescence resonance energy transfer (FRET) (see Protocol 8). 

In addition to identifying the components of multiprotein complexes, it is important to 
understand the spatial organization, or topography, of such complexes. Such information is 
critical to understanding the function of the complex at the molecular level. This information
is also important in the pursuit of small molecules that disrupt protein-protein interactions
(e.g., in the rational drug design of a small-molecule antagonist of protein-protein
complex formation). In the absence of three-dimensional structure information, chemical
cross-linking has been used to reveal closest-neighbor relationships between components
within a protein complex. In this chapter, a number of basic protocols for performing chemical
cross-linking or label transfer studies are discussed (see Protocols 5-7).

One of the ultimate goals of studying protein-protein interactions is to understand the 
consequences of the interactions for cell function. This depends in turn on understanding the 
strength of the interaction in the cell (i.e., the binding constant, Ka), the concentration of the
components involved in the interaction, the effect of cellular compartmentation, the influence
of competing proteins, and the flux of the individual reactions. (The global nature of
such studies, and the ability to predict the effect of external stimuli on cellular pathways, is 
referred to as total "systems biology" [Ideker et al. 2001; Davidson et al. 2002; Kitano 2002; 
Noble 2002].) For a discussion of methods for estimating and determining binding con- 
stants, see Phizicky and Fields (1995), Schuck (1997a,b), Lakey and Raggett (1998), Medaglia 
and Fisher (2001, 2002), and Cantor and Schimmel (1980). In this chapter, the fluorescence 
gel-retardation assay for determining Ka values is discussed (see Protocol 9).


IDENTIFYING PROTEIN-PROTEIN INTERACTIONS USING YEAST TWO-HYBRID 
ANALYSIS VERSUS PROTEOMIC APPROACHES 


Traditional genetic and biochemical methods for identifying protein-protein interactions 
largely focus on one target gene or protein at a time. Using one of the most popular meth- 
ods, the yeast two-hybrid system, thousands of new protein-protein interactions have been 
identified (Phizicky and Fields 1995). Figure 10.1 provides a schematic depiction of the basic 
principles of the yeast two-hybrid system; for a detailed discussion of the method, including 
basic protocols, see Sambrook and Russell (2001). In a study of the protein-protein interac- 
tions in the yeast Saccharomyces cerevisiae (the complete genome sequence is now available), 
this approach, until recently, largely focused on individual protein complexes (Flores et al. 
1999; Mayes et al. 1999). More recently, a high-throughput yeast two-hybrid method was 
developed for mapping all protein interactions in this organism (Uetz et al. 2000). This study 
revealed 957 putative protein interactions involving 1004 S. cerevisiae proteins; these data are 
now publicly accessible via the Web Site, http://curatools.curagen.com, thereby enabling indi- 
vidual researchers to manipulate the data themselves. A global network of protein-protein 
interactions in yeast (2709 putative interactions encompassing 2309 different proteins), generated
using a combination of yeast two-hybrid (Ito et al. 2000; Uetz et al. 2000) and biochemical
protein interaction data in public databases, has been described recently
(Schwikowski et al. 2000).

Using a set of 29 genes that had been previously characterized, Walhout and co-workers 
have used a similar high-throughput yeast two-hybrid approach to build an interaction 
matrix among a large complement of proteins encoded by C. elegans (Walhout et al. 2000). 
Interaction maps have also been proposed for a number of viruses such as Escherichia coli
bacteriophage T7 (Bartel et al. 1996), hepatitis C (Flajolet et al. 2000), and vaccinia virus
(McCraith et al. 2000) and more recently for the prokaryote Helicobacter pylori (Rain et al. 
2001). User-friendly navigable databases, including BIND (http://binddb.org), DIP 
(http://dip.doe-mbi.ucla.edu), and MINT (http://cbm.bio.uniroma2.it/mint/index.html)
(Zanzoni et al. 2002), are being developed to store and represent protein network informa- 
tion in such a way that individual scientists can formulate new interpretations or pose ques- 
tions for experimental testing (Mayer and Hieter 2000; Xenarios et al. 2000; Bader et al. 2001; 
Duan et al. 2002).
FIGURE 10.1. Detection of protein-protein interactions using the yeast two-hybrid system. Two-hybrid
analysis works by separating the coding sequences for the DNA-binding and activation domains of a transcriptional
activator and cloning them into separate vector molecules. The coding sequence of a candidate
protein whose partners are sought (referred to as "bait") is then fused with the DNA-binding domain. A
library of coding sequences for proteins that might interact with the "bait" (called "prey") is made in fusion
with the activation domain. Yeast has two sexes, called α and a. Therefore, baits and prey can easily be introduced
into the same yeast cell by mating. If they physically interact, the DNA-binding and activation
domains are closely juxtaposed, and the reconstituted transcriptional activator can mediate the switching on
of the gene that effects the color change. (a) Fusion of the bait protein and the DNA-binding domain of the
transcriptional activator cannot turn on the reporter gene. (b) Likewise, fusion of the prey protein and the
activating region of the transcriptional activator is also insufficient to switch on the reporter gene. (c) When
bait and prey associate, however, the DNA-binding domain and activator region are brought close enough
together to switch on the reporter gene. The result is gene transcription and a color change that can be monitored.
(Redrawn, with permission, from Oliver 2000 [©Macmillan].)
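
The logic of panels a-c can be summarized as a simple truth table. The sketch below is a toy model and not a simulation of the assay: protein names "X" and "Y" are placeholders, the function name is hypothetical, and complications such as self-activating baits are deliberately ignored.

```python
def reporter_on(fusions, interacting_pairs):
    """Return True if the reporter gene would be switched on.

    fusions: list of (protein, domain) tuples, where domain is "DBD"
             (DNA-binding domain, bait side) or "AD" (activation domain,
             prey side).
    interacting_pairs: set of frozensets naming proteins that bind.
    """
    baits = [protein for protein, domain in fusions if domain == "DBD"]
    preys = [protein for protein, domain in fusions if domain == "AD"]
    return any(frozenset((b, p)) in interacting_pairs
               for b in baits for p in preys)


if __name__ == "__main__":
    pairs = {frozenset(("X", "Y"))}
    print(reporter_on([("X", "DBD")], pairs))               # panel a
    print(reporter_on([("Y", "AD")], pairs))                # panel b
    print(reporter_on([("X", "DBD"), ("Y", "AD")], pairs))  # panel c
```

Only the combination of an interacting bait and prey in the same cell reconstitutes the activator in this model.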


Although yeast two-hybrid analysis has been highly successful in detecting many poten- 
tial protein-protein interactions, this technique has a number of limitations. 


* Because two-hybrid analysis is a transcription-based system, any bait protein that self-activates
the reporter genes is unsuitable for screening.


* Proteins that require posttranslational modification(s) for protein-binding activity (e.g.,
phosphorylation) will be less likely to be isolated by this screen.


* If an additional nonpeptide factor is involved in the protein interaction, the interaction
may go undetected. For example, two transcription factors might require a stretch of DNA
sequence to form a stable protein complex.


* The interaction may lack biological context. For example, many protein interactions only
occur when the cell is in a particular physiological state and thus may go undetected. Other
potential interactions, which are detected by this approach, may never occur in vivo,
because the proteins are located in separate cellular compartments.
TABLE 10.1. Selected examples of proteomic analyses of multiprotein complexes, organelles, and signaling pathways

Multiprotein complex
Human spliceosome (HeLa cells)                             Neubauer et al. (1998)
Yeast nuclear pore complex                                 Rout et al. (2000)
E. coli chaperonin GroEL                                   Houry et al. (1999)
Yeast ribosome                                             Link et al. (1999)
Yeast spliceosome U1 snRNP                                 Neubauer et al. (1997)
Yeast 26S proteasome                                       Verma et al. (2000)
Yeast nuclear pore complex                                 Allen et al. (2001); Rout et al. (2000)
Yeast protein complexes                                    Gavin et al. (2002); Ho et al. (2002)
Yeast anaphase-promoting complex                           Zachariae et al. (1998)

Organelle/cellular component
Human nucleolus                                            Anderson et al. (2002)
Human paraspeckles (subnuclear bodies)                     Fox et al. (2002)
Mouse microsomal fraction (HL-60 cells)                    Han et al. (2001)
Mouse phagosome (J774 cells)                               Garin et al. (2001)
Human epithelial cell membranes (LIM1215)                  Simpson et al. (2000)

Signaling pathways/secreted proteins
EGFR (HeLa cells)                                          Pandey et al. (2000b)
Secreted proteins during differentiation of mouse 3T3-L1   Kratchmarova et al. (2002)
preadipocytes to adipocytes

snRNP = small nuclear ribonucleoprotein.
EGFR = epidermal growth factor receptor.


PROTEIN-PROTEIN INTERACTIONS CAN BE ANALYZED USING AFFINITY 
CAPTURE TECHNIQUES 


The use of cell-mapping proteomics to identify protein interaction partners is a powerful way 
to elucidate the biological activity of a protein of unknown function. Important insights into 
the function of a protein can be gained from knowledge of the biochemical interactions in 
which the protein is involved, particularly where the function of the interaction partner is 
well-defined. The task of identifying interacting proteins has usually been tackled with tech- 
nologies that rely on the expression of gene products from a cDNA library in conjunction 
with some form of a screen to identify interacting proteins. The most widely used of these 
methods has been the yeast two-hybrid approach (see the previous section). In contrast to 
these genetic-based approaches, proteomic identification typically relies on isolating the 
interacting protein directly from a biological source using: 


* Immunoprecipitation. Coimmunoprecipitation is a classical method for detecting protein-protein
interactions and has been used extensively during the past couple of decades (see
Phizicky and Fields 1995).


* Isolation of multiprotein complexes. For a list of multiprotein complexes that have been 
extensively studied using cell-mapping proteomics, see Table 10.1. 


* Affinity capture. This uses a recombinant fusion protein comprising the target protein and
an affinity "tag" (or bait) covalently fused to the amino or carboxyl terminus of the target
protein. The tag may be a small peptide epitope (in this case, an antibody is used to capture
the fusion protein, as well as proteins interacting with the target protein) or a biospecific
protein (e.g., glutathione S-transferase [GST], which can be affinity-captured using
glutathione) or a synthetic peptide derived from a "structural motif." For a list of commonly
used affinity tags, see Table 10.2.
Affinity-purified proteins are typically fractionated by gel electrophoresis, using either
one-dimensional (1D) or two-dimensional (2D) gels (see Chapters 2 and 4, respectively), and 
visualized by Coomassie Blue staining. Specific bands are then excised from the gel, prote- 
olytically digested in situ, and identified by mass spectrometry (MS) (see Chapters 7 and 8). 
Comparison of gel profiles with those obtained from the incubation of a cell lysate with a 
control resin (i.e., a resin that lacks the immobilized probe protein) or the affinity tag fused
to an irrelevant protein is used to identify proteins that interact specifically. Alternatively, an
intact complex can be digested, without prior separation of the component proteins, and its 
component proteins can be identified by direct analysis of the complex peptide mixture (see 
Chapter 8, Protocol 6). 
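
Once the proteins in each lane have been identified, the comparison with the control amounts to a set difference. The lines below are a minimal sketch of that bookkeeping only; the protein names are invented placeholders, and the function name is hypothetical.

```python
def specific_binders(bait_lane, control_lane):
    """Proteins found with the bait but absent from the control resin."""
    return sorted(set(bait_lane) - set(control_lane))


if __name__ == "__main__":
    # Placeholder identifications, for illustration only.
    bait = ["bait protein", "partner A", "partner B", "sticky protein"]
    control = ["sticky protein"]
    print(specific_binders(bait, control))
```

Proteins that also appear in the control lane are treated as nonspecific binders and dropped from the candidate list.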

Affinity purification of interacting proteins from cell lysates is an inherently simple pro- 
cedure. The main limitation to identifying novel proteins by this technique is in isolating suf- 
ficient quantities of protein that enable subsequent characterization by MS-based methods. 
Success is ultimately dependent on the relative abundance of the interacting protein within 
the biological source used for the experiment. It is worthwhile to consider the practical
aspects of this limitation. Ideally, identification of a protein band on a gel by in situ digestion
and MS requires quantities that are detectable by Coomassie staining, i.e., ≥0.05 µg of protein.
For a 50-kD low-abundance protein present at ten copies per cell, it would take 6 x 10¹⁰
cells to provide 0.05 µg of material (see Figure 1.8), and this does not take into account sample
losses incurred during isolation of the target protein. For a highly abundant protein present
at 10,000 copies per cell, only 6 x 10⁷ cells would be required to produce the same
amount of protein. Although the production and processing of lysate from 10⁸ cells are likely
to be straightforward for most mammalian cell lines, scaling this up to 10¹¹ cells is a more
significant challenge. With this in mind, it is important to address the sensitivity issue in initial
pilot experiments. These are best performed using intrinsic [³⁵S]Met/Cys radiolabeling or
more sensitive silver staining to detect protein bands on a gel (see Chapter 2, Protocols 5-7 
and Chapter 7, Protocol 11). If specific interacting proteins are detected in these studies, then 
the investigator can proceed with scaling up the amounts of starting material used in the 
experiment to generate sufficient material for MS-based sequence analysis. 
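
The cell numbers quoted above follow directly from Avogadro's number. The short calculation below is a sketch for planning purposes: it assumes 100% recovery (no sample losses), and the function name is hypothetical.

```python
AVOGADRO = 6.022e23  # molecules per mole


def cells_required(mass_ug, mol_weight_da, copies_per_cell):
    """Number of cells needed to supply mass_ug micrograms of a protein,
    assuming complete recovery of the protein from the lysate."""
    moles = mass_ug * 1e-6 / mol_weight_da   # grams divided by g/mol
    molecules = moles * AVOGADRO
    return molecules / copies_per_cell


if __name__ == "__main__":
    # 0.05 ug of a 50-kD protein corresponds to 1 pmole.
    print(f"{cells_required(0.05, 50_000, 10):.1e}")      # low abundance
    print(f"{cells_required(0.05, 50_000, 10_000):.1e}")  # high abundance
```

For a 50-kD protein, 0.05 µg is 1 pmole, or about 6 x 10^11 molecules, which gives about 6 x 10^10 cells at ten copies per cell and about 6 x 10^7 cells at 10,000 copies per cell.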

Protocols 1 and 2 provide details for performing two types of affinity capture. Protocol 1 
describes the use of FLAG-epitope-tagged proteins for both small-scale analysis and large- 
scale coimmunoprecipitation of proteins interacting with the human inhibitor of apoptosis 
protein (IAP) complex using a FLAG-tag IAP fusion protein transiently introduced into 
mammalian U292 cells (Verhagen et al. 2000). The ectopically expressed (transiently or sta- 
bly expressed) epitope-tagged protein X (e.g., FLAG-protein X) can be immunoprecipitated 
using commercially available epitope-tag-specific monoclonal antibodies. The advantage of
examining the interactions of endogenously expressed proteins is that these interactions are
more likely to be physiological and less likely to be an artifact of overexpression. In scaling up
the procedure, however, epitope-tagged overexpressed protein X may be necessary for sufficient
coimmunoprecipitating protein to be isolated. Protocol 2 contains a generic affinity
capture method using a recombinantly expressed protein that is known or believed to inter- 
act with other proteins. Two successful applications are described within Protocol 2, one 
using the GST recombinant protein system to isolate the interacting partners for the sup- 
pressors of cytokine signaling (SOCS) family of proteins, which act as intracellular inhibitors 
of several cytokine signal transduction pathways (Zhang et al. 1999). DNA fragments encod- 
ing a conserved 40-amino-acid region of the SOCS family of proteins (SOCS box) with a 14- 
amino-acid linker sequence were generated by polymerase chain reaction (PCR) and cloned 
into the GST bacterial expression vector pGEX-2T (Smith and Johnson 1988). The GST
fusion proteins were first purified by affinity chromatography on glutathione Sepharose 4B 
OPTIMIZATION OF VARIABLES FOR AFFINITY CAPTURE METHODS


Given the sensitivity issue, careful selection of conditions that maximize the yield of inter- 
acting proteins can make the difference between experimental success and failure. Several 
variables must be considered: 


• What is the nature of the probe protein? The probe can be either a full-length protein, a
relevant interaction domain, or even a smaller bioactive fragment that can be chemical- 
ly synthesized. Although a full-length protein must, by default, contain all of the regions 
likely to participate in protein-protein interactions, a smaller active fragment may be eas- 
ier to produce, may be immobilized at higher density, and may be less susceptible to non- 
specific interactions. 


• How is the probe protein to be immobilized in preparing the affinity matrix? Covalent
immobilization allows more stringent wash and elution conditions, but may result in 
partial or even total inactivation of the probe protein. A protein that is noncovalently 
immobilized via association of a fused affinity tag (e.g., GST) with an immobilized li- 
gand is likely to retain activity; however, if large excesses of the tagged probe protein are 
eluted from the affinity resin, this may obscure interacting proteins in the SDS-PAGE 
(SDS-polyacrylamide gel electrophoresis) analysis. 


• What cell line will be used as a source of the interacting proteins? This is a critical
consideration. A cell line that is known to express the probe protein naturally is a
good place to look for interacting partners. However, the investigator should also
consider how easy it would be to scale up production of that particular cell line. In this 
regard, a cell line that grows in suspension is preferable to one which is adherent. 


• How will the cell lysate be prepared? The conditions used for lysing the cells should not
inactivate any potential interacting protein, nor interfere with the subsequent affinity 
capture. If the probe protein is naturally localized to a particular cellular compartment, 
preparations of that cell compartment may be preferable to using a whole-cell lysate (see 
Chapter 3). 


• Do the source cells need to be pretreated prior to their lysis? For instance, if the probe
protein is a component of a cell signaling pathway, activation of the target cells with the 
appropriate growth factor may be required prior to lysis. 


• How will the affinity resin be washed following incubation with cell lysate? Some cellular
proteins will bind nonspecifically to the affinity matrix; thus, it is important to minimize
the capture of these contaminating proteins. Nonspecifically associating proteins
are removed by washing the affinity resin after incubation with the cell lysate. Although
extensive washing will remove these proteins, too much washing can result in the loss of
specifically associated proteins. Consequently, the extent of washing should be adjusted
so as to maximize the capture of specifically interacting proteins while removing as
much of the background as possible. As nonspecifically associating proteins are never
totally removed by washing, it is important to run a parallel control experiment to show
that the affinity capture of a cellular protein is due to interaction with the immobilized
probe protein and not to other components of the affinity matrix. Comparison of the
SDS-PAGE profiles of cellular proteins captured by the probe protein resin versus the
control resin is used to distinguish specifically from nonspecifically associated proteins (also
see Tandem Affinity Purification below).

• One- or two-dimensional gel electrophoresis? One-dimensional SDS-PAGE is technically
less challenging than running two-dimensional gels, but the latter has a far greater
capacity to resolve complex mixtures. Determining which of these two techniques 
should be used will depend on the level of background-contaminating proteins that cop- 
urify with the interacting protein(s). 
and then covalently coupled to Sepharose resin. This affinity resin was used to isolate
SOCS-box-binding proteins from the M1 cell lysate, which were subsequently identified by
MS-based methods as elongins B and C.


Tandem Affinity Purification 


For stringent purification of protein complexes, a tandem affinity purification (TAP) tag, 
comprising a fusion cassette encoding a low-affinity tag/enzymatic cleavage site/high-affinity
tag, has been described (see Figure 10.2) (Rigaut et al. 1999). In this system, calmodulin-binding
peptide (CBP) was chosen as the low-affinity tag because it allows for efficient selection
and specific release from the affinity column (e.g., using EGTA under mild conditions),
and ProtA (two IgG-binding units of Protein A of Staphylococcus aureus) as the high-affinity
tag. In contrast to CBP, ProtA can only be released from matrix-bound IgG under denaturing
conditions at low pH. With the TAP strategy, a specific TEV protease recognition
sequence is inserted between the CBP and ProtA tags to allow specific proteolytic release of 
the bound complex under native conditions. For additional details and protocols using TAP, 
see Séraphin et al. (2002). 
FIGURE 10.2. Scheme of TAP strategy for affinity capture. (Redrawn, in part, with permission, from Rigaut
et al. 1999.)
Isolation of Multiprotein Complexes Using Electrophoresis


An important tool for the biochemist is the ability to analyze protein complexes in their
native state. Two commonly used methods are native agarose gel electrophoresis and Blue
Native-PAGE (BN-PAGE).


Native agarose gel electrophoresis 


Native agarose gel electrophoresis is a well-established method for isolating multiprotein 
complexes (Hames 1990; R. Kim et al. 1999). The method uses a horizontal gel apparatus that 
is simple to set up, takes a short time to run, and avoids the use of toxic components. This
system allows the detection of both positively and negatively charged proteins as well as
protein-protein complexes in the same gel (R. Kim et al. 1999). A protein in native agarose can
have either a positive or negative charge, depending on its isoelectric point (pI) and the pH
of the buffer used to perform the electrophoresis. Proteins with a pI value lower than the
buffer pH carry a net negative charge and migrate toward the anode, whereas proteins with
a pI higher than the buffer pH carry a net positive charge and migrate toward the cathode. The
gel is run in a submerged horizontal platform with the wells positioned in the center of the
gel. This allows for negatively and positively charged proteins to migrate toward the anode
and cathode, respectively. Proteins with different molecular weights and pI values can be tested
as well as proteins that form complexes, whether they be two pure proteins forming a complex
or a complex formed after incubating a pure protein with a crude extract. Once a complex
is formed, it can be cut from the gel and the components isolated using ultrafiltration.
This method does not replace isoelectric focusing (IEF) gels because it does not determine
the pI of a protein, but it does facilitate the detection of protein-protein complexes and may
provide information on the protein's charge at a defined pH. A generic method for performing
native agarose gel electrophoresis is given in Protocol 3.
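
The relationship between pI, buffer pH, and direction of migration described above can be captured in a few lines of Python. This is only an illustrative sketch: the function name, the tolerance used to decide that a protein is effectively uncharged, and the example pI values are assumptions made for illustration and are not part of Protocol 3.

```python
def migration_direction(pI: float, buffer_pH: float, tol: float = 0.05) -> str:
    """Predict where a protein moves in a native agarose gel with central wells.

    pI        -- isoelectric point of the protein
    buffer_pH -- pH of the running buffer
    tol       -- hypothetical window within which the net charge is treated as ~0
    """
    if abs(pI - buffer_pH) <= tol:
        # Essentially no net charge: the protein is expected to stay near the well.
        return "near well"
    # pI below buffer pH -> net negative charge -> moves toward the anode (+).
    # pI above buffer pH -> net positive charge -> moves toward the cathode (-).
    return "anode" if pI < buffer_pH else "cathode"


# Hypothetical proteins run at a buffer pH of 7.4
for name, pI in [("acidic protein", 5.2), ("basic protein", 9.1)]:
    print(f"{name} (pI {pI}): {migration_direction(pI, 7.4)}")
```

A shift in the direction or extent of migration after incubation of two proteins may indicate complex formation, but the sketch predicts only the sign of the net charge and not mobility.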


BN-PAGE 


BN-PAGE is a technique developed by Schägger and von Jagow (1991) for the separation and
analysis of membrane protein complexes, often in their enzymatically active form. Its main 
advantage over conventional PAGE systems is the use of Coomassie Brilliant Blue G in both 
the sample loading buffer and cathode buffer. Binding of the dye to native proteins performs 
two important functions: 


• It imparts a slight negative charge on the proteins, thereby enabling them to enter the
native gel at neutral pH where the stability of protein complexes is optimal.


• By binding to hydrophobic regions of proteins, the dye prevents protein aggregation during
electrophoresis.


Electrophoresis separates detergent from protein complexes, which often results in the 
aggregation of hydrophobic membrane proteins. However, the presence of Coomassie dye in 
BN-PAGE maintains protein solubility, thereby enabling multiprotein complexes to separate 
one from another according, largely, to their apparent molecular mass (Mr). A further advantage
of this technique is that protein bands are visible during electrophoresis and thus subsequent
staining of the gel is not always necessary. Although the charge shift on membrane proteins
and their complexes exerted by the Coomassie dye can lead to aberrant molecular masses,
the Mr of proteins with a pI below 8.6 does not deviate significantly when compared with
most soluble protein markers (Schägger et al. 1994).

BN-PAGE has been instrumental in the analysis of protein complexes of the mitochondrial
membranes, in particular respiratory complexes (Schägger 1996). It has also been an
important tool in the analysis and assembly of mitochondrial protein translocation complexes
(Dekker et al. 1998; Ryan et al. 1999; Model et al. 2001). Additionally, the method has been used 
to study individual or multiple protein complexes from membranes including chloroplasts 
(Poetsch et al. 2000), endoplasmic reticulum (Wang and Dobberstein 1999), and the plasma 
membrane (Schamel and Reth 2000). BN-PAGE can also be used for the analysis of soluble pro- 
tein complexes, as has been observed for the heptameric mitochondrial matrix form of Hsp60. 

Not all membrane proteins and their complexes resolve on BN-PAGE. For example,
many mitochondrial proteins streak from the high- to the low-Mr range, which may be due
to the proteins dissociating from their complexes during electrophoresis or to a change in
their solubility. Other complexes resolve extremely well. This variability may depend on
a number of factors including the detergent employed and the stability of the protein complex,
as well as whether Coomassie dye in fact binds to the protein(s) being analyzed. It is
important to use a detergent that is efficient at solubilizing membranes but does not disrupt
the integrity of the membrane protein complex. Initial studies should determine which detergents
are most suitable for maintaining the particular protein complex intact prior to application
and for the duration of the run. Most studies employ Triton X-100, n-dodecyl maltoside,
or digitonin as detergents. In the case of mitochondria, digitonin gives more discernible
protein complexes of higher Mr in comparison to dodecyl maltoside or Triton X-100. Indeed,
the stable complexes observed on BN-PAGE following Triton X-100 solubilization can
instead be seen as supercomplexes when digitonin is used (Schägger and Pfeiffer 2000).

BN-PAGE has also been used to analyze the subunit composition of membrane protein 
complexes. Total extracts or purified membrane protein complexes can be subjected to BN- 
PAGE, and individual subunits can be separated using SDS-PAGE in the second dimension. 
Well-resolved protein spots originating from complexes can be observed and subjected to 
further downstream processing such as Coomassie staining, immunoblot analysis, or amino 
acid sequencing. This is a particularly useful technique because two-dimensional gel electrophoresis
using IEF in the first dimension may be problematic for resolving membrane
proteins. Other applications of BN-PAGE include:


• Purification of membrane protein complexes for crystallization trials for structural analysis
(Poetsch et al. 2000).


• Separation of individual mitochondrial respiratory complexes prior to activity determinations
using in-gel histochemical staining.


• Examination of cell lines originating from patients with mitochondrial disorders
(Dabbeni-Sala et al. 2001).


Protocol 4 describes a generic method for performing BN-PAGE. 


UNDERSTANDING THE ARCHITECTURE OF MULTIPROTEIN COMPLEXES 


Chemical Cross-linking Provides Information about Protein Proximity 
and Spatial Organization 


Cross-linking fuses proteins that are physically close to each other (Kamp 1988). Once a
linked pair is formed, it remains intact even under harsh purification conditions such as SDS- 
PAGE. The linked proteins can be identified using mass spectrometric methods. This allows 
one to conclude that these proteins either interact directly or at least reside in close proxim- 
ity in a protein complex. The strategy can be employed in two scenarios: the identification of 
weakly or transiently bound complex components and the analysis of nearest-neighbor relations
in protein complexes (Rappsilber et al. 2000).
Using cross-linkers, weak interactions can be stabilized throughout the purification of a
complex. In this way, novel complex components may be identified by MS that otherwise 
would have been lost in the washing steps of the purification. This approach has successfully 
been employed for some membrane complexes (Layh-Schmitt et al. 2000). However, due to 
the cell membrane, which restricts free access of the cross-linkers to the interior of the cell,
and the requirements of cross-linkers to work in specific buffer compositions, in vivo
cross-linking is still in its infancy.

Cross-linking can also be used to investigate the spatial organization of proteins in a 
complex (see, e.g., Wong 1991). For such a topological analysis, a protein complex is incom- 
pletely cross-linked and the products are separated using SDS-PAGE (see Chapter 2). The 
components of each band, i.e., the pairs of neighboring proteins, are then identified using 
MS, obviating the need for special reagents like antibodies. The cross-linking reaction is best 
performed on a purified complex as this allows better control of the reaction conditions. 

The optimal yield of the reaction cannot be 100%, because this would mean that all proteins
of a complex are covalently linked together and pairs cannot be isolated. Instead, the goal
should be a distribution of all possible stages of linkage: pairs, trimers, and so on. This means
that only a fraction of the complex gives rise to each of the products and the yield is therefore
very low. The typical material requirement with standard equipment in a nonspecialized laboratory
is ~5 pmoles, allowing the analysis of products that have a yield as low as 2-5%.
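
To put these numbers in perspective, the amount of each cross-linked product available for analysis can be estimated with simple arithmetic. The short Python sketch below uses the figures quoted above (5 pmoles of complex, product yields of 2-5%); the function name is hypothetical and the calculation ignores losses during sample handling.

```python
def product_amount_fmol(complex_pmol: float, yield_percent: float) -> float:
    """Amount (in fmol) of one cross-linked product formed from a given
    amount of complex (in pmol) at a given product yield (in percent)."""
    return complex_pmol * 1000.0 * yield_percent / 100.0


# 5 pmol of starting complex at the yields mentioned in the text
for y in (2, 5):
    print(f"{y}% yield -> {product_amount_fmol(5, y):.0f} fmol of product")
```

Under these assumptions, each product band contains only about 100-250 fmol of protein, which is why sensitive staining and careful sample handling are needed.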

Every complex is different with respect to the orientation of functional groups toward 
each other and within their microenvironment. Therefore, no one ideal cross-linker suits all 
cases. Cross-linkers with different chemical properties must be tested and the best reaction 
conditions must be determined empirically. 

Cross-linkers are available with a variety of reactive groups, spacer lengths, rigidity of 
spacers, and hydrophobicity/hydrophilicity properties. Most cross-linkers are composed of 
two functional groups separated by a spacer. They may have identical (homobifunctional
cross-linker) or different (heterobifunctional cross-linker) functional groups on each end. 
Cross-linkers react only with specific amino acid side chains, most often the side chains of 
cysteine, lysine, aspartic acid, and glutamic acid, except in the case of photoinducible groups, 
which are fairly promiscuous. Zero-length cross-linkers function differently by activating an 
amino acid side chain. It is not known beforehand which functional groups are displayed in 
a suitable geometry on the surfaces of the proteins in the complex to be linked. Therefore, a 
set of cross-linkers must be tested. In Protocol 5, the following cross-linkers were chosen: 
EDC, a zero-length cross-linker that activates carboxyl groups to react with amino groups; 
SMCC, a heterobifunctional cross-linker that reacts with sulfhydryl groups on one side and 
amino groups on the other side; and the two homobifunctional cross-linkers BMH and BS3,
the first linking sulfhydryl groups, and the second linking amino groups. 

It is not enough that two proteins display the necessary surface groups for cross-linking. 
The groups must also present the correct geometry to each other. This can be tested by using 
cross-linkers with tethers of different lengths. Sometimes, the resulting data can even be used 
for a detailed study on the distances between proteins within the complex. Generally, howev- 
er, the shorter the linker, the more convincing the conclusion that the linked proteins are 
direct neighbors. It should be noted that for steric reasons, a longer linker with the identical 
functional groups will not necessarily yield all of the products given by a shorter one. 

To determine the best distribution of products, the reaction time, reaction temperature, 
and concentration of reactants must be varied. The temperature and the ratio of cross-link- 
er to complex will influence the extent of linkage. The temperature influences the flexibility 
in the complex, with a higher degree of linkage at elevated temperatures. Too much linker
results in complete linkage of the complex, whereas too much of the complex can result in 
oligomerization and linkage of proteins belonging to separate complexes. As the amount of
available complex may actually be too low to allow quantification, the optimal ratio of cross- 
linker to complex is determined by varying the concentration of the cross-linker. An excess 
of cross-linker over protein is required as the cross-linker is hydrolyzed. The reaction ends 
due to this hydrolysis and the modification of the available surface groups of the proteins. 

The partially cross-linked complex is denatured and analyzed by SDS-PAGE. To resolve 
the high-molecular-weight products, a low percentage of acrylamide should be chosen. The 
products are recognized by comparison to a lane where unlinked complex has been separated.
The migration of linked protein species on an acrylamide gel can deviate from the expected
behavior on the basis of the calculated molecular mass. This migration pattern may also
be influenced by the position of the link due to a different geometry of the resulting complex. 
Products from different cross-linkers with the same protein composition may therefore have 
slightly different migration properties. The protein bands may be visualized using Coomassie 
Brilliant Blue, colloidal Coomassie, fluorescent dyes such as SYPRO stains (i.e., Orange, Red, 
or Ruby), as well as negative staining using ZnCl2, without restriction with regard to the
planned analysis by MS (for staining protocols, see Chapters 2, 4, and 7). Of these, colloidal 
Coomassie has the highest sensitivity. Although silver staining is even more sensitive, a pro- 
tocol must be used that is compatible with mass spectrometric analysis (e.g., see Shevchenko 
et al. 1996; also see Chapter 2, Protocol 7 and Chapter 4, Protocol 12). Standard silver stain- 
ing protocols employ high concentrations of reagents such as glutaraldehyde, which cross- 
link the proteins in the gel and impair the subsequent analysis by MS. 

For identification by MS, the proteins are enzymatically degraded, with trypsin being the 
protease of choice. Extensive modification and cross-linking impairs protease activity. The 
low amount of linkage used in topological studies, however, is tolerated in the standard pro- 
tocols (Shevchenko et al. 1996). Sensitivity can be severely compromised by losses in sample 
transfer. In addition, drying down the peptide solution should be avoided (Simpson et al. 
1989; Speicher et al. 2000). The mass spectrometric analysis can be done using standard 
equipment for the identification of proteins such as ion traps, quadrupole time-of-flight 
(TOF) instruments, or high-sensitivity reflector matrix-assisted laser desorption ionization 
(MALDI)-TOF instruments (see Chapter 8).

For the initial protein identification, the peptide mass data (see Chapter 8, Peptide Mass 
Fingerprinting) can be used to search a nonredundant sequence database without restric- 
tions. However, depending on the data quality, the species and protein masses may be used as 
constraints. The confidence in an identification can be greatly increased if fragmentation data 
can be obtained in some way, either by post-source decay (PSD) or by tandem MS (MS/MS) 
experiments in the mass spectrometer (see Chapter 8 for both PSD and MS/MS). For topo- 
logical analyses, the complex components are known and the searches can therefore be 
restricted to these proteins. Fragmentation data are then only required if special conclusions 
are to be drawn from a particular observed mass. This is the case for observed signals that 
potentially arise from cross-linked peptides.


Degree of difficulty of this technique 


As an alternative to determining the identity of cross-linked proteins, the direct identification 
of the linked peptides is an intriguing way to obtain information about the folding of a protein
or the location of interaction surfaces (Young et al. 2000; Pearson et al. 2002). For a number
of reasons, however, this technique is also rather arduous. The efficiency of cross-linking
is low, resulting in a limited amount of product. The products may be homogeneous in that 
protein A is linked to protein B, yet the sites of linkage may be heterogeneous. 
An additional problem is dynamic range. For example, although 1 pmole of product will
give 1 pmole of generated peptide, the amount of cross-linked peptide will be dependent on
the extent of cross-linking. For example, if only 10% cross-linking occurs, the yield of
cross-linked peptide will be 0.1 pmole.

Furthermore, the sensitivity to detect linked peptides is generally lower. They may
be lost more easily during the sample workup (e.g., less efficiently extracted from the gel
plugs when compared to unlinked peptides), and MS has a lower sensitivity toward larger peptides,
with linked peptides being about twice the size of unlinked ones. Taken together, these
arguments account for the rare observation of linked peptides in a cross-link investigation.
This can only be improved if a method is used specifically to enrich or detect linked peptides.
In any case, a single mass alone is insufficient to allow conclusions as to which peptides are linked.
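
The ambiguity of a single observed mass can be illustrated with a small Python sketch. Here, the mass of a linked peptide pair is taken as the sum of the two peptide masses plus the mass added by the linker. The peptide names and masses are invented for illustration, the linker increment of 138.068 daltons is an assumed value for a BS3-type amine-reactive linker, and the mass tolerance is arbitrary; none of these values is taken from the protocols in this chapter.

```python
from itertools import combinations_with_replacement

LINKER_DELTA = 138.068  # assumed mass (Da) added by a BS3-type linker


def candidate_pairs(observed, peptides, linker_delta=LINKER_DELTA, tol=0.5):
    """Return all peptide pairs whose combined mass plus the linker increment
    matches the observed mass within +/- tol daltons."""
    hits = []
    for (n1, m1), (n2, m2) in combinations_with_replacement(sorted(peptides.items()), 2):
        if abs(m1 + m2 + linker_delta - observed) <= tol:
            hits.append((n1, n2))
    return hits


# Hypothetical tryptic peptide masses (Da) from protein A and protein B
peptides = {"A1": 1000.50, "A2": 1200.60, "B1": 1500.70, "B2": 1300.60}

# One observed signal is explained equally well by two different pairs
print(candidate_pairs(2639.27, peptides))
```

In this constructed example, the pairs A1/B1 and A2/B2 both account for the observed signal, so fragmentation data would be needed to decide between them.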


Subsecond Photo-cross-linking Using Water-soluble Metal Complexes 


Conventional cross-linkers are usually applied to the analysis of interactions of a few proteins 
in equilibrium, mainly because typical bifunctional reagents, consisting of two electrophilic
groups or UV-light-activated phenones or azides separated by a linker arm, work too slowly to monitor
rapid changes in protein environment (Wong 1991). Additionally, the extended incubation times
required for these constitutively reactive reagents can effect large-scale modification of
nucleophilic side chains, such as acylation of lysines, on the surface of proteins.
This raises the concern of artifactual results attributable to structure destabilization.
To circumvent some of these problems, a fundamentally new type of protein cross-linking
chemistry (coined PICUP or photo-induced cross-linking of unmodified proteins) based
on radical coupling reactions has been developed (Fancy and Kodadek 1999; K. Kim et al.
1999). In this procedure (see Protocol 6), the proteins of interest are mixed with water-soluble
metal complexes such as tris(2,2'-bipyridyl)ruthenium(II) chloride hexahydrate
(Ru(II)[bpy]₃²⁺) (see Figure 10.3a) (Fancy and Kodadek 1999) or a palladium porphyrin,

FIGURE 10.3. Water-soluble metal complexes used for sub-second photo-cross-linking: (a) Ru[bpy]₃Cl₂;
(b) Pd(II) tetrakis-(4-N-[methyl]-pyridyl)porphyrin. (Courtesy of David A. Fancy, University of Texas
Southwestern Medical Center.)
such as palladium(II) (tetrakis 4-N-(methyl)-pyridyl porphyrin) (see Figure 10.3b) (K. Kim
et al. 1999), and ammonium persulfate. The mixture is then briefly photolyzed with visible
light (>400 nm) to photo-oxidize the metal complex via electron transfer to ammonium persulfate
(see Figure 10.4) (Bolletta et al. 1980; Aboul-Enein and Schulte-Frohlinde 1988;
Nickel et al. 1994). The now-activated metal complex, either Ru(III) or a Pd(II) porphyrin
radical cation, extracts an electron from an aromatic amino acid, such as tryptophan or tyrosine,
resulting in a protein-centered radical species that can attack a wide variety of other
groups, eventually leading to covalent cross-linking. The speed and ease of using this technique
have opened a door for the examination of protein-protein interactions not yet in
equilibrium, and thus should aid biochemists in their pursuit to understand cellular function.


The Structural Organization of Protein-DNA and Multiprotein-DNA Complexes Can 
Be Analyzed by Site-specific Protein-DNA Photo-cross-linking 


Site-specific protein-DNA photo-cross-linking is able to define positions of proteins relative 
to DNA within large multiprotein-DNA complexes, and thus permits analysis of the structural
organization of these complexes (Bartholomew et al. 1990; Bell and Stillman 1992;
Lagrange et al. 1996). In conjunction with rapid mixing and laser flash photolysis, site-specific
protein-DNA photo-cross-linking is also able to define the kinetics of formation of individual
protein-DNA interactions within large multiprotein-DNA complexes, and thus also
permits analysis of pathways and kinetics of assembly of complexes (S. Druzhinin et al.,
unpubl.).
FIGURE 10.4. Scheme depicting photo-oxidation of metal complex (via ammonium persulfate) to form a
radical cation that then abstracts an electron from an aromatic amino acid (e.g., tryptophan or tyrosine),
resulting in a protein-centered radical species that can then attack a wide variety of other amino acid side 
chains, resulting in covalent cross-links. After absorption of visible light by a soluble metal complex, the 
excited electron is trapped by ammonium persulfate, leading to a deficient metal center and sulfate radical 
anion. The oxidized metal center then abstracts an electron from an aromatic amino acid, such as tyrosine, 
leaving a radical hole competent for attack by another aromatic side chain or nucleophile. This eventually 
leads to covalent linking by an undetermined mechanism. (Courtesy of David A. Fancy, University of Texas 
Southwestern Medical Center.) 
Several procedures have been developed for site-specific protein-DNA photo-cross-linking
within large multiprotein-DNA complexes (Bartholomew et al. 1990; Bell and Stillman
1992; Lagrange et al. 1996). The most effective procedure involves the following four steps
(see Figure 10.5):


1. Chemical (Fidanza et al. 1992; Yang and Nash 1994; Mayer and Barany 1995) and enzymatic
(Sambrook and Russell 2001) reactions are used to prepare a DNA fragment containing
a phenyl-azide photoactivatible cross-linking agent and an adjacent radiolabel
incorporated at a single, defined DNA phosphate (with a 9.7 Å linker between the photoreactive
atom of the cross-linking agent and the phosphorus atom of the phosphate,
and with an ~11 Å maximum “reach” between potential cross-linking targets and the
phosphorus atom of the phosphate).


2. The multiprotein-DNA complex of interest is formed using the site-specifically deriva- 
tized DNA fragment, and the multiprotein-DNA complex is UV-irradiated, initiating 
covalent cross-linking with proteins in direct physical proximity to the cross-linking agent. 


3. Extensive nuclease digestion is performed, eliminating uncross-linked DNA and converting
cross-linked DNA to a cross-linked, radiolabeled 3- to 5-nucleotide “tag.”


4. The “tagged” proteins are identified, usually by denaturing PAGE followed by autoradi- 
ography. 

The procedure is performed in a systematic fashion, with preparation and analysis of at 
least ten derivatized DNA fragments, each having the photoactivatible cross-linking agent 
incorporated at a single, defined DNA phosphate (typically, each second DNA phosphate
[each 12 Å] on each DNA strand spanning the region of interest; Lagrange et al. 1996, 1998;
Kim et al. 1997, 2000a; Wang and Stumph 1998; Bartlett et al. 2000; Naryshkin et al. 2000). 

The results of the procedure define the translational positions of proteins relative to the 
DNA sequence. Plotted on a three-dimensional representation of a DNA helix, the results 
also define the rotational orientations of proteins relative to the DNA helix axis, and the 
groove orientations of proteins relative to the DNA major and minor grooves (Lagrange et al. 
1996, 1998; Kim et al. 1997, 2000a; Wang and Stumph 1998; Naryshkin et al. 2000). 

Protocol 7 presents detailed methods for cross-linking of protein-DNA complexes immo- 
bilized on streptavidin-coated paramagnetic beads (“on-bead” cross-linking; see Naryshkin et 
al. 2000). (For protocols on cross-linking of protein-DNA complexes in solution [“in-solution” 
cross-linking] or in polyacrylamide gel matrices [“in-gel” cross-linking], see Kim et al. [2000b]
and Naryshkin et al. [2001].) On-bead cross-linking offers three important advantages: 


• The ability to perform stringent washing to eliminate nonspecific and nonproductive complexes
(Kim et al. 2000a; Naryshkin et al. 2000).


• The ability to rapidly change reaction media (e.g., to add substrates, inhibitors, or allosteric
effectors; Kim et al. 2000a; Naryshkin et al. 2000).


• The ability to change reaction temperatures rapidly.


In view of these advantages, we now employ on-bead cross-linking in all photo-cross- 
linking experiments in our laboratory not requiring rapid mixing and laser flash photolysis 
(Naryshkin et al. 2000; Y. Kim et al., unpubl.). 


PROTEIN-PROTEIN INTERACTIONS CAN BE ANALYZED IN VIVO WITH FRET


The ability to study protein complexes in living cells is crucial for understanding the nature 
of the complexes, how their properties are affected by various conditions, and how their 
FIGURE 10.5. Site-specific protein-DNA photo-cross-linking (Lagrange et al. 1996, 1998; Kim et al. 1997, 2000a, 2000b; Naryshkin
et al. 2000, 2001). (a,b) Chemical and enzymatic reactions are used to prepare a DNA fragment containing a phenyl-azide photoactivatible
cross-linking agent (R) and an adjacent radioactive phosphorus (*) incorporated at a single, defined DNA site. On the
basis of the chemistry of incorporation, the maximum distance between the site of incorporation and the photoreactive atom is
9.7 Å; the maximum distance between the site of incorporation and a cross-linked atom is ~11 Å. (c) UV irradiation of the derivatized
protein-DNA complex initiates cross-linking. Nuclease digestion eliminates uncross-linked DNA and converts cross-linked,
radiolabeled DNA to a cross-linked, radiolabeled three- to five-nucleotide “tag.” The “tagged” proteins are identified by denaturing
polyacrylamide gel electrophoresis and autoradiography. (Redrawn, with permission, from Lagrange et al. 1996.)


structures and functions are related. The development of green fluorescent protein (GFP) 
derivatives has provided an opportunity to study protein-protein interactions in living celis 
with FRET (Heim and Tsien 1996; Miyawaki et al. 1997; Veveer et al. 2002). GFP-based FRET 
has been used in a wide range of applications (van Roessel and Brand 2002). 

FRET involves the indirect excitation of an acceptor, such as yellow fluorescent protein
(YFP), by a directly excited donor, such as cyan fluorescent protein (CFP) (Stryer 1978; Clegg
1996). FRET can occur only under certain conditions: when the donor and acceptor are very
close in space and have a certain relative orientation. The maximum interfluorophore distance
depends on the properties of the specific fluorophores used and is ~52 Å for the CFP-YFP
pair (Tsien 1998). Therefore, if two proteins are tagged, one with CFP and the other with
YFP, the detection of FRET requires very close association between the target proteins. The
use of CFP and YFP as a donor-acceptor pair is possible because of substantial overlap
between the emission of CFP and the excitation of YFP (see Figure 10.6). The broad spectra
of these fluorophores yield substantial cross-talk (i.e., detection of one fluorophore with
filters used for the other fluorophore), but this problem can be surmounted by appropriate
corrections during the data analysis.

FIGURE 10.6. Excitation and emission spectra of CFP and YFP. The normalized fluorescence
intensities for excitation (solid lines) and emission (dashed lines) as a function of wavelength
are shown for CFP (top) and YFP (bottom). The overlap between CFP emission and YFP excitation
allows FRET to occur (arrows) under certain conditions. [Horizontal axis: Wavelength (nm),
425-575.] (Courtesy of Marc Damelin and Pamela Silver, Dana Farber Cancer Institute.)

FRET can be a powerful method for studying protein interactions in living cells for the
following reasons:

* Target proteins are studied under physiological conditions while they are carrying out their
normal cellular function. This is generally not the case for other in vivo systems, such as the
yeast two-hybrid system, in which protein fusions are overexpressed and targeted to the
nucleus to allow gene expression readouts.

* A detected FRET signal is more likely to represent a direct interaction than analogous
results from yeast two-hybrid or coimmunoprecipitation experiments; the strong dependence
of FRET on the interfluorophore distance implies that two target proteins yielding a
FRET signal are very close in space.

* Protein interactions can be monitored in vivo over the course of the cell cycle, drug
treatments, or other environmental stimuli.

* The structural organization of macromolecular protein complexes, which may contain
scores of protein interactions and may be difficult to study in vitro, can be analyzed.
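
The strong distance dependence mentioned above can be made concrete with the standard Förster relation, E = 1/(1 + (r/R0)^6), which is not stated in this chapter and is added here only as an illustration. The sketch below assumes that the ~52 Å value quoted for the CFP-YFP pair can be read as the Förster distance R0 (the separation at which transfer is 50% efficient); orientation effects are taken to be folded into R0. The function name and the example distances are hypothetical.

```python
def fret_efficiency(distance_angstrom, r0_angstrom=52.0):
    """Return the Foerster transfer efficiency E = 1 / (1 + (r / R0)**6).

    r0_angstrom defaults to 52 A, the CFP-YFP value quoted in the text.
    Treating that value as the Foerster distance R0 is an assumption.
    """
    if distance_angstrom < 0 or r0_angstrom <= 0:
        raise ValueError("distance must be >= 0 and R0 must be > 0")
    return 1.0 / (1.0 + (distance_angstrom / r0_angstrom) ** 6)


if __name__ == "__main__":
    # Efficiency drops steeply once the separation exceeds R0.
    for r in (26.0, 52.0, 78.0, 104.0):
        print(f"r = {r:5.1f} A   E = {fret_efficiency(r):.3f}")
```

Under these assumptions, doubling the separation from R0 to 2 x R0 lowers the efficiency from 50% to under 2%, which is why a detected signal implies that the two fluorophores are very close.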


It is also important to realize the limitations of this method. In contrast to in vitro FRET
experiments that allow careful calibration of concentrations and other parameters and the
use of small fluorophores, in vivo FRET measurements suffer from natural fluctuations
among cells and (currently) require very large fluorophores (28-kD GFPs). Thus, the precision
of in vitro FRET measurements simply cannot be obtained in vivo. Additionally, the
strict requirements for FRET (the distance between CFP and YFP and their relative orientation)
imply that negative results are inconclusive. For example, two target proteins may interact,
but the nature of the interaction under the experimental conditions may preclude
energy transfer.

The capabilities of in vivo FRET are enhanced in tractable genetic systems. For example, 
the budding yeast S. cerevisiae offers many advantages for FRET experiments due to the ease
of genetic manipulations: 


* DNA can be integrated into the genome at a targeted site, such that the fusion protein
replaces the endogenous protein in the cell, eliminating competition between the endogenous
and fusion proteins. The fusion protein is expressed under its own promoter, and
consequently, the expression level of the fusion protein is physiological and consistent from
cell to cell.

* The functionality of each fusion protein can be tested genetically by replacing the endogenous
protein with the fusion and testing for cell viability under conditions in which the
protein is required for growth. This assay ensures that a fusion protein can carry out all
essential functions of the endogenous protein.

* FRET experiments can be extended to different genetic backgrounds to study the physiological
relevance of specific interactions as well as the organization of protein complexes.

* Many potential protein-protein interactions can be screened with FRET in a relatively short
time period, allowing for the analysis of large protein complexes.


One application of FRET in the yeast system has been to study the nuclear pore complex 
(NPC). The NPC, which is central to the transport of macromolecules between the nucleus 
and cytoplasm, is a 60-MD membrane-embedded structure that has been difficult to analyze 
in its entirety in vitro. In one recent study of the NPC's structural organization, more than 
100 potential interactions between individual NPC components were screened with FRET 
and many novel interactions were identified (Damelin and Silver 2002). Another FRET study 
screened potential interactions between two transport receptors and many of the individual 
NPC components, thus mapping the translocation pathways of the receptors (Damelin and 
Silver 2000). Both of these studies demonstrated the potential of GFP-based FRET for ana- 
lyzing macromolecular protein complexes in vivo. A protocol focusing on a standard micro- 
scope-based assay for budding yeast is given in Protocol 8. Many aspects of this method are 
applicable to other systems and assays. 
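
The cross-talk correction mentioned earlier for the CFP-YFP pair can be sketched as a simple linear subtraction. This is a commonly used three-filter scheme and is shown only as an illustration; it is not necessarily the correction used in Protocol 8. The sketch assumes that the cross-talk coefficients are estimated from control cells expressing only CFP or only YFP, and all function names and intensity values are hypothetical.

```python
def bleed_coefficient(fret_channel, own_channel):
    """Mean ratio of FRET-channel to own-channel intensity in single-fluorophore control cells."""
    if len(fret_channel) != len(own_channel) or not fret_channel:
        raise ValueError("need equal-length, non-empty intensity lists")
    ratios = [f / o for f, o in zip(fret_channel, own_channel)]
    return sum(ratios) / len(ratios)


def corrected_fret(i_fret, i_cfp, i_yfp, cfp_coeff, yfp_coeff):
    """Subtract the estimated CFP and YFP cross-talk from the raw FRET-channel intensity."""
    return i_fret - cfp_coeff * i_cfp - yfp_coeff * i_yfp


if __name__ == "__main__":
    # Hypothetical control measurements (arbitrary fluorescence units).
    cfp_coeff = bleed_coefficient([50.0, 100.0], [100.0, 200.0])  # CFP-only cells
    yfp_coeff = bleed_coefficient([25.0, 50.0], [100.0, 200.0])   # YFP-only cells
    print(corrected_fret(300.0, 200.0, 400.0, cfp_coeff, yfp_coeff))
```

A corrected value near zero for a given pair is consistent with the FRET-channel signal being explained by cross-talk alone.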

Protocol 8, which measures FRET between two target proteins, can be extended to address
more complex questions. In particular, the relative ease of manipulations in yeast not only
facilitates the actual FRET experiments, as discussed above, but also encourages a wide range
of potential follow-up experiments to characterize a given interaction. A FRET interaction
between two target proteins can be compared in different genetic backgrounds. The interaction
may depend on other proteins in the protein complex or on signaling proteins that regulate
the interaction; mutations in the target proteins themselves may also influence the
interaction. Note that an added benefit of integrating the fusion constructs into the genome
is that mutant and wild-type alleles can be tagged with the same DNA, unless the mutation
is very close to the fusion site. For example, the same integration vector was used to fuse YFP
to NUP1 and the nup1-8 mutant (Damelin and Silver 2000).

The dependence of a FRET interaction on cell cycle phase or on defined environmental
conditions can also be investigated. Synchronization of a cell population, drug treatments,
and other environmental stimuli can be used to characterize an interaction further. However,
because data acquisition can take a long time, dividing a cell culture and staggering the
treatments may be necessary.

Ultimately, the study of protein interactions must extend beyond the confines of two
target proteins into the realm of large multiprotein complexes. The FRET protocol is applicable
to such studies because the noninteracting proteins within the complex can serve as
control proteins. To this end, as many proteins as possible in a given complex are tagged with
CFP and YFP. A recent study on the NPC provides an example of using FRET to study the
structural organization of a large complex (Damelin and Silver 2002). Fifteen components of
the NPC were tagged with CFP and YFP, and the various combinations were assayed for
FRET. Nine novel interactions and four previously documented ones were identified.
Interestingly, when the CFP tag on a 120-kD component was moved from the carboxyl to the
amino terminus, different FRET interactions were observed. This result demonstrates the
specificity of the FRET signal and also illustrates the utility of generating both amino- and
carboxy-terminal fusions (see Protocol 8, Step 1).


PROTEIN-PROTEIN INTERACTIONS CAN BE ANALYZED BY DETERMINING 
BINDING CONSTANTS 


Understanding the strength of the interaction between two proteins in a cell is crucial to an 
understanding of the consequences of that interaction for cell function. To assess the interaction
between two proteins, detailed information concerning the following parameters is required: 


* Binding constant describing the interaction of one protein with another. For a simple
bimolecular interaction between two proteins P1 and P2, the interaction is governed by the
binding constant K_d, according to the equation K_d = [P1][P2]/[P1-P2], where [P1] and [P2]
refer to the free (i.e., unbound) concentrations of P1 and P2, respectively. The interaction
between proteins P1 and P2 is also expressed in two other ways. First, it is often expressed
as an affinity constant, K_a = [P1-P2]/([P1][P2]), i.e., K_a = 1/K_d. Second, it is often expressed
as a ratio of two rate constants. The rate of formation (association) of P1-P2 is k_a[P1][P2],
where k_a is the association rate constant, and the rate of breakdown (dissociation) of P1-P2
is k_d[P1-P2], where k_d is the dissociation rate constant. At equilibrium, the rate of formation
of P1-P2 equals the rate of breakdown of P1-P2 (i.e., K_d = k_d/k_a).


* Concentrations of the interacting proteins. In addition to knowledge of the K_d values for
two interacting proteins, the total cellular concentrations of P1 and P2 (i.e., the sum of
bound and free concentrations) are needed to evaluate the extent to which P1 and P2 can
interact.


* Solution conditions. Salt concentration, pH, and excipients that cause "molecular crowding"
(e.g., polyethylene glycol) (for reviews on molecular crowding, see Ellis 2001 and
Verkman 2002) can markedly affect the strength of protein-protein interactions.


* Role of cofactors. Several factors can influence the nature of interacting proteins, especially
small effector molecules and ions such as Ca2+, cofactors such as the low-molecular-weight
metabolites ATP and GTP, and posttranslational modifications (e.g., phosphorylation,
glycosylation, acylation, and alkylation). Other macromolecules such as DNA, RNA,
and proteins can influence protein-protein interactions by forming multiprotein complexes
involving a large number of components.


* Influence of competing proteins. Even if two proteins (P1 and P2) have a high affinity for
each other, and their total cellular concentrations are sufficiently high for them to interact
functionally within the cell, they may not do so in vivo. If a protein can interact with more
than one other protein (i.e., P1-P3, P1-P4, P1-Pn, etc.), then these other proteins may
effectively compete with P2 for P1. For instance, some of the competing proteins (P3, etc.)
may be present in concentrations 1000-fold higher than P2, and even if their affinity is
much lower (10-fold) than that of P1-P2, they will interact with (titrate out) P1, leaving
negligible amounts of P1 available to interact with P2. Of course, this problem is compounded
by the likelihood of P3, etc., also interacting with a different subset of proteins.
(It is now generally recognized that proteins do not exist in isolation within a cell and that
they are involved in more than one intracellular pathway, having several interacting partners.)
Moreover, the ability of a protein to interact with a particular protein, or subset
of other proteins, is greatly influenced by its intracellular location. Although two proteins
may be capable of interacting in vitro, the question remains as to whether they have the
opportunity to interact with each other in vivo, especially if they are located in different
cellular compartments. Some proteins are regulated in part by their partitioning between
different intracellular compartments and can only interact when they traffic between these
compartments (e.g., transcription factors are regulated by their partitioning between the
cytoplasm and the nucleus, and they can only interact with the transcription machinery
when they partition to the nucleus).
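
The relationships in the list above can be illustrated numerically. The sketch below is a minimal example rather than a prescribed analysis. It computes the dissociation constant from the dissociation and association rate constants, solves the simple bimolecular equilibrium for the complex concentration from total concentrations, and estimates the effect of a competitor. The competition formula assumes that P1 is limiting, so that the free concentrations of P2 and P3 are close to their totals. All function names and numerical values are hypothetical.

```python
import math


def kd_from_rates(k_off, k_on):
    """Dissociation constant (M) from dissociation (1/s) and association (1/(M s)) rate constants."""
    return k_off / k_on


def complex_concentration(p1_total, p2_total, kd):
    """Equilibrium [P1-P2] from total concentrations.

    Solves kd = (p1_total - c)(p2_total - c) / c for the physically valid root.
    """
    b = p1_total + p2_total + kd
    return (b - math.sqrt(b * b - 4.0 * p1_total * p2_total)) / 2.0


def fraction_p1_bound_to_p2(p2_free, kd2, p3_free=0.0, kd3=1.0):
    """Fraction of P1 in the P1-P2 complex when P3 competes for the same site.

    Assumes P1 is limiting, so free P2 and P3 are approximately their totals.
    """
    x2 = p2_free / kd2
    x3 = p3_free / kd3
    return x2 / (1.0 + x2 + x3)


if __name__ == "__main__":
    kd = kd_from_rates(k_off=1e-3, k_on=1e5)  # hypothetical rate constants
    print(f"Kd = {kd:.1e} M")
    print(f"[P1-P2] = {complex_concentration(2e-6, 5e-6, 1e-6):.2e} M")
    # Competitor at 1000-fold higher concentration with 10-fold lower affinity.
    print(fraction_p1_bound_to_p2(1e-8, 1e-8))
    print(fraction_p1_bound_to_p2(1e-8, 1e-8, p3_free=1e-5, kd3=1e-7))
```

With these hypothetical numbers, the competitor reduces the fraction of P1 bound to P2 from one half to about 1%, in line with the "negligible amounts" described above.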


Several methods are available for determining binding constants, such as surface plas- 
mon resonance (Schuck 1997a,b; Medaglia and Fisher 2001, 2002; Natsume et al. 2001, and 
references therein), gel filtration, sedimentation equilibrium, microcalorimetry (see 
Appendix 2), and various fluorescence methods (for fluorescence polarization assay and flu- 
orescence gel-retardation assay, see Protocol 9) (for a more detailed understanding of bind- 
ing constants, see Cantor and Schimmel 1980; Lakey and Raggett 1998). 


PROTEIN MICROARRAYS FACILITATE THE LARGE-SCALE STUDY OF PROTEINS 


During the past decade, microarray technology has revolutionized the study of gene expres- 
sion. Arrays of nucleic acids, composed of either single-stranded oligonucleotides or double- 
stranded PCR products, have been used to measure the abundance of thousands of different 
nucleic acid species in cell or tissue samples. Because the microarray format has proven to be 
both convenient and effective for studying nucleic acids, many have anticipated that microar- 
rays of proteins would similarly facilitate the large-scale study of proteins. 


Protein Profiling versus Protein Function 


There are two primary areas in which microarrays of proteins can prove useful: protein
profiling and the determination of protein function. Protein profiling is most analogous to
transcription profiling (the large-scale study of gene expression) and involves measuring the
abundance, modification, and localization of many different proteins in a cell or tissue
sample. This requires arrays of immobilized molecules that can selectively capture specific
proteins from complex mixtures and report on their abundance and state of posttranslational
modification. The arrayed molecules may be other cellular proteins, but they are more often
antibodies, antibody mimics, or even nonproteinaceous receptors such as nucleic acid
aptamers. Because antibodies are most often used, however, array-based protein profiling is
usually discussed in the context of protein arrays.

Array-based protein profiling is not to be confused with the other application of protein 
arrays: the high-throughput study of protein function. For this application, the proteins
being studied are themselves immobilized in a microarray format. Different assays may then 
be performed on the arrays to investigate various aspects of the proteins’ functions, includ- 
ing their ability to bind to other molecules (including other proteins) or their ability to serve 
as substrates for protein-modifying enzymes (such as kinases or proteases). 


Fabricating Protein Arrays 


The substrate 


Because the uses for protein arrays are many and varied, no single substrate has been adopt- 
ed as a standard. However, a number of constraints direct the choice of substrate and con- 
tinue to drive the field to improve in this area. 


* Many applications currently use fluorescence as a detection method. It is therefore best if
the substrate is intrinsically nonfluorescent. For this reason, glass is most often used,
although plastics and membranes are also used.

* Most applications require the proteins being arrayed to remain folded and functional
throughout the experiment. Hydrophilic substrates are therefore most often employed. For
the most demanding applications (such as profiling low-abundance proteins in cell lysates),
it is also important to use a surface with very low nonspecific binding to minimize
background levels on the arrays. Surface chemistry plays an important role because the proteins
being arrayed must attach to the substrate irreversibly and yet remain functional.

* It is useful for the substrate to be compatible with standard arrayers and scanners, which
typically accommodate 2.5-cm x 7.5-cm x 1-mm microscope slides. Many vendors now sell
slides in this format that can be used for a variety of applications. Typically, these slides
come with protocols for arraying, blocking, and processing. With that said, two substrates
are particularly useful in our hands: aldehyde slides and BSA-NHS slides. Protocol 10
details the preparation of these slides.


Sample preparation and storage 


Some proteins require special buffer compositions to retain their activity. In addition, differ- 
ent substrates use different chemistries to mediate attachment of the arrayed proteins. For 
these reasons, there is no general solution when preparing proteins for arraying. However, for 
the substrates described above (aldehyde slides and BSA-NHS slides), the parameters dis- 
cussed below should be considered. 


Protein concentration and humectants 


The concentration of the protein being spotted determines how much attaches covalently to 
the surface of the slide within a reasonable period of time. If no humectant is included in the
protein sample, the nanodroplet will evaporate within a second or two of spotting, and the
concentration of the protein will increase dramatically. Under these conditions, less protein 
is required for efficient attachment, but the protein will often denature. As a result, we typi- 
cally include 20-40% glycerol in our protein samples to prevent evaporation. 

The inclusion of glycerol has the added benefit that it stabilizes many proteins and
enables longer-term storage at -20°C. Assuming the inclusion of glycerol, most assays require
the protein being spotted to be present at a concentration of at least 100 µg/ml. Higher
concentrations yield more intense spots, but little benefit is derived from exceeding 1 mg/ml.
Demanding applications, such as monitoring low-abundance proteins with arrayed antibodies,
typically require concentrations of at least 0.5 mg/ml.

If short peptides are being spotted, it is more appropriate to adjust their concentration in
units of molarity, rather than mg/ml. This is because peptides will most often attach via their
α-amine (one per peptide). With proteins, the contribution from lysine residues becomes
more significant (more lysines per molecule) and hence the unit "mg/ml" is more appropriate
because it provides a better estimate of the concentration of primary amines in the sample.
When spotting peptides, we have found that a concentration of 20-40 µM yields good
results, but lower concentrations are also acceptable. (Note that 40 µM corresponds to 0.1
mg/ml for a 2.5-kD peptide.)
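
The conversion in the note above (40 µM of a 2.5-kD peptide is 0.1 mg/ml) can be reproduced with a small helper. This is a minimal sketch; the function names are hypothetical, and the molecular weight is assumed to be supplied in daltons (g/mol).

```python
def micromolar_to_mg_per_ml(conc_um, mol_weight_da):
    """Convert a micromolar concentration to mg/ml.

    micromol/L x g/mol gives microgram/L; dividing by 1e6 gives g/L, which equals mg/ml.
    """
    return conc_um * mol_weight_da / 1e6


def mg_per_ml_to_micromolar(conc_mg_ml, mol_weight_da):
    """Convert mg/ml to a micromolar concentration (inverse of the function above)."""
    return conc_mg_ml * 1e6 / mol_weight_da


if __name__ == "__main__":
    # Worked example from the text: 40 micromolar of a 2.5-kD peptide.
    print(micromolar_to_mg_per_ml(40.0, 2500.0))
    # The 100 microgram/ml protein guideline expressed in molar terms for a hypothetical 50-kD protein.
    print(mg_per_ml_to_micromolar(0.1, 50000.0))
```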


Buffer, pH, and ionic strength 


Proteins attach to both aldehyde slides and BSA-NHS slides via nucleophilic groups on their
surface (the ε-amine of lysine residues or the α-amine at the amino terminus). As such, the
buffer cannot contain molecules with strong nucleophiles, such as primary amines. Tris
(Tris[hydroxymethyl]aminomethane) (pKa = 8.2), a commonly used buffer, is not compatible
with these slides. However, the following are some commonly used buffers that are compatible
with both aldehyde and BSA-NHS slides:


MES (2-[N-morpholino]ethanesulfonic acid) (pKa = 6.2)

PIPES (piperazine-N,N'-bis[ethanesulfonic acid]) (pKa = 6.8)

Phosphate (pKa2 = 7.2)

MOPS (3-[N-morpholino]propanesulfonic acid) (pKa = 7.3)

HEPES (N-2-hydroxyethylpiperazine-N'-2-ethanesulfonic acid) (pKa = 7.7)

Borate (pKa = 9.2)


The pH of the buffer affects the rate at which the proteins attach to the surface of the 
slides. High pH favors rapid attachment. In practice, if the protein is printed at a pH of 7 or 
higher, it attaches efficiently to the slide. If lower-pH buffers are required, it may be necessary
to use higher concentrations of proteins or longer reaction times to achieve acceptable results. 
These parameters depend on the specific application and are best determined empirically. 
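
The buffer list and pH guidance above can be encoded as a small lookup, shown here only as an illustration. The pKa values are those listed in the text. The selection rule, that a buffer is useful within about one pH unit of its pKa, is a general rule of thumb and an assumption of this sketch, as are the function and variable names.

```python
# pKa values as listed in the text for amine-free buffers compatible with both slide types.
COMPATIBLE_BUFFERS = {
    "MES": 6.2,
    "PIPES": 6.8,
    "Phosphate": 7.2,  # pKa2
    "MOPS": 7.3,
    "HEPES": 7.7,
    "Borate": 9.2,
}


def buffers_for_ph(target_ph, window=1.0):
    """Return compatible buffers whose pKa lies within `window` pH units of target_ph.

    The one-unit default window is a rule of thumb, not a value from the text.
    Results are ordered by increasing pKa.
    """
    hits = [(pka, name) for name, pka in COMPATIBLE_BUFFERS.items()
            if abs(pka - target_ph) <= window]
    return [name for pka, name in sorted(hits)]


if __name__ == "__main__":
    # The text recommends printing at pH 7 or higher for efficient attachment.
    print(buffers_for_ph(7.5))
    print(buffers_for_ph(9.0, window=0.5))
```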

Ionic strength can have an important role in ensuring that a protein remains folded and 
functional. Fortunately, ionic strength has little impact on either spotting or attachment and 
so can be set appropriately depending on the proteins being arrayed. 


Reductants 


Many proteins require reductants in the buffer to prevent unwanted disulfide bonds from
forming. In addition, proteins often suffer from unwanted oxidation (such as at methionine
residues) when stored without reductants. Since both dithiothreitol (DTT) and β-mercaptoethanol
contain thiols, which are nucleophilic, they cannot be included in the buffer. To
circumvent this problem, 1 mM Tris(2-carboxyethyl)phosphine hydrochloride (TCEP; from
Pierce Endogen, Rockford, Illinois) has been used successfully as a reductant. This is a
water-soluble reagent that effectively reduces disulfide bonds but does not interfere with the
slide chemistry.


Detergents and denaturants 


In general, it is best to avoid the inclusion of detergents in the buffer. Although they do not 
interfere with the slide chemistry, detergents reduce surface tension and so result in larger, 
more diffuse spots. If detergents are absolutely necessary, they may be included, but the
density of the spots on the arrays must be decreased accordingly. For some applications, it 
may also be desirable to array proteins or peptides under denaturing conditions and then 
refold the immobilized molecules on the slide surface. This may be done by including 6 M 
guanidine hydrochloride in the buffer. Because guanidine hydrochloride also acts as a 
humectant, it is not necessary to include glycerol under these circumstances. However, the 
inclusion of 20-40% glycerol results in more uniform spots and so is usually desirable. If 
guanidine hydrochloride is included in the spotting buffer, it is best not to use BSA-NHS
slides since guanidine hydrochloride alters their surface and results in increased nonspecific
binding.


Printing the arrays 


Protein microarrays can be fabricated using most commercial or homemade arrayers. 
However, the quality and consistency of the spots vary depending on the arraying method. 
We typically array proteins in 40% glycerol/60% phosphate-buffered saline (PBS), which is 
relatively viscous. Split pins can pick up and deliver this buffer; however, the size of the spots 
tends to decrease on successive slides. To avoid this problem, the Affymetrix 417 arrayer 
(Affymetrix, Santa Clara, California), which uses a ring-and-pin system to pick up and deliv- 
er the sample, can be used. Noncontact methods also work well; for example, PerkinElmer 
Life Sciences sells the BioChip Arrayer and SpotArray Enterprise, which use piezo-electric 
dispensing to deliver sub-nanoliter volumes to the surface of the slide and can accommodate 
solutions with 40% glycerol. Regardless of which arrayer is used, care should be taken in the 
following areas: 


* Humidity. Although glycerol is very effective at preventing the spots from evaporating, very
low humidity still takes its toll. To control the humidity, the arrayer should be enclosed
(most commercial arrayers are) and outfitted with a humidifier and humidistat set at
60-70% relative humidity.

* Temperature. Arrays can be fabricated at room temperature; however, the sample plates
should be kept at 4°C when not in use. (Some arrayers feature automated plate handling
and keep the plates in a chilled hotel.) Of particular concern is that the interior of some
arrayers heats up during a print run due to poorly located motors or excessive friction. It
is advisable to monitor the temperature during extended print runs and take appropriate
measures to counteract this problem. Some arrayers also tend to heat the pins while drying
them. Make sure that the arrayer pauses long enough for the pins to cool down before
inserting them in the next sample.

* Carryover. Because buffers containing 40% glycerol are viscous, extra care should be taken
in washing the pins or rings between samples to avoid carryover.

FIGURE 10.7. Aldehyde slide (see text for details). (Courtesy of Gavin MacBeath, Harvard University.)


Quenching arrays 


After printing the arrays, sufficient time must be allowed for the proteins to attach to the slide 
surface. On aldehyde slides, amines on the proteins form a Schiff base with the aldehydes on 
the slide (see Figure 10.7). This reaction is very rapid, so the slides may be quenched as soon 
as 15 minutes after arraying is complete (longer incubation times are not detrimental). On the 
BSA-NHS slides, amines on the proteins form amide or urea bonds with the activated car- 
boxylates or activated amines on the BSA, respectively (see Figure 10.8). This reaction is slow- 
er than Schiff base formation, and hence, there should be a wait of at least 1 hour before 
quenching the slides. A method for quenching slides that prevents smearing the arrayed 
droplets is provided in the panel on ADDITIONAL PROTOCOL: PREVENTING THE FORMATION OF 
COMETS at the end of Protocol 11.

FIGURE 10.8. BSA-NHS slides (see text for details). (Courtesy of Gavin MacBeath, Harvard University.)

FIGURE 10.9. Protein-protein interactions (see text for details). (Courtesy of Gavin MacBeath, Harvard
University.)

FIGURE 10.10. Protein-small molecule interactions (see text for details). (Courtesy of Gavin MacBeath,
Harvard University.)


Assays with Protein Arrays


Protein-protein interactions


Protein arrays can be used to screen for protein-protein interactions. For this application, a
protein microarray is incubated with a fluorescently labeled protein, the array is washed, and
stable interactions are identified by scanning the slide for fluorescent spots (see Figure 10.9).
It is also possible to probe the array with an epitope-, hapten-, or biotin-tagged protein and
subsequently visualize bound proteins with a fluorescently labeled antibody or streptavidin
conjugate. However, direct labeling of the protein of interest produces the best results. For a
labeling and probing protocol for protein-protein interactions, see Protocol 11.


Protein-small molecule interactions 


Protein arrays can also be used to screen for protein-small molecule interactions. For this 
application, a protein microarray is incubated with a fluorescently labeled compound, the 
array is washed, and stable interactions are identified by scanning the slide for fluorescence 
(see Figure 10.10 and Protocol 12). It is convenient to label compounds by covalently linking 
them to a carrier protein, such as bovine serum albumin (BSA), that has previously been 
labeled with a fluorophore. Not only is it often more convenient to label compounds in this way
(rather than by directly coupling them to a fluorescent molecule), but the carrier protein also 
aids in rendering the labeled compound water-soluble. In addition, the valency of the conju- 
gate can be increased to detect low-affinity interactions if desired. If the compounds cannot 
easily be immobilized, they may in principle be labeled with tritium, and the interactions 
identified on the basis of radioactive spots. However, this approach has not yet been tested. 


Kinase-substrate interactions 


In addition to screening for long-lived interactions, protein arrays can also be used to screen
for transient interactions, such as those between an enzyme and its substrate. As long as the
enzyme modifies its protein substrate and that modification can be detected, the transient
interaction between enzyme and substrate can be inferred. Perhaps the most useful application
in this area is the identification of putative substrates for protein kinases. Protein kinases
modify their substrates by transferring a phosphate group from adenosine 5'-triphosphate
(ATP) to a side chain on the protein (serine, threonine, tyrosine, or histidine). To identify
substrates of protein kinases, a protein microarray is incubated with a kinase and [γ-P]ATP
(see Figure 10.11). Following an appropriate incubation, the array is washed, coated with a
photographic emulsion, further incubated, and finally developed and imaged with a scanning
light microscope (for details, see Protocol 13).

FIGURE 10.11. Enzyme-substrate interactions (see text for details). (Courtesy of Gavin MacBeath, Harvard
University.)


Protein profiling 


In addition to screening for interactions, protein arrays can be used to quantitate the abun- 
dance and modification states of proteins in complex mixtures (such as serum, cell culture 
supernatants, or even cell lysates). For this application, it is necessary to array receptor mol- 
ecules, such as antibodies, on the slides and then use these molecules to capture specifically 
their cognate antigens from solution. Although various label-free detection methods are 
under development, it is likely that investigators will continue to rely on fluorescent labeling 
methods for some time. There are two ways to detect the captured proteins using fluorescent 
dyes. The direct method (see Figure 10.12) relies on labeling all of the proteins in the com- 
plex solution with a fluorescent dye and then capturing the labeled proteins on the antibody 
array. The advantage of this method is that two different samples can be compared directly 
in a competitive binding experiment using two different colored dyes. The disadvantage of 
this approach, however, is that it is often difficult to label low-abundance proteins with high 
efficiency. Moreover, this approach requires extremely specific reagents to analyze low-abundance
proteins (cross-reactivity of the reagents becomes a problem when the antigen is present
at a much lower concentration than other proteins with similar epitopes).

The other way to detect the captured proteins does not require labeling the proteins
themselves. Instead, each protein is captured by one reagent and then detected in a second
step with a second reagent (see Figure 10.13). The second reagent (antibody) recognizes the
protein at a site that does not overlap with the recognition site of the first reagent (antibody).
In this sandwich approach, the second reagent is labeled and so provides the signal. The
advantage of this approach is that it does not require labeling the proteins themselves.
Moreover, additional specificity is gained by using two different reagents for each protein.
The disadvantage, however, is that it is more difficult to assemble matched pairs of antibodies
for each protein of interest. Although there are reasons to pursue both approaches, the
sandwich approach is more appropriate for analyzing very complex solutions (cell lysates),
and this method is detailed in Protocol 14.

FIGURE 10.12. Antibody arrays: Direct labeling (see text for details). (Courtesy of Gavin MacBeath,
Harvard University.)


Reading and Interpreting the Arrays 


One of the advantages of constructing protein arrays on glass slides is that they can be 
scanned and analyzed using commercially available instrumentation. Several companies sell 
fluorescence slide scanners. In general, there are two types of scanners: those that use lasers 
to excite the fluorophores on the slide and a photomultiplier tube (PMT) to measure fluo- 
rescence and those that use a white-light source coupled with filters to illuminate the slide 
and a charge-coupled device (CCD) to measure fluorescence. Both types of instruments can 
be used to read protein arrays. The advantage of the PMT is that it is often faster and less 
expensive, whereas the advantage of CCD is that it tends to offer a higher signal-to-noise 
ratio and is more flexible with respect to the range of fluorophores that can be read. 

All commercially available scanners come with image-processing software. At a minimum,
the software enables the fluorescence of each spot to be quantified and the data to be
exported to a spreadsheet. The analysis of protein array data may become more sophisticated
in the future; however, at present, we have used relatively simple manipulations to interpret
our results. For antibody array data, changes in fluorescence from one sample to the next
are best expressed as fold-changes. If purified antigens are available, standard curves can be
generated, offering absolute, rather than relative, quantification.
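
The two quantification modes just described can be sketched in a few lines of Python. This is only an illustration and not the authors' analysis software: the function names, the choice of an ordinary least-squares straight line for the standard curve, and the numbers in the usage note are all assumptions. Real standard curves are often nonlinear at the extremes and may need a different model.

```python
def fold_change(sample_signal, reference_signal):
    """Relative change in spot fluorescence between two samples."""
    if reference_signal <= 0:
        raise ValueError("reference signal must be positive")
    return sample_signal / reference_signal


def fit_standard_curve(concentrations, signals):
    """Fit signal = slope * concentration + intercept by least squares.

    A linear response over the calibrated range is an assumption of
    this sketch, not something stated in the text.
    """
    n = len(concentrations)
    if n < 2 or n != len(signals):
        raise ValueError("need at least two matched standard points")
    mean_c = sum(concentrations) / n
    mean_s = sum(signals) / n
    sxx = sum((c - mean_c) ** 2 for c in concentrations)
    if sxx == 0:
        raise ValueError("standards must span more than one concentration")
    sxy = sum((c - mean_c) * (s - mean_s)
              for c, s in zip(concentrations, signals))
    slope = sxy / sxx
    intercept = mean_s - slope * mean_c
    return slope, intercept


def concentration_from_signal(signal, slope, intercept):
    """Invert the fitted line to estimate antigen concentration."""
    if slope == 0:
        raise ValueError("a flat standard curve cannot be inverted")
    return (signal - intercept) / slope
```

With hypothetical purified-antigen standards at 0, 1, 2, and 4 concentration units giving signals of 100, 300, 500, and 900, the fitted line can be used to convert an unknown spot intensity into an absolute amount, whereas `fold_change` needs only the two sample intensities.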

Interaction data require more processing. One of the powerful aspects of protein arrays
is that they can be used to determine the full n x n matrix of protein-protein interactions
within a set of n proteins. To do this, n arrays of all n proteins are prepared and each array is

FIGURE 10.13. Antibody arrays: Sandwich labeling (see text for details). (Courtesy of Gavin MacBeath,
Harvard University.)


probed with a different member of the set. Each labeled protein, however, will produce a dif- 
ferent level of background binding to the arrays and a different maximum signal for specific 
binding. As a result, it is necessary to apply some sort of normalization to determine which 
interactions are real and which are likely to arise from nonspecific binding. We have found it 
useful to quantitate the fluorescence of each spot on an array and then divide that intensity 
by the mean intensity of all the spots on the array to yield an "x-fold above the mean" value. 
An arbitrary decision must then be made as to how large an "x-fold above the mean" value is
required to score the interaction as real.

As a further complication, some proteins on the array exhibit higher levels of nonspecific
binding than others. If only a single array is processed, it is impossible to determine whether
the signal at a given spot is due to a specific binding event or arises from nonspecific binding
to an intrinsically "sticky" protein. One advantage of the n x n experiment is that it is possible
to see how each immobilized protein interacts with all n labeled proteins. Thus, a second
level of confidence can be gained by analyzing each immobilized protein over the n different
arrays. Each "x-fold above the mean" value for a given immobilized protein can be divided by
the mean of the "x-fold above the mean" values for that protein on all n arrays. This effectively
removes the nonspecific binding effect and highlights interactions that are more likely to be real.
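
The two normalization passes described above can be written compactly. The following Python sketch is illustrative only: the data layout (`raw[a][i]` is the fluorescence of immobilized protein `i` on the array probed with labeled protein `a`), the function names, and the threshold are assumptions, and the threshold in particular remains the arbitrary choice the text mentions.

```python
def fold_above_array_mean(raw):
    """Pass 1: divide each spot by the mean intensity of its own array."""
    normalized = []
    for row in raw:
        array_mean = sum(row) / len(row)
        normalized.append([value / array_mean for value in row])
    return normalized


def correct_for_sticky_proteins(fold):
    """Pass 2: divide each value by the mean for the same immobilized
    protein across all arrays, which damps uniformly 'sticky' spots."""
    n_arrays = len(fold)
    n_spots = len(fold[0])
    spot_means = [sum(fold[a][i] for a in range(n_arrays)) / n_arrays
                  for i in range(n_spots)]
    return [[fold[a][i] / spot_means[i] for i in range(n_spots)]
            for a in range(n_arrays)]


def score_interactions(raw, threshold):
    """Return (array index, spot index) pairs whose doubly normalized
    value reaches the user-chosen threshold."""
    corrected = correct_for_sticky_proteins(fold_above_array_mean(raw))
    return [(a, i)
            for a, row in enumerate(corrected)
            for i, value in enumerate(row)
            if value >= threshold]
```

Keeping the two passes as separate functions makes it easy to inspect the intermediate "x-fold above the mean" table before the per-protein correction is applied.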


PROTOCOL 1


Using FLAG Epitope-tagged Proteins for 
Coimmunoprecipitation of Interacting Proteins 


THIS PROTOCOL WAS CONTRIBUTED BY Anne Verhagen (Walter and Eliza Hall Institute of
Medical Research, Parkville, Australia).

Protein-protein interactions have a major role in transducing and regulating signaling
pathways in cells. One way to detect protein interactions is by coimmunoprecipitation. In the
initial analysis, immunoprecipitates can be prepared from [35S]methionine-labeled cell
lysates and examined. When a 35S-labeled protein is detected that appears to coimmunoprecipitate
specifically with the protein of interest (protein X), the procedure can be scaled
up to generate sufficient amounts of the coimmunoprecipitating protein for detection by
Coomassie Blue staining. This is normally sufficient for identification by MS technology.
When examining protein interactions, it may be possible to immunoprecipitate endogenous
protein X directly if antibodies are available. Alternatively, ectopically expressed (transiently
or stably expressed) epitope-tagged protein X (e.g., FLAG-protein X) can be immunoprecipitated
using commercially available epitope-tag-specific monoclonal antibodies (mAbs). The
advantage of examining interactions of endogenously expressed proteins is that these are
more likely to be physiological and less likely to be an artifact of overexpression. In scaling up
the procedure, however, epitope-tagged overexpressed protein X may be necessary for sufficient
coimmunoprecipitating protein to be isolated. This protocol describes the use of
FLAG-epitope-tagged proteins for both small-scale analysis and large-scale coimmunoprecipitation
of interacting proteins.


MATERIALS


» Reagents


CAUTION: See Appendix 3 for appropriate handling of materials marked with <!>. 


Anti-FLAG M2 agarose affinity gel (Sigma) 
Glycine (100 mM, pH 3) <!> 
Horseradish peroxidase (HRP)-conjugated anti-mouse secondary antibody
IEF Buffer 
9 M urea <!> 
324 mM dithiothreitol (DTT) <!>
2% pharmalytes 3-10 (Amersham Biosciences) 
0.5% Triton X-100 <!> 
Lysis buffer
1% Triton X-100
10% glycerol
150 mM NaCl
20 mM Tris (pH 7.5) <!>
2 mM EDTA


Supplement lysis buffer with 1 mM PMSF <!>, 10 µg/ml aprotinin <!>, and 16 µg/ml leupeptin <!>
immediately before use.


[35S]Methionine or [35S]methionine/[35S]cysteine <!>
NaCl (saline) (150 mM)

Phosphate-buffered saline (PBS)

Polyethylenimine (PEI) (800 kD) (Fluka) <!>


Prepare a 0.9 mg/ml stock in H2O, adjust pH to 7, and filter sterilize. This stock can be kept at 4°C
wrapped in foil for several months. Always vortex the tube immediately before use.


RF10 medium
RPMI 1640 media
10% fetal calf serum
1% penicillin-streptomycin <!>
2% glutamine


RPMI 1640 media (serum-free) (Invitrogen Life Technologies)
The use of RPMI and RF10 media is appropriate with 293 T cells. Other cells may require different
media.


4x SDS-PAGE loading buffer
200 mM Tris (pH 6.8)
400 mM DTT
8% SDS <!>
0.4% bromophenol blue <!>
40% glycerol

Sodium azide (1%) <!>

Tris (1 M, pH 8)

Trypsin (cell-culture grade)


» Equipment


Centrifuge, large, with fixed-angle rotor (e.g., Sorvall)

Fluorescence microscope with appropriate filters for detection of GFP

Incubator, cell culture at 37°C and 5% CO2

Microfuge 

Microfuge tubes 

Needle (23-gauge) 

Poly-prep chromatography columns (10-ml) (Bio-Rad) 

Rotating wheel or other suitable mixing device at 4°C

SDS-PAGE and 2D-IPG/SDS-PAGE apparatus 

Ultrafiltration column (e.g., Centricon-10 for 10,000 m.w. cutoff, Millipore) 
See note at Step 40. 
» Biological Molecules


Anti-FLAG M2 monoclonal antibody (Sigma)


Mammalian expression vector encoding FLAG-epitope-tagged protein X
Vectors for expression of FLAG-epitope-tagged proteins under the control of a cytomegalovirus
(CMV) promoter are available from Sigma (pFLAG-CMV series 1-7).
Optional: Mammalian expression vector encoding GFP (CLONTECH)
See the note to Step 3.


293 T cells


METHOD


The following protocols are required for both initial analysis of protein-protein interactions
and large-scale purification of protein-X interacting proteins. A summary diagram of the
various steps is given in Figure 10.14.

FIGURE 10.14. Use of FLAG-epitope-tagged proteins for coimmunoprecipitation of interacting proteins.
In the preliminary analysis, 293 T cells are transfected with FLAG-epitope-tagged protein X and an unrelated
FLAG-tagged protein (FLAG-control). Following labeling of the cells with [35S]methionine, the cells
are lysed, and an immunoprecipitate is prepared using FLAG-agarose beads. The immunoprecipitate is analyzed
by IPG/SDS-PAGE. FLAG-tagged proteins (F), proteins that nonspecifically interact (NS), and a protein
that specifically interacts (S) are indicated. For the identification of protein S, large-scale transfection
of 293 T cells with FLAG-protein X is performed. Protein X and interacting proteins are isolated on a
FLAG-agarose column, concentrated on a Centricon ultrafiltration column, separated by 2D gel electrophoresis,
and revealed by Coomassie Blue staining. Protein S is then identified by mass spectrometric techniques.
(Courtesy of Anne Verhagen, Walter and Eliza Hall Institute of Medical Research, Parkville, Australia.)

TRANSFECTION OF CELLS WITH VECTORS ENCODING EPITOPE-TAGGED PROTEINS


Transfection of mammalian cells with plasmid vectors can be performed using many dif- 
ferent procedures. The transfection procedure given below has been used successfully for 
the transfection of 293 T cells. Alternative methods are described by Sambrook et al. 
(1989) and Sambrook and Russell (2001). Many commercial transfection kits are also 
available. 


TRANSFECTION USING POLYETHYLENIMINE 


This procedure was first described in detail by Boussif et al. (1995). The method below has
been adapted for the transfection of 293 T cells (Verhagen et al. 2000).


1. Grow 293 T cells in RF10 medium in a humidified incubator at 5% CO2 and 37°C.


2. On day 1, harvest the exponentially growing cells by trypsinization and plate out the cells 
at a density of 1 X 10* cells for 10-cm plates or 3 x 10^ cells per 15-cm plate. 


3. On day 2, for a 10-cm plate, mix 12 µg of mammalian expression vector encoding
FLAG-epitope-tagged protein-X DNA with 600 µl of PBS. In a separate tube, mix 48 µl of PEI
(0.9 mg/ml) with 600 µl of saline. For 15-cm plates, scale up appropriately. Vortex both
tubes for 10 seconds and store for 10 minutes at room temperature.


Optional: It is helpful to have a small amount of GFP-encoding vector (1.2 µg per 10-cm plate)
in the DNA mix together with the FLAG-protein X encoding vector (10.8 µg). This enables
transfection efficiency to be assessed by examining the cells under a fluorescence microscope.
Green cells are normally evident 24 hours posttransfection.


4. Carefully add the PEI mix to the DNA mix. Vortex again and leave for a further 10 min- 
utes. 


5. Rinse the cells once with serum-free RPMI and add 8 ml of fresh serum-free RPMI (sup- 
plemented with glutamine and antibiotics) to the 10-cm plates (or 20 ml/15-cm plate). 


6. Add the DNA/PEI mix dropwise to the plates and gently mix by tipping the plate back- 
ward and forward. 


7. Incubate the cells for 4 hours in the incubator. 


8. Remove the media, add fresh RF10 medium, and return the cells to the incubator
overnight.


Metabolic Labeling for Small-scale Analysis


In the initial analysis of protein interactions, metabolic labeling is used to detect
coimmunoprecipitating proteins.


9. One day posttransfection, remove cell culture media and add 3 ml of fresh RF10 medium
supplemented with [35S]methionine (final concentration 0.05 mCi/ml).


10. Return the cells to the incubator for 16 hours. The long labeling time allows for the labeling
of proteins that do not have a high turnover rate.


Optional: A higher level of incorporation will occur if RF10 medium is used that is free of unlabeled
methionine, although some cells do not survive well in this media for long periods.

Cell Lysis


11. At 48 hours posttransfection, rinse the cells twice with 10 ml of PBS.


WARNING: Make sure that radioactive waste is disposed of appropriately.


12. Remove all traces of PBS and add 1 ml of ice-cold cell lysis buffer (freshly supplemented
with protease inhibitors) per 10-cm plate (or 3 ml per 15-cm plate).


13. Use a cell scraper to dislodge all cellular material from the cell culture plate and transfer
the extract to a microfuge tube.


14. Allow lysis to proceed on a rotating wheel for 1 hour at 4°C.


15. Remove nuclear and cellular debris by centrifugation at 13,000g for 10 minutes at 4°C
and retain the supernatant on ice.


Immunoprecipitation for Small-scale Analysis


16. Transfer the required FLAG-antibody-coupled agarose beads into a microfuge tube (25 µl
per immunoprecipitation), and pellet the beads by centrifugation at 13,000g for 10 seconds.


17. Remove the supernatant, and wash the beads twice in cell lysis buffer. Centrifuge for 10
seconds after each wash.


18. Resuspend the beads in 10 volumes of lysis buffer and dispense 250 µl (= 25 µl of beads)
into microfuge tubes. Centrifuge briefly to pellet the beads and remove excess buffer.


19. Add 1 ml of precleared cell lysate (from Step 15) to the FLAG antibody-agarose pellet.
Incubate the beads with cell lysate on a rotating wheel for 1 hour at 4°C.


20. Pellet the beads by centrifugation at 13,000g for 10 seconds, and remove the supernatant
carefully by suction using a 23-gauge needle.


21. Wash the beads five times by resuspending them each time in 1 ml of fresh lysis buffer.
Centrifuge to pellet the beads and aspirate off the supernatant after each wash.


22. Following the final wash, carefully remove all traces of the supernatant by aspiration.


23. Resuspend the beads in 40 µl of 100 mM glycine (pH 3) to elute the proteins. Pellet the
beads by centrifugation and transfer the eluate to a fresh tube. Repeat this step.


24. Neutralize the final eluate (80 µl) with 8 µl of 1 M Tris (pH 8).


25. For analysis by one-dimensional SDS-PAGE, add 4x SDS loading buffer to the sample and
boil it for 5 minutes. For analysis by two-dimensional immobilized pH gradient/SDS-PAGE,
dilute the sample 1:4 with IEF buffer. For details on one-dimensional and two-dimensional
PAGE, see Chapters 2 and 4, respectively.


Large-scale Enrichment of the FLAG-tagged Protein and Its Interacting
Partners by Antibody Column Chromatography


26. Place 0.8 ml of anti-FLAG antibody-agarose beads in an empty chromatography column.


27. Wash with 3 x 1 ml of 0.1 M glycine, allowing each milliliter to pass through the column
before making the next addition.


28. Pass 10 ml of PBS through the column, pipetting the beads up and down to accelerate the
process.


29. Pass 2 ml of lysis buffer through the column, allowing the beads to settle and form a bed.


30. Pass 50 ml of precleared cell lysate through the column without disturbing the bed. This
is a slow process. Do not allow the beads to dry out at any point. Save 20 µl of precolumn
cell lysate for later analysis (S1). Collect the lysate that has been through the column and
also collect 20 µl for gel analysis (S2). During collection of the lysate, continue to hydrate
the column with lysis buffer.


31. Wash the beads with 2 column volumes of ice-cold lysis buffer. Pipetting the beads up
and down will speed up the process, but allow for the beads to settle and drain between
each wash.


32. Wash the beads with 2 column volumes of ice-cold PBS.


33. Elute the proteins with 6 x 1 ml of 0.1 M glycine (pH 3), allowing the beads to drain
between each addition and collecting 1-ml fractions in separate microfuge tubes.


34. Neutralize the fractions immediately with 50 µl of 1 M Tris (pH 8) and remove 20 µl of
each for analysis (S3 to S8).


35. Wash the beads with 2 column volumes of 0.1 M glycine (pH 3), pipetting the beads up
and down to accelerate the process.


36. Neutralize the beads with 2 column volumes of PBS.


37. Add sodium azide (to 0.1% v/v final concentration) directly to the PBS in the column.
Seal the column and store at 4°C for future use.


38. Analyze the 20-µl samples (S1 to S8) by western blot analysis with anti-FLAG M2 monoclonal
antibody and a horseradish peroxidase (HRP)-conjugated anti-mouse secondary
antibody.


The efficiency of the column in removing FLAG-protein X from the cell lysate can be determined
by how much remaining FLAG-protein X is in S2 compared with the starting material (S1).


39. Of the eluted fractions S3 to S8, pool those in which FLAG-protein X is detectable by
western blot analysis.


Concentration of Proteins for Purification


40. Place the pooled fractions from the column containing FLAG-protein X in a Centricon
ultrafiltration tube and concentrate according to the manufacturer's instructions.


EXPERIMENTAL TIP: A Centricon-10 tube has a molecular-mass cutoff of 10 kD and is suitable
if the coimmunoprecipitating protein (i.e., the protein that interacts with protein X) has a
molecular mass above this. Other Centricon tubes are available for smaller proteins.


41. When the total pooled fraction has been concentrated to a very small volume (e.g., 50 µl),
invert the Centricon tube and recover the concentrate according to the manufacturer's
instructions.


42. Dilute the sample 1:4 with IEF buffer.


EXPERIMENTAL TIP: Some proteins may stick to the ultrafiltration membrane of the
Centricon tube. It is possible to wash them off the membrane using IEF buffer.


Purification and Identification of the Interacting Proteins


43. Separate the proteins by two-dimensional IPG/SDS-PAGE (see Chapter 4). 


44. Stain the gel with Coomassie Blue to visualize proteins that have interacted with protein 
X. 


45. Identify these proteins by nano-electrospray ionization tandem mass spectrometry, as 
discussed in Chapter 8. 


CASE STUDY


DIABLO is a protein identified on the basis of its ability to interact with the cell death
inhibitor protein mammalian IAP homolog A (MIHA). For the identification of DIABLO,
293 T cells were transiently transfected with a cDNA encoding MIHA with a carboxy-terminal
FLAG epitope tag (FLAG-MIHA). In the initial analysis with [35S]methionine-labeled
cell lysates, a protein spot corresponding to DIABLO could be detected that
coimmunoprecipitated specifically with FLAG-MIHA and not another unrelated FLAG-tagged
protein. To identify DIABLO, cellular lysate prepared from 100 15-cm Petri dishes of 293 T
cells transiently transfected with FLAG-MIHA was passed through a column of anti-FLAG
antibody-coupled agarose beads. After extensive washing, the bound proteins were eluted
with acidic glycine and separated by two-dimensional IPG/SDS-PAGE (see Figure 10.14)
using an Amersham Biosciences Multiphor system. This involved separation of the proteins
in the first dimension on 11-cm (pH 3-10) Immobiline strips, followed by separation in
the second dimension on precast, solid-back SDS gradient 8-18% polyacrylamide gels. The
gel was stained with Coomassie PhastGel Blue R (following the procedure for staining with
Coomassie Blue described in the Multiphor electrophoresis system user's manual, except
that for Step 1, fixing solution C was used instead of fixing solution N), and the stained gel
spot corresponding to DIABLO was excised for analysis (Verhagen et al. 2000).




PROTOCOL 2 


Affinity Purification of Interacting Proteins from Cell Lysates 


THIS PROTOCOL WAS CONTRIBUTED BY Manuel Baca and Jian-Guo Zhang (The Walter and
Eliza Hall Institute of Medical Research, Parkville, Victoria 3050, Australia).

The identification of protein interaction partners is a powerful method for elucidating
the biological activity of a protein. Important insights into the function of a protein can be
gained from a knowledge of the biochemical interactions in which the protein is involved,
particularly where the function of the interaction partner is well-defined. Proteomic identification
relies on isolating the interacting protein directly from a biological source and then
identifying that protein through microanalytical methods. Here, recombinant protein or a
chemically synthesized bioactive fragment is immobilized on resin and used as a "probe" to
capture interacting proteins directly from a cell extract. The affinity-purified proteins are
fractionated by gel electrophoresis and visualized by Coomassie staining. Comparison of this
gel profile to that obtained from the incubation of cell lysate with a control resin (i.e., resin
that lacks the immobilized probe protein) is used to identify proteins that interact specifically.
Specific bands are excised from the gel, proteolytically digested in situ, and analyzed by liquid
chromatography/tandem mass spectrometry to derive primary sequence information
and consequently identify the interacting protein (Figure 10.15). In this protocol, a generic
affinity capture procedure is described. This generalized protocol works well in several applications.
However, some conditions can be altered, as discussed earlier in the chapter (see the
panel on OPTIMIZATION OF VARIABLES FOR AFFINITY CAPTURE METHODS in the introduction to
this chapter). A case study is provided at the end of this protocol detailing the identification
of elongins B and C as SOCS-box-interacting proteins (Zhang et al. 1999).


MATERIALS


» Reagents
CAUTION: See Appendix 3 for appropriate handling of materials marked with <!>.


Coomassie Blue stain
50% (v/v) methanol <!>
10% (v/v) glacial acetic acid <!>
40% (v/v) H2O
0.1% (w/v) Coomassie Brilliant Blue R250 <!>


Ethanolamine-HCl (1 M, pH 8.0) <!>
This can be prepared from ethanolamine solution adjusted to pH 8.0 with HCl.


Immobilization buffer
phosphate-buffered saline (PBS)
0.02% (v/v) Tween-20
0.5 mM Tris(2-carboxyethyl)-phosphine hydrochloride (TCEP) (Pierce) <!>


[Flow diagram: cell lysate is incubated in parallel with probe protein resin and control resin,
followed by SDS-PAGE fractionation, in situ proteolysis of specific gel bands, and mass
spectrometric analysis and database interrogation.]

FIGURE 10.15. Outline of affinity capture strategy. (Courtesy of Manuel Baca and Jian-Guo Zhang, The
Walter and Eliza Hall Institute of Medical Research.)


N-hydroxysuccinimide (NHS)-activated Sepharose (Amersham Biosciences)
Immediately before use, prepare the resin by washing four times with ice-cold 1 mM HCl <!>.
Immobilization buffer containing 0.02% (v/v) sodium azide <!>
Lysis buffer
10 mM Tris-Cl (pH 7.5)
100 mM NaCl
0.5% Nonidet P-40
Immediately before use, add the protease inhibitor 1 mM phenylmethylsulfonyl fluoride (PMSF)
<!> in addition to Complete Protease Inhibitor Cocktail tablets (Roche Applied Science) and the
protein phosphatase inhibitors 1 mM sodium orthovanadate (Na3VO4) <!> and 1 mM sodium
fluoride (NaF) <!>.


2x SDS-PAGE gel-loading buffer
2.5% (w/v) SDS <!>
25% glycerol
125 mM Tris-Cl (pH 6.8)
0.01% (w/v) bromophenol blue <!>
Add fresh dithiothreitol (DTT) <!> to 100 mM final concentration immediately before use.


SDS-polyacrylamide gradient gel (4-20%) <!>
Sodium phosphate (0.5 M, pH 7.5)


» Equipment


Boiling water bath or heating block

Chromatography columns (disposable 10-ml) (Bio-Rad) 
Dialysis membrane 

Gel electrophoresis apparatus 
Microfuge
Microfuge tubes 
Tube mixer 
Preferably, use a device that permits end-over-end mixing. 
Scalpel 
UV spectrophotometer 


» Biological Molecules


Cell line growing in culture 


Probe protein, recombinantly expressed, or chemically synthesized peptide probe 


METHOD 


Preparation of Affinity Resin 


1. Dialyze the recombinantly produced probe protein against immobilization buffer for at 
least 4 hours. Replace the dialysis buffer and repeat this step twice. 


If the probe is a synthetic peptide that has been HPLC-purified, this may be taken up directly 
in immobilization buffer. 


IMPORTANT: Completely remove all traces of amine- and thiol-containing reagents (e.g.,
Tris-HCl, DTT, and glutathione) prior to immobilizing a protein covalently on NHS-Sepharose;
otherwise, these agents will themselves react with resin-bound NHS groups. TCEP is a
nonnucleophilic reducing agent that will keep the protein reduced, but it will not interfere with the
subsequent immobilization chemistry.


2. Quantitate the concentration of dialyzed protein by measuring the absorbance at 280 nm. 


EXPERIMENTAL TIP: Use the dialysis buffer as the spectrophotometer blank. The molar
extinction coefficient of the protein at 280 nm is readily estimated by the formula
nW(5690) + nY(1280), where nW and nY are the number of tryptophan and tyrosine residues
within the fusion protein, respectively.
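
As a worked illustration of the tip above, the short Python sketch below estimates the extinction coefficient from a sequence and converts an A280 reading into a concentration. It assumes that the tryptophan and tyrosine contributions are added together and that the Beer-Lambert law holds with a 1-cm path length by default; the function names, the example sequence, and the molecular mass used in the usage note are hypothetical.

```python
TRP_EXTINCTION = 5690  # per tryptophan, M^-1 cm^-1 at 280 nm (value from the tip)
TYR_EXTINCTION = 1280  # per tyrosine, M^-1 cm^-1 at 280 nm (value from the tip)


def molar_extinction_coefficient(sequence):
    """Estimate the 280-nm extinction coefficient from one-letter sequence.

    Assumes the Trp and Tyr contributions are summed; other chromophores
    (e.g., disulfides) are ignored in this sketch.
    """
    residues = sequence.upper()
    return (residues.count("W") * TRP_EXTINCTION
            + residues.count("Y") * TYR_EXTINCTION)


def molar_concentration(a280, sequence, path_length_cm=1.0):
    """Beer-Lambert law: concentration (M) = A / (epsilon * path length)."""
    epsilon = molar_extinction_coefficient(sequence)
    if epsilon == 0:
        raise ValueError("sequence has no Trp or Tyr; A280 is not informative")
    return a280 / (epsilon * path_length_cm)


def mg_per_ml(a280, sequence, molecular_mass_da, path_length_cm=1.0):
    """Convert to mg/ml: mol/liter x g/mol = g/liter = mg/ml."""
    return molar_concentration(a280, sequence, path_length_cm) * molecular_mass_da
```

For a hypothetical probe with two tryptophans and one tyrosine, the estimate is 2 x 5690 + 1280 = 12,660 M^-1 cm^-1, so an A280 of 0.633 corresponds to 50 µM, or 1 mg/ml if the protein has a molecular mass of 20 kD.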


3. Incubate 0.6 mg of dialyzed protein with 200 µl of washed NHS-activated Sepharose for
3 hours at room temperature, while shaking gently on a rotary mixer.


Prepare the control resin in tandem with the probe resin according to Steps 3-5. If the probe 
protein has been expressed as a fusion protein, the control resin should consist of the fusion tag 
(e.g., GST) immobilized onto NHS-Sepharose. Otherwise, the control resin is simply
ethanolamine-blocked NHS-Sepharose. 


4. Block any unreacted NHS groups by adding 0.1 volume of buffered 1 M ethanolamine
(pH 8.0). Continue shaking gently on a rotary mixer for 3 hours at room temperature or
overnight at 4°C.


5. Transfer the resin to a disposable 10-ml chromatography column and wash the resin with 
50 ml of immobilization buffer. 


The resin can be stored at 4°C in immobilization buffer containing 0.02% (v/v) sodium azide.
The resin shelf-life depends on the stability of the immobilized protein. If the recombinant
probe protein is known to degrade quickly, the resin should be used as soon as possible.


Immediately prior to use, wash the affinity resin with lysis buffer (see Step 9). 
Purification of Interacting Proteins


6. Harvest 1 x 10° cultured cells and lyse them for 30 minutes on ice using 5 ml of ice-cold 


lysis buffer.


This number of cells should be easily attainable for a cell line grown in suspension. For adher- 
ent cells, a more feasible cell number is 1 x 10° for initial experiments. 


7. Transfer the lysate to microfuge tubes and centrifuge at maximum speed for 15 minutes
at 4°C.


8. Optional: Add the supernatant to 150 µl of control resin freshly washed in lysis buffer.
Incubate the mixture for 2 hours at 4°C on a rotary mixer, centrifuge briefly, and then
remove the supernatant.


EXPERIMENTAL TIP: This preclearing step can be used to reduce the capture of proteins that
associate nonspecifically with the probe-protein resin. It is better to attempt the procedure
without this step initially, but it can be included if a high level of background protein
contamination is observed.


9. Transfer 50 µl of each resin to separate microfuge tubes. Wash the resins (several times)
with lysis buffer, including a centrifugation step between washes to pellet the resin. Then
resuspend the resins to the same initial concentration, and dispense 40 µl of each to fresh
tubes for the next step.


10. Split the lysate into two equal portions. Add one half to 40 µl of probe protein resin and
the other half to 40 µl of control resin. Incubate on a rotary mixer for 2 hours at 4°C.


This amount of resin should be suitable for most purposes, but it may need adjustment according
to the density of probe protein loaded on the resin. Small proteins or peptides can be immobilized
at high density and consequently less resin could be used. Conversely, if the probe protein
has a high molecular weight, more resin may have to be used. The greater the volume of resin
used, the greater the extent of background protein contamination.


11. Centrifuge the mixtures briefly and remove the supernatants by aspiration. Wash the
resins each with 2 ml of cold lysis buffer. Centrifuge the tubes briefly, and remove the
supernatant. Repeat this wash step three more times.


EXPERIMENTAL TIP: This degree of washing is given as a guide. If necessary, increase or
decrease the extent of washing according to the level of background protein capture observed
in the control resin experiment.


12. Elute the bound proteins by adding 30 µl of 2x SDS-PAGE gel-loading buffer to each
resin sample and heating the tubes for 4 minutes at 95°C.


13. Run the samples on a 4-20% SDS-PAGE gradient gel (see Chapter 2, Protocol 1).


14. Visualize the protein bands by staining with Coomassie Blue (see Chapter 2, Protocol 2
or 3).


If no bands are visible by Coomassie staining, repeat the experiment, but instead silver stain the
gel. Note that standard silver staining methods (see Chapter 4, Protocol 11 and Chapter 7,
Protocol 11) are not compatible with in situ digestion and mass spectrometric analysis.
Although lower in sensitivity, consider using the mass-spectrometry-compatible silver staining
procedure in Chapter 2, Protocol 7.


15. Compare the protein profiles on the two gels resulting from the capture of lysate proteins
by the immobilized probe protein resin versus those captured by the control resin. Use a
clean scalpel blade to excise any bands specifically captured by the probe protein resin for
subsequent in situ digestion and mass spectrometric analysis.




CASE STUDY: IDENTIFICATION OF ELONGIN B AND ELONGIN C AS SOCS-BOX-INTERACTING
PROTEINS (ZHANG ET AL. 1999)


The suppressors of cytokine signaling (SOCS) are a family of proteins that contain a central
SH2 domain and an ~40-amino-acid motif at the carboxyl terminus known as the
SOCS box (Starr et al. 1997; Hilton et al. 1998). In addition, several other non-SH2 protein
families also contain a SOCS box at their carboxyl termini (Hilton et al. 1998). At the time
of the discovery of the SOCS proteins in 1997, the SOCS box was a newly discovered motif 
of unknown biological function. The identification of cellular proteins that specifically 
associate with the SOCS box was used to provide important insights into the possible role 
of this motif in cell signaling pathways. 

Two complementary affinity-capture approaches were used to isolate SOCS-box-inter- 
acting proteins. These approaches differed in the nature of the SOCS box probe that was 
used to capture interacting proteins. In one approach, the SOCS box from SOCS-1 was 
recombinantly expressed as a GST fusion protein and covalently coupled onto NHS- 
Sepharose resin. This material was used as an affinity matrix to capture specific interacting 
proteins from the lysate of the murine monocytic leukemic cell line M1. Alternatively, a 46- 
residue synthetic peptide corresponding to the SOCS box from SOCS-1 was specifically 
biotinylated at the amino terminus and immobilized onto streptavidin-agarose resin. This 
affinity matrix was also used to capture specific interacting proteins from the M1 cell lysate. 

When the affinity column eluates from both approaches were analyzed by SDS-PAGE, 
two specific interacting proteins of 15 kD and 18 kD were detected. Excision of these gel 
bands and subsequent proteolysis and mass spectrometric analysis were used to identify the 
proteins as elongin C and elongin B, two proteins known to associate with each other and, 
in turn, with proteins such as elongin A and the von Hippel-Lindau (VHL) tumor suppressor
protein. Interestingly, the elongin B/C heterodimer was known to associate with
proteins that contain a (T,S,P)LXXX(C,S)XXX(L,I,V) consensus sequence, and indeed, this
motif is a conserved feature of all SOCS boxes. The identification of elongins B and C as
SOCS-box-interacting proteins had major implications for the proposed function of the 
SOCS box. Elongins B and C are believed to target associated proteins to proteasomal 
destruction; thus, it was suggested that the SOCS box acts as an adaptor for directing 
SOCS-associated proteins to the protein degradation pathway. 
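
The consensus quoted above translates directly into a pattern search. The following is a minimal sketch, assuming the consensus exactly as written here with X standing for any residue; the function name and the test sequence are hypothetical, invented for illustration, and not taken from any SOCS protein.

```python
import re

# Consensus from the text: (T,S,P)LXXX(C,S)XXX(L,I,V), X = any residue.
BC_BOX = re.compile(r"[TSP]L.{3}[CS].{3}[LIV]")


def find_bc_box(sequence):
    """Return (start_index, matched_text) for each non-overlapping match."""
    return [(m.start(), m.group()) for m in BC_BOX.finditer(sequence.upper())]


# Hypothetical sequence used only to exercise the pattern.
print(find_bc_box("AAATLQHLCRKTIAAA"))
```

A match only indicates that a sequence fits the consensus; it says nothing about whether the protein actually binds the elongin B/C heterodimer.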


PROTOCOL 3 


Native Agarose Gel Electrophoresis of Multiprotein 
Complexes 


THIS PROTOCOL WAS CONTRIBUTED BY Rosalind Kim (University of California, Berkeley, 
Berkeley, California). 

An important tool for the biochemist is the ability to analyze proteins in their native 
state. Electrophoresis of proteins and protein-protein complexes in native agarose gels using 
a horizontal gel apparatus is simple to set up, takes a short time to run, and avoids the use of 
toxic components. This system allows the detection of both positively and negatively charged 
proteins as well as protein-protein complexes in the same gel (R. Kim et al. 1999). A protein 
in native agarose can have either a positive or negative charge depending on its pI and the pH 
of the buffer used to perform the electrophoresis. Proteins with a pI lower than the buffer pH 
carry a net negative charge and migrate toward the anode, whereas proteins with a pI higher 
than the buffer pH carry a net positive charge and migrate toward the cathode. The gel is run in 
a submerged horizontal platform with the wells positioned in the center of the gel. This 
allows negatively and positively charged proteins to migrate toward the anode and cathode, 
respectively. Proteins with different molecular weights and pI values can be tested as well 
as proteins that form complexes, whether they be two pure proteins forming a complex or a 
complex formed after incubating a pure protein with a crude extract. Once a complex is 
formed, it can be cut from the gel and the components can be isolated using an Ultrafree-DA 
centrifugal filter device (Millipore) according to the method of Krowcyznska et al. (1995). 
This method does not replace IEF gels because it does not determine the pI of a protein, but 
it does facilitate the detection of protein-protein complexes and may give information on the 
protein's charge at a defined pH. 
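
The migration rule described above reduces to a comparison of pI against buffer pH. A minimal sketch follows; the function name and the example pI values are hypothetical, and the default buffer pH of 8.5 is that of the native gel buffer (Buffer A) listed in the materials for this protocol.

```python
def migration_direction(pI, buffer_pH=8.5):
    """Predict the electrode toward which a native protein migrates."""
    if pI < buffer_pH:
        return "anode"       # net negative charge
    if pI > buffer_pH:
        return "cathode"     # net positive charge
    return "stationary"      # no net charge when pI equals buffer pH


# Hypothetical pI values for an acidic and a basic protein.
for pI in (5.0, 10.0):
    print(pI, migration_direction(pI))
```

A complex of two proteins behaves according to its own net charge, which is why it can appear at a position different from either component.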


MATERIALS 


CAUTION: See Appendix 3 for appropriate handling of materials marked with <!>. 


Reagents 

Agarose (powdered) 

Gel destaining solution 
45% methanol <!> 
10% acetic acid <!> 

Gel staining solution 
0.12% Coomassie Brilliant Blue R 
45% methanol 
10% acetic acid 

Glycerol (5%) 




Native gel buffer (Buffer A) 
25 mM Tris-Cl (pH 8.5) 
19.2 mM glycine <!> 
2x Sample-loading buffer 
20% glycerol 
0.2% bromophenol blue <!> 
0.12 M Tris base 
SDS-polyacrylamide gel <!> 


Prepare an SDS-polyacrylamide gel according to Chapter 2, Protocol 1 or use a precast gel (e.g., 
Amersham Biosciences). See Step 15. 


Equipment 


Cellophane (ultraclear) (Idea Scientific Co., Minneapolis, Minnesota) 
This cellophane is used for drying the agarose or polyacrylamide gels. Even though the agarose gels 
can be as thick as 0.7 cm, these gels (after soaking in 5% glycerol) can be placed between two layers 
of cellophane, stretched using a frame, and dried for 24 hours at 37°C to preserve the data. 


Erlenmeyer flask 
See Step 1a. 


Gel electrophoresis apparatus (horizontal) (e.g., Horizon 58, Whatman Biometra) 
Microcon centrifugal filter units (Millipore) 

Microfuge 

Microwave <!> 


Ultrafree-DA unit (Millipore) 
See Step 11. 


Biological Samples 


Cellular extract containing proteins of interest 
For the preparation of suitable extracts from animal, yeast, and bacterial cells, see Chapter 3. 


Pure protein samples (2-5 µg per lane) 


Additional Reagents 


Step 15 requires the reagents and equipment listed in Chapter 2, Protocol 1. 


METHOD 


Preparation of the Native Gel 


1. Prepare a horizontal 0.8% agarose gel in native gel buffer. The dimensions of the gel 
should be ~8 cm x 5.5 cm x 3 mm. 


a. Add powdered agarose and native gel buffer to an Erlenmeyer flask that is three to 
four times the volume of the native gel buffer used. Weigh the flask. 


b. Cover the flask with an inverted beaker and place it in a microwave until the agarose 
melts. 


c. Reweigh the flask and add H2O, if necessary, to bring it back to its original weight. 


d. Allow the agarose to cool to 45°C in a water bath. 


e. Place the comb in the center of the gel-casting assembly and pour the agarose solu- 
tion into the assembly. 


f. Allow the agarose to harden. 


g. Place the gel into a horizontal electrophoresis unit containing native gel buffer. Make 
sure that the agarose gel is completely submerged. Carefully remove the comb. 
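
The quantities for Step 1 follow from the gel dimensions and the 0.8% (w/v) agarose concentration. This is a minimal sketch under the assumption that the gel is a simple rectangular slab (the wells are ignored); the function name is hypothetical, and in practice somewhat more than the calculated volume would normally be prepared.

```python
def agarose_for_gel(length_cm, width_cm, thickness_cm, percent_w_v):
    """Return (buffer volume in ml, agarose mass in g) for a slab gel."""
    volume_ml = length_cm * width_cm * thickness_cm   # 1 cm^3 = 1 ml
    agarose_g = volume_ml * percent_w_v / 100.0       # % (w/v) = g per 100 ml
    return volume_ml, agarose_g


# Dimensions from Step 1: ~8 cm x 5.5 cm x 3 mm (0.3 cm), 0.8% agarose.
volume_ml, agarose_g = agarose_for_gel(8.0, 5.5, 0.3, 0.8)
print(f"{volume_ml:.1f} ml of native gel buffer, {agarose_g:.2f} g of agarose")
```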


Identification of Protein Complexes 


2. Using purified proteins, incubate the proteins of interest together under conditions that 
facilitate protein complex formation. To determine whether proteins in a cell extract may 
bind to a protein of interest, incubate 5 µg of the pure protein with 45 µg of soluble cell 
extract under the appropriate conditions for forming protein complexes. 


3. Mix 2-5 µg of each protein sample 1:1 (v/v) with 2x sample-loading buffer. 


The protein samples should include each individual protein (and cell extract, if used) and the 
mixtures incubated as in Step 2. 


4. Load the samples into the wells (typically, load ~20 µl per well), and run the gel at a 
constant voltage of 50 V for 1 hour at room temperature. 


5. Stain the gel in staining solution for 20 minutes and destain in destaining solution until 
the protein bands can be clearly identified (Figure 10.16). 


6. To dry the gel, soak it in 5% glycerol for 5 minutes and dry it between two sheets of 
cellophane membrane for 24 hours at 37°C. 


FIGURE 10.16. Schematic representation of migration of negatively and positively charged 
proteins and a complex of the two. Native agarose gel (0.8%) is shown as described in 
Protocol 3. (Lane 1) A positively charged protein; (lane 3) a negatively charged protein; (lane 
5) a complex of the two proteins (lanes 2 and 4 are blank). The wells are in the center of the 
gel. A protein or protein complex with a pI higher than the buffer pH will migrate toward the 
cathode (-); a protein or protein complex with a pI lower than the buffer pH will migrate 
toward the anode (+). (Courtesy of Rosalind Kim, University of California, Berkeley.) 


Isolation and Identification of the Components of a Protein-Protein Complex 


7. Load two identical protein complex samples into adjacent lanes of a 0.8% agarose gel. 
Run the gel at a constant voltage of 50 V for 1 hour at room temperature. 


8. Use a sharp razor blade to separate the two lanes from the gel. 


9. Stain one of the two lanes as described in Step 5. Keep the other lane submerged in native 
gel buffer (Buffer A). 


10. Identify the region of the stained gel that contains the protein complex to be recovered. 
Using the stained gel as a guide, excise the slice of agarose from the unstained gel 
fragment that corresponds to the complex. 


11. Place the unstained gel fragment containing the complex into an Ultrafree-DA unit. The 
gel nebulizer in this unit is a device that will convert the gel slice to a fine spray upon 
centrifugation. Place the filter unit into a filtrate vial. Centrifuge the unit at 5000g for 10 
minutes at either room temperature or 4°C. The gel fragment must pass through the 
orifice of the nebulizer, thus converting the gel to a fine slurry. The gel particles are captured 
by the filter unit and discarded (the filter unit is not reusable). 


12. Transfer the protein sample in the filtrate vial to a Microcon ultrafiltration concentrator 
(the cutoff size to use is determined by the size of the proteins under study). 


13. Centrifuge the Microcon concentrator at 14,000g until the volume of the sample is 
reduced to 10-20 µl. 


The proteins are retained above the membrane in the Microcon sample reservoir. 


14. Recover the concentrated proteins by inverting the Microcon into a new vial. Centrifuge 
at 1000g for 3 minutes. 

For complete details, refer to the manufacturer's instructions. 


15. To identify the sizes of the individual components of the protein complex band, analyze 
the concentrated proteins by SDS-PAGE (see Chapter 2, Protocol 1). 


The percentage of acrylamide in the gel will depend on the sizes of the proteins under study 
(see Chapter 2, Table 2.2). Alternatively, the proteins (~4 pmoles are required) can be analyzed 
by ESI- or MALDI-MS (see Chapter 8). 
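
To judge whether a recovered band is likely to contain the ~4 pmoles mentioned above, the molar amount can be converted to mass. A minimal sketch, relying only on the identity that 1 pmole of a 1-kD protein weighs 1 ng; the function name and the example molecular masses are hypothetical choices for illustration.

```python
def pmol_to_ng(pmol, mass_kd):
    """Mass in ng of `pmol` picomoles of a protein of `mass_kd` kilodaltons."""
    # 1 pmol x 1 kD = 1e-12 mol x 1e3 g/mol = 1e-9 g = 1 ng
    return pmol * mass_kd


# Illustrative masses only: a small (15 kD) and a mid-sized (66 kD) protein.
for mass_kd in (15, 66):
    print(f"4 pmol of a {mass_kd}-kD protein = {pmol_to_ng(4, mass_kd)} ng")
```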


PROTOCOL 4 


Protein Analysis Using Blue Native PAGE 


THIS PROTOCOL WAS CONTRIBUTED BY Michael T. Ryan (Department of Biochemistry, La 
Trobe University, 3086 Melbourne, Australia). 

BN-PAGE was developed to aid analysis of membrane protein complexes (Schägger and 
von Jagow 1991). Both the sample-loading buffer and the cathode buffer contain Coomassie 
Brilliant Blue G. Upon binding to proteins, this anionic dye gives the proteins a slight negative 
charge, enabling them to enter the native gel at neutral pH, where the stability of protein 
complexes is optimal. In addition, by binding to hydrophobic regions of proteins, the dye 
prevents protein aggregation during electrophoresis. This protocol describes a generic 
method for performing BN-PAGE. 


MATERIALS 


CAUTION: See Appendix 3 for appropriate handling of materials marked with <!>. 


Reagents 
Acrylamide stock (49.5%T, 3%C) 
Dissolve 48 g of ultrapure acrylamide <!> and 1.5 g of N,N'-methylene-bis-acrylamide <!> in H2O 
to a total volume of 100 ml. Filter through a Whatman 3MM filter and store the solution at room 
temperature, away from light. 


Ammonium persulfate <!> 
Make a fresh 10% (w/v) solution in H2O. 
Anode buffer 
Prepare a solution of 50 mM Bis-Tris (bis[2-hydroxyethyl]imino-tris[hydroxymethyl]methane). 
Adjust the pH to 7.0 with HCl <!>. Store at 4°C. 


10x Cathode buffer I 
500 mM Tricine 
150 mM Bis-Tris (pH 7.0) 
0.2% (w/v) Coomassie Brilliant Blue G 
Store at 4°C. 


10x Cathode buffer II 
500 mM Tricine 
150 mM Bis-Tris (pH 7.0) 
Store at 4°C. 


Destaining solution 

40% ethanol <!> 

10% acetic acid <!> 
Dithiothreitol (DTT) <!> 

3x Gel buffer 
200 mM ε-amino-n-caproic acid <!> 
150 mM Bis-Tris 
Adjust the pH to 7.0 with HCl and store the buffer at 4°C. Some investigators use a final concen- 
tration of 500 mM ε-amino-n-caproic acid in their samples and in the gel to facilitate detergent sol- 
ubilization of protein samples and to focus complexes during electrophoresis. However, in some 
cases, this may lead to a reduction in the stability of membrane protein complexes. Reducing the 
concentration of ε-amino-n-caproic acid has not in our hands resulted in any detrimental effects. 

Isopropanol <!> 

10x Loading dye 
5% (w/v) Coomassie Brilliant Blue G 
500 mM ε-amino-n-caproic acid 
100 mM Bis-Tris 
Adjust the pH to 7.0 with HCl and store the dye at 4°C. 

SDS (2% w/v) <!> 

Staining solution 
0.25% (w/v) Coomassie Brilliant Blue R250 
40% (v/v) ethanol 
10% (v/v) acetic acid 

Solubilization buffer 
Most buffers containing the appropriate nonionic detergent used to solubilize membrane proteins can 
be applied to BN-PAGE. For mitochondrial translocation (TOM and TIM) complexes, a solubilization 
buffer consisting of 1% (w/v) digitonin <!> (recrystallized), 20 mM Tris-Cl (pH 7.4), 0.1 mM EDTA, 
50 mM NaCl, 10% (w/v) glycerol, and 1 mM PMSF <!> is typically used because this buffer was used 
in the original immunoprecipitation protocols to determine the interactions between these membrane 
proteins (Blom et al. 1995). Other detergents (e.g., 0.5% v/v Triton X-100 <!> or 0.2% w/v n-dodecyl 
maltoside) have also been used. However, BN-PAGE analysis of mitochondrial outer membranes sol- 
ubilized in Triton X-100 has shown that this detergent results in a number of small-molecular-weight 
TOM proteins dissociating from the main TOM complex (Dekker et al. 1998). 


1x SDS-PAGE running buffer 

25 mM Tris 

192 mM glycine 

1% (w/v) SDS 
TEMED (N,N,N',N'-tetramethylethylenediamine) solution <!> 
Transfer buffer 

48 mM Tris base 

39 mM glycine 

0.04% (w/v) SDS 

20% (v/v) methanol 


To recrystallize digitonin, boil the powder in ethanol and store it for 20 minutes at -20°C. 
Centrifuge the digitonin precipitate at 5000g, and dry the pellet in a vacuum desiccator. 


Although concentrations of NaCl above 100 mM are used in the solubilization of mem- 
branes with Triton X-100, this salt may be problematic when the proteins are stacked dur- 
ing BN-PAGE. ε-Amino-n-caproic acid, which has been reported to enhance the actions 
of detergents in the solubilization of proteins (Schägger and von Jagow 1991), should be 
used instead of NaCl. It also acts as a serine protease inhibitor and may therefore maintain 
the stability and activity of protein complexes. Bis-Tris is used in the gel buffer because this 
base has a slightly acidic pK (6.5-6.8) and can therefore maintain the pH at 7.0. Tricine, 
with a pK of 8.15, is used as the trailing ion (Schägger and von Jagow 1991). 
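
The "49.5%T, 3%C" designation of the acrylamide stock in the reagent list can be checked from the weights given in its recipe. A minimal sketch, assuming the usual definitions (%T is total monomer, acrylamide plus bis-acrylamide, in g per 100 ml; %C is the bis-acrylamide share of total monomer); the function name is hypothetical.

```python
def acrylamide_stock(acrylamide_g, bis_g, volume_ml):
    """Return (%T, %C) for an acrylamide/bis-acrylamide stock solution."""
    total_g = acrylamide_g + bis_g
    percent_t = 100.0 * total_g / volume_ml   # total monomer, % (w/v)
    percent_c = 100.0 * bis_g / total_g       # cross-linker as % of monomer
    return percent_t, percent_c


# Recipe from the reagent list: 48 g acrylamide + 1.5 g bis in 100 ml.
percent_t, percent_c = acrylamide_stock(48.0, 1.5, 100.0)
print(f"{percent_t:.1f}%T, {percent_c:.1f}%C")
```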
Equipment and Conditions 
BN-PAGE equipment 
Cooled gel system (e.g., SE 600 standard dual cooled vertical unit, Amersham Biosciences) 
The SE 600 can accommodate up to four gels at a time, enhancing reproducibility. The large anode 
buffer chamber with a cooling system enables electrophoresis to run at a constant temperature. 
Gel plates (18 x 16 cm) 
Spacers (typically 1-1.5 mm thick) 
Combs (containing 10-15 teeth) 


For preparative BN-PAGE, use a comb containing one normal-sized tooth for loading marker pro- 
teins and a single large tooth for the sample. 


Mini-gel systems can also be used and a number of different ones are now on the market. Use 
one with an integral cooling chamber, enabling the apparatus to be connected to a recirculat- 
ing refrigerated water bath, to maintain a constant temperature under the high-voltage condi- 
tions that generate excess heat. 

Plates and equipment should be cleaned thoroughly and rinsed well with H2O to ensure that 
no traces of detergent (which could otherwise denature proteins) remain. If BN-PAGE is to be 
performed regularly, a complete PAGE system reserved exclusively for this technique is recom- 
mended. A conventional homemade vertical gel apparatus can also be used and run in a 4°C 
room. In this case, the temperature should be monitored regularly and the voltage adjusted 
appropriately to ensure that the gel does not become warm, which may otherwise lead to pro- 
tein aggregation or disassembly of protein complexes. 


Electroblotting apparatus 

Gradient maker 
This gradient maker should be sufficient to hold a total volume equivalent to that of the gel 
employed (e.g., 20 ml) and have mini-sized spin bars for mixing of acrylamide solutions. 

Magnetic stirrer 

Microfuge 

Peristaltic pump 

Power supply (with maximum 500 V or above) 

PVDF membrane 


Refrigerated circulating H2O bath 
This bath should be connected to the cooling chamber of the gel apparatus. Because heating dur- 
ing the electrophoresis run does occur, it may be necessary to monitor the temperature of the anode 
buffer at times, using a thermometer. Positioning the gel chamber on top of a magnetic stirrer and 
placing a magnetic spin-bar within the anode buffer chamber can maintain the buffer temperature 
during the electrophoretic run. 


Biological Molecules 


Marker proteins 
The marker proteins must be in their native configurations. We use a set of markers (from 
Amersham Biosciences) containing thyroglobulin (669 kD), ferritin (440 kD), catalase (232 kD), 
lactate dehydrogenase (140 kD), and BSA (66 kD). Resuspend 250 µg of protein in 800 µl of 1x 
loading dye and load 50 µl per lane when using a large gel. This corresponds to 2.5-5 µg of protein 
per band. These soluble proteins migrate as discrete bands at their appropriate molecular weights. 

Protein samples 
The protein samples can range from purified complexes to subcellular fractions to total cellular 
extracts. For mitochondrial extracts, mitochondria are isolated using standard techniques (Spector et 
al. 1997). Solubilize mitochondrial pellets in an appropriate detergent-containing buffer. 
Approximately 100 µg of total protein in a volume of 50 µl of solubilization buffer loaded in each well 
is sufficient to observe most abundant respiratory complexes. To enrich membrane complexes, prepare 
a total membrane pellet by sonicating a mitochondrial fraction in hypotonic buffer, pelleting the intact 
mitochondria at 10,000g, and then centrifuging the supernatant at 100,000g to recover membranes. 
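
The marker-loading figures given under "Marker proteins" can be checked with a short calculation. A minimal sketch, assuming that the 250 µg is the combined mass of the five marker proteins and that they are present in roughly equal amounts (the second assumption is ours; the actual composition of the commercial mix may differ). The function name is hypothetical.

```python
def marker_load(total_ug, resuspension_ul, load_ul, n_markers):
    """Return (µg of marker mix per lane, average µg per marker band)."""
    per_lane_ug = total_ug * load_ul / resuspension_ul
    per_band_ug = per_lane_ug / n_markers   # assumes equal mass of each marker
    return per_lane_ug, per_band_ug


# 250 µg in 800 µl of 1x loading dye, 50 µl per lane, five marker proteins.
per_lane_ug, per_band_ug = marker_load(250.0, 800.0, 50.0, 5)
print(f"{per_lane_ug:.1f} µg per lane, about {per_band_ug:.1f} µg per band")
```

Under these assumptions the average falls within the 2.5-5 µg per band quoted in the text.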
Additional Reagents 


Steps 24-28 require the reagents and equipment listed in Chapter 2, Protocol 1. 


METHOD 


Preparation of a Gradient Gel 


A 4-16.5% acrylamide gradient gel with a 4% acrylamide stacking gel is typically used. This 
resolves protein complexes from >1000 kD to ~40 kD. However, because the top of such a gel 
can be fragile, a 6-16.5% (or other) gradient gel may be employed if the size of the complex 
to be analyzed is less than 600 kD. It may be convenient to make acrylamide mixes sufficient 
for pouring a number of gels since this will aid in reproducibility (see Table 10.3). The 
higher-percentage acrylamide mix contains 10% (v/v) glycerol, which aids in the formation 
of the gradient by preventing unwanted mixing of layers during gel casting. A simple gradi- 
ent maker, with a small magnetic spin bar placed in the chamber containing the higher-per- 
cent acrylamide mix, is centered on top of a magnetic stirrer. The chamber is stirred at con- 
siderable speed to aid mixing; however, in some cases, high velocities of stirring can slow the 
entry of the lower-percent acrylamide mix into the mixing chamber. For additional informa- 
tion on pouring gradient gels, see Figure 10.17 and Chapter 2, Protocol 1. 
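
The acrylamide-stock volumes in Table 10.3 follow from a simple dilution of the 49.5% stock into a 90-ml mix. A minimal sketch, with the stock volumes copied from the table; the function names are hypothetical, and the second function is an assumed convenience for scaling a mix to a different total volume, not part of the protocol.

```python
STOCK_PERCENT_T = 49.5   # acrylamide stock from the reagent list
TOTAL_ML = 90.0          # total volume of each mix in Table 10.3

# Nominal gel percentage -> ml of acrylamide stock (from Table 10.3).
STOCK_ML = {4.0: 7.3, 6.0: 11.0, 14.0: 25.5, 16.5: 30.0, 20.0: 36.4}


def final_percent_t(stock_ml, total_ml=TOTAL_ML, stock_percent=STOCK_PERCENT_T):
    """Final acrylamide concentration after diluting the stock."""
    return stock_ml * stock_percent / total_ml


def stock_volume_ml(target_percent, total_ml, stock_percent=STOCK_PERCENT_T):
    """Volume of stock needed for a mix of `total_ml` at `target_percent`."""
    return target_percent * total_ml / stock_percent


for nominal, stock_ml in STOCK_ML.items():
    print(f"{nominal}% mix: {final_percent_t(stock_ml):.2f}% acrylamide")
```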

1. Determine the volume needed to fill the gel, leaving ~2.5 cm for the stacking gel in addi- 
tion to the lengths of the wells of the comb used. (For 18 x 16-cm gels with 1-mm-thick 
spacers, use 9 ml of each acrylamide solution, thereby leaving ~5 cm of space above the 
gradient.) Fill both gradient chambers with exact volumes of each acrylamide mix. 


2. To the 9 ml of acrylamide in each chamber, add 4 µl of TEMED and 40 µl of 10% (w/v) 
ammonium persulfate and mix thoroughly. 


Different volumes of acrylamide will require different volumes of TEMED and ammonium 
persulfate. 


3. Turn the pump on and then open the valve between the two gradient chambers to enable 
the flow of the lower-percent acrylamide mix into the chamber containing the higher- 
percent acrylamide mix. 

EXPERIMENTAL TIP: The flow rate should be quite fast, because a constant and continuous 
stream of acrylamide running from the tubing into the gel results in the formation of a more 
linear gradient. 


4. Layer isopropanol onto the surface of the acrylamide between the glass plates, to level the 
gel and aid in polymerization. 


TABLE 10.3. Acrylamide mixes for BN-PAGE 

                     4%         6%         14%        16.5%      20% 
3x Gel buffer        30 ml      30 ml      30 ml      30 ml      30 ml 
Acrylamide stock     7.3 ml     11 ml      25.5 ml    30 ml      36.4 ml 
Glycerol             -          -          18 ml      18 ml      18 ml 
H2O                  52.7 ml    49 ml      16.5 ml    12 ml      5.6 ml 

Total (each mix)     90 ml 

Table courtesy of Michael T. Ryan (LaTrobe University). 


FIGURE 10.17. Schematic representation of pouring a gradient BN-PAGE gel. Low- and high-percent acry- 
lamide mixes (see Table 10.3) are placed in the appropriate gradient chamber. The pump is turned on, caus- 
ing the acrylamide to flow between the gel plates and enabling mixing of the low-percent acrylamide with 
the high-percent acrylamide. (Courtesy of Michael T. Ryan, LaTrobe University.) 


5. Immediately after use, thoroughly wash out the gradient chamber and tubing with H2O 
to prevent blockage by polymerized acrylamide. 


6. Following polymerization, remove the isopropanol. Wash the top of the gel with H2O, 
and blot the inside surfaces of the glass plates dry using filter paper. 


7. Prepare a 4% stacking gel solution and layer the solution (see Table 10.3) on top of the 
polymerized gradient gel. 


8. Insert the comb carefully into the stacking gel solution, starting first from one side, to 
avoid trapping air bubbles at the gel-comb interface. 


9. Once the gel is completely polymerized, store it at 4°C. When kept moist (e.g., surround- 
ed by tissues soaked in H2O and then wrapped in plastic wrap, and placed in a contain- 
er), the gel is stable for up to 1 week. 
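
Step 2 notes that different volumes of acrylamide require different volumes of TEMED and ammonium persulfate. A minimal sketch of one way to adjust them, assuming that both are simply scaled in proportion to the 4 µl and 40 µl used per 9 ml in Step 2 (proportional scaling is our assumption; polymerization time should still be checked empirically). The function name is hypothetical.

```python
# Reference amounts from Step 2: 4 µl TEMED and 40 µl 10% APS per 9 ml.
TEMED_UL_PER_ML = 4.0 / 9.0
APS_UL_PER_ML = 40.0 / 9.0


def catalyst_volumes(acrylamide_ml):
    """Return (µl TEMED, µl 10% ammonium persulfate), scaled linearly."""
    return acrylamide_ml * TEMED_UL_PER_ML, acrylamide_ml * APS_UL_PER_ML


# Hypothetical example volumes.
for volume_ml in (4.5, 9.0, 18.0):
    temed_ul, aps_ul = catalyst_volumes(volume_ml)
    print(f"{volume_ml} ml: {temed_ul:.1f} µl TEMED, {aps_ul:.1f} µl 10% APS")
```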


Loading the Protein Samples 
10. Remove the comb from the gel and wash the wells with H2O to remove unpolymerized 
acrylamide and excess reagents. 


11. Resuspend the protein samples in the appropriate solubilization buffer. Centrifuge the 
samples in a microfuge at maximum speed for 5 minutes at 4°C to pellet any aggregated 
material. 


12. Add loading dye to the protein samples (5 µl is added to 45 µl of sample for large gels or 
2 µl to 18 µl of sample for minigels). 


13. Load the samples into the wells using a syringe (or gel-loading pipette tips). Make sure to 
rinse the syringe thoroughly in H2O between each sample. Gently overlay the samples 
with Cathode buffer I. 


Alternatively, place Cathode buffer I in the wells first, and then load the samples through the 
buffer, where they will settle to the bottom of the well. 


Electrophoresis Conditions 


14. Run large gels at 100 V until the samples have migrated through the stacking gel and then 
increase the voltage to 500 V for migration of the proteins through the gradient gel (total 
electrophoresis time ~5 hours). 


Alternatively, when set at 6 mA per gel and a maximum of 180 V, electrophoresis is complete 
after 16 hours. For minigels, run at 6 mA (maximum 400 V) for 1 hour and 20 minutes. 


15. After the Coomassie dye has entered the gel and traveled at least to the top of the gradi- 
ent gel, replace Cathode buffer I with Cathode buffer II. The lack of Coomassie dye in the 
latter buffer enables the proteins to become visible in the gel during electrophoresis. 
Transferring to Cathode buffer II is required if the proteins will be electroblotted. 


Alternatively, if it is known beforehand how long to run the gel and the proteins will not be 
electroblotted, Cathode buffer I can be used exclusively during electrophoresis. Destain the gel 
after electrophoresis is complete to visualize protein complexes. 


16. Stop the electrophoresis run when the Coomassie Blue dye reaches the bottom of the gel. 
Proceed to visualize the proteins by staining the gel (Step 17), electroblot the proteins to 
a PVDF membrane (Step 19), or separate the protein complexes into their component 
proteins by SDS-PAGE (Step 21). 


Staining the Protein Complexes 
Protein complexes are often directly visible following electrophoresis. Additional complexes 
can be observed by staining the gel. 
17. Incubate the gel with staining solution for ~1 hour with gentle shaking. 


18. Destain the gel by incubating it with multiple changes of destaining solution until clear 
bands are observed. 


Electroblotting the Protein Complexes 


19. Soak the gel in 1x SDS-PAGE running buffer for ~20 minutes. 


20. Rinse the gel in transfer buffer and electroblot onto a PVDF membrane. Using the semi- 
dry method, set transfers of large gels at 200 mA for 1.5 hours. 
EXPERIMENTAL TIP: Nitrocellulose is not recommended, because the Coomassie dye binds 
irreversibly to this membrane, causing problems in downstream processes such as immuno- 
decoration. 


Isolating the Protein Complex Components by SDS-PAGE 


To analyze the composition of protein complexes, SDS-PAGE can be performed in the sec- 
ond dimension. This is particularly important when analyzing proteins by immunodecora- 
tion. Because polyclonal antibodies often cross-react with other proteins in a sample, it can 
lead to anomalous results when BN-PAGE blots are decorated. To confirm that an immun- 
odetected complex indeed contains the antigen of interest, a lane from the BN-PAGE gel is 
excised and subjected to SDS-PAGE, thereby resulting in the separation of proteins from the 
various complexes (Figure 10.18). 

Alternatively, the protein profile can be observed by staining (Figure 10.19a) the SDS- 
PAGE gel. In the case of mitochondria, 150-200 µg of total protein applied to the first dimen- 
sion is required to observe individual protein spots on the SDS-PAGE gel. 


21. Excise lanes from the BN-PAGE gel using a razor blade or the edge of a thin gel spacer. 


EXPERIMENTAL TIP: Gel strips can be wrapped in Parafilm and stored at -80°C for long 
periods prior to SDS-PAGE. 


22. If possible, use a razor blade to remove the blue migration front at the very bottom of the 
strip (containing dye and detergent) so that the large concentration of Coomassie stain 
at this end does not trail from the front during SDS-PAGE. 


23. Soak the gel strip in 2% (w/v) SDS containing 0.2 M DTT for 20 minutes. 


24. Remove excess solution from the gel strip. Use forceps to gently place the strip on one of 
the gel plates (which will be used to prepare the SDS-PAGE gel; see Chapter 2, Protocol 1) at a 
position that the wells would normally occupy. Place the other glass plate on top so that 
the gel strip is sandwiched between the two plates (Figure 10.18). The spacers should be 
the same thickness as those used in BN-PAGE. 


FIGURE 10.18. Schematic representation of performing SDS-PAGE in the second dimension following 
BN-PAGE. (Courtesy of Michael T. Ryan, LaTrobe University.) 


FIGURE 10.19. Coomassie-stained mitochondrial extracts following BN-PAGE (a) and 2D-PAGE (b). (a) 
Mitochondrial extracts (100 µg) isolated from rat kidney (lane 1) or rat heart (lane 2) were solubilized in 
buffer containing 0.5% (v/v) Triton X-100 and subjected to BN-PAGE (4-16.5% gradient) followed by 
Coomassie staining. Multiple complexes can be observed. (b) Yeast mitochondria (150 µg of protein) were 
solubilized in digitonin-containing buffer and subjected to BN-PAGE followed by SDS-PAGE in the second 
dimension and subsequent Coomassie staining. (Courtesy of Michael T. Ryan, LaTrobe University.) 


25. Pour the separation gel solution to ~3 cm below the gel strip and overlay with iso- 
propanol. Avoid wetting the strip by tilting the gel on its side while pouring. 


26. After polymerization, remove the isopropanol and pour the stacking gel solution over 
and around the BN-gel strip. Make sure that bubbles do not become trapped under the 
gel strip. 


27. For comparison, place a comb containing two lanes in the stacking gel at the side of the 
gel strip to load both SDS-PAGE molecular-weight markers and a sample of the original 
protein that was applied to BN-PAGE, but in this case, prepared for SDS-PAGE by boil- 
ing in Laemmli sample buffer (see Chapter 2, Protocol 1). 


28. Run the gel under standard SDS-PAGE conditions (see Chapter 2, Protocol 1). 


29. Following electrophoresis, stain the gel, electroblot the proteins, or prepare individual 
protein spots for mass spectrometry. 


Individual protein spots obtained from 2D-PAGE can be subjected to protein sequence analy- 
sis using various techniques. Alternatively, protein complexes can be excised directly from the 
BN-PAGE gel and subjected to in-gel proteolytic (e.g., tryptic) digestion (Matsudaira 1993, and 
see Chapter 7, Protocol 13) followed by separation of peptides using HPLC and subsequent 
sequencing using Edman degradation (see Chapter 6, Protocol 1) or mass spectrometry (see 
Chapter 8). 


PROTOCOL 5 


Analysis of the Topology of Protein Complexes Using 
Cross-linking and Mass Spectrometry 


THIS PROTOCOL WAS CONTRIBUTED BY Juri Rappsilber and Matthias Mann (Protein 
Interaction Laboratory, Department for Biochemistry and Molecular Biology, University of 
Southern Denmark, Campusvej 55, DK-5230 Odense M, Denmark). 

This protocol is designed to allow identification of spatial relationships between proteins 
within a wide variety of multiprotein complexes. Details for optimizing cross-linking of pro- 
teins within multiprotein complexes are provided. Following the cross-linking reaction, the 
proteins are separated by one-dimensional PAGE, the cross-linked proteins are isolated from 
the gel, and the individual members of the cross-linked complexes are identified by mass 
spectrometry (Shevchenko et al. 1996; Pandey et al. 2000a). The protocol can be used to ana- 
lyze many protein complexes isolated by any purification technique, provided the protein 
complexes remain in their native configuration. 


MATERIALS 


CAUTION: See Appendix 3 for appropriate handling of materials marked with <!>. 


Reagents 
Buffer X 
20 mM PIPES (pH 7.0) 
Salts according to the requirements of the investigated complex 


The pH of PIPES changes -0.009 units per °C. The buffer must not contain reagents that can 
react with the cross-linker employed, e.g., dithiothreitol <!>, β-mercaptoethanol <!>, Tris, or 
phosphate. 


Cross-linking reagents 
BS3 (bis[sulfosuccinimidyl]suberate) 
BMH (bismaleimidohexane) 
EDC (1-ethyl-3-[3-dimethylaminopropyl]carbodiimide) <!> 
SMCC (succinimidyl-4-[N-maleimidomethyl]cyclohexane-1-carboxylate) 
Dimethylsulfoxide (DMSO) (HPLC grade, if possible) <!> 
4x SDS-loading buffer 
200 mM Tris-Cl (pH 6.8) 
8% SDS <!> 
0.4% bromophenol blue <!> 
40% glycerol 
Stop solution 
1 M dithiothreitol (DTT) 
1 M Tris-Cl (pH 7.8) 
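
The temperature coefficient quoted for PIPES in the Buffer X note can be used to estimate how far the pH drifts when a buffer adjusted at one temperature is used at another. A minimal sketch, assuming that the shift is linear over the range of interest; the reference temperature of 25°C and the working temperature of 4°C are hypothetical examples, and the function name is ours.

```python
PIPES_DPH_PER_DEGC = -0.009   # from the Buffer X note


def pipes_ph(ph_at_ref, ref_temp_c, temp_c):
    """Estimate the pH of a PIPES buffer at `temp_c` (linear approximation)."""
    return ph_at_ref + PIPES_DPH_PER_DEGC * (temp_c - ref_temp_c)


# Hypothetical case: buffer adjusted to pH 7.0 at 25°C, then used at 4°C.
print(f"Estimated pH at 4°C: {pipes_ph(7.0, 25.0, 4.0):.2f}")
```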
Equipment 


(Optional) Bio-Spin column (Bio-Rad), MicroSpin G-25 column (APBiotech), or NAP col- 
umn (Amersham Biosciences) 
See Step 1. 
Mass spectrometer capable of identifying proteins (ion trap, reflector MALDI-TOF, triple 
quadrupole, QTOF, or QSTAR mass spectrometer) 
Software that calculates peptides from protein sequences (e.g., GPMAW, available at 
http://welcome.to/gpmaw) 


(Optional) Ultrafiltration concentrator (e.g., the Centricon Plus-20, Millipore) 
See Step 2. 
Water bath or heating block preset to 70°C 


Biological Sample 


Purified protein complex 

Purify the protein complex using a method that will keep the complex in its native form. A proven 
way to isolate larger quantities of a complex is by the recombinant tagging of one of the complex 
components. The other components can assemble on the tagged protein in the cell or in the cell 
lysate, and the complex is then isolated by the interaction of the tag with a tag-specific solid matrix, 
e.g., Protein A tag and immobilized IgG, GST and immobilized glutathione, or affinity epitope 
(such as Myc or HA) and immobilized antibody. The elution conditions must be mild so as not to 
destroy the complex. Either competition (e.g., using the peptide epitope) or enzymatic cleavage in 
the linker region between the tag and the complex member is suitable for elution. The use of salt, 
detergents, or extreme pH usually results in denaturation of protein complexes and is therefore 
generally not advisable for elution. 


Additional Reagents 


Steps 10 and 14 require the reagents and equipment listed in Chapter 2, Protocol 1. 


METHOD 


The cross-linking reaction can precede the purification. In this case, the elution can be 
done in SDS-loading buffer. Note that, in general, protein cross-linking in vivo is not a 
trivial task. 


Optimization of the Cross-linking Reaction 


Different preparations of a protein complex vary in quality and concentration. It is therefore
recommended that all optimization tests be done with a single preparation. However, freezing
samples of protein complexes is not advised (unless specifically tested), because it often
results in partial denaturation. Rather, plan all work carefully and complete it within a few
days, during which the complex is stored at 4°C.


1. (Optional) If the buffer components of the protein complex solution are not compatible
with cross-linking, exchange the purification buffer with Buffer X. A rapid method for this
is gel filtration. Depending on the volume of the solution, use a Bio-Spin column (50-100 µl),
MicroSpin G-25 column (10-100 µl), or the gravitation flow-driven NAP columns
(0.1-2.5 ml) according to the manufacturers' protocols. The procedure is simple:


* Equilibrate the column with Buffer X. 
* Apply the sample to the column. 
* Collect the flowthrough. 
Gel filtration-based desalting methods have an efficiency of 95-98%. This means, for example,
that a solution containing 100 mM Tris before desalting will still contain 2-5 mM Tris in the
sample after gel filtration. Thus, to achieve acceptable concentrations of buffer components,
gel filtration may have to be repeated. An alternative desalting method, although it takes
longer, is dialysis.
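

The arithmetic behind this note can be sketched in a few lines of Python. The sketch assumes that every pass through the column removes the same fraction of the buffer component, which is a simplification, and the 1 mM target used in the example is a hypothetical threshold, not a value from this protocol.

```python
import math

def residual_after_desalting(initial_mM, efficiency, rounds=1):
    """Concentration (mM) left after `rounds` passes, each removing `efficiency` of the solute."""
    return initial_mM * (1.0 - efficiency) ** rounds

def rounds_needed(initial_mM, target_mM, efficiency):
    """Smallest number of passes that brings the solute to target_mM or below."""
    if initial_mM <= target_mM:
        return 0
    return math.ceil(math.log(target_mM / initial_mM) / math.log(1.0 - efficiency))

# 100 mM Tris, one pass at 95% and at 98% efficiency
worst_case = residual_after_desalting(100.0, 0.95)   # about 5 mM
best_case = residual_after_desalting(100.0, 0.98)    # about 2 mM

# Hypothetical target of 1 mM residual Tris at 95% efficiency per pass
passes = rounds_needed(100.0, 1.0, 0.95)
```

With these assumptions, one pass leaves 2-5 mM Tris, in line with the figures quoted above, and a second pass is needed to fall below the hypothetical 1 mM target.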


2. (Optional) Concentrate the samples. Samples can be rapidly (~10 minutes) and efficiently
concentrated using an ultrafiltration device according to the manufacturer's protocol.


Depending on the concentration of the purified complex, volume reduction may be necessary 
to ensure that a sufficient amount of protein is loaded onto the gel in Steps 10 and 14. The effi- 
ciency of precipitation (done just prior to loading the sample onto the gel) from dilute samples 
is not as good as ultrafiltration-based methods and is therefore not recommended. To reduce 
sample losses due to adsorption of protein on the ultrafiltration membrane, the membrane can 
be coated prior to concentrating the sample, by concentrating a BSA solution to 1 mg/ml in
Buffer X and washing the membrane three times with Buffer X. 


3. Place 13 microfuge tubes on ice and add 26 µl of protein complex solution to each one.


IMPORTANT: To distinguish between substrates and the products of the cross-link reaction,
include a control to which no cross-linker is added.


4. Weigh out ~1 mg of each cross-linker on a balance and dissolve in a volume of cold
DMSO or H₂O that gives a 10 mM solution:


EDC: 1.9 mg/ml H₂O
BS³: 5.7 mg/ml H₂O
BMH: 2.8 mg/ml DMSO
SMCC: 3.3 mg/ml DMSO


IMPORTANT: The cross-linkers hydrolyze rapidly and thus should be handled as quickly as 
possible. Do not use frozen stocks! 
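

Because the amount weighed out is rarely exactly 1 mg, the solvent volume has to be worked out for each weighing. The following Python sketch uses only the mg/ml values listed above; the function name and the example masses are illustrative.

```python
# mg of cross-linker per ml of solvent that gives a 10 mM stock (values from the list above)
MG_PER_ML_FOR_10_MM = {
    "EDC": (1.9, "H2O"),
    "BS3": (5.7, "H2O"),
    "BMH": (2.8, "DMSO"),
    "SMCC": (3.3, "DMSO"),
}

def solvent_volume_ul(cross_linker, weighed_mg):
    """Return (volume in µl, solvent) needed to dissolve weighed_mg to 10 mM."""
    mg_per_ml, solvent = MG_PER_ML_FOR_10_MM[cross_linker]
    return weighed_mg / mg_per_ml * 1000.0, solvent

# Example weighings (hypothetical masses)
volume, solvent = solvent_volume_ul("SMCC", 1.2)   # 1.2 mg SMCC in about 364 µl DMSO
```

Calculating the volume beforehand shortens the time between weighing and use, which matters because the reagents hydrolyze rapidly.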


5. Prepare two dilutions, 1 mM and 0.1 mM, in cold water from the 10 mM stock of each
cross-linker.


6. Add 3 µl of each cross-linker solution (0.1 mM, 1 mM, and 10 mM) to one of the tubes
containing protein complex (from Step 3) to test the optimal cross-linker concentration.


7. Incubate the tubes for 1 hour at 2°C on ice.


8. Stop the reaction by adding 1 µl of stop solution. Continue incubating the tubes for a
further 10 minutes.


9. Add 10 µl of 4x SDS-loading buffer to each sample. Heat the samples for 10 minutes at
70°C in a water bath or heating block.


10. Separate the components (some will be cross-linked) of the protein complexes on a
low-percentage SDS-PAGE gel. Be sure to include protein from the control tube that lacks
cross-linker.


The percentage of the SDS-PAGE gel is determined by the size of expected cross-linked products
and whether it can still be handled; the gel becomes increasingly fragile with lower
percentages of acrylamide. A good compromise is a 6% gel, although this may not be required if
small proteins are investigated. To increase handling stability of low-percentage gels, add 1/6
volume of a 2% prepolymerized acrylamide solution. A prestained marker allows electrophoresis
to be continued until the designated size reaches the bottom of the gel, allowing a
better spread of the slower-migrating species.
11. After the best cross-linker and its most suitable concentration have been determined,
further optimize the cross-linking reaction conditions by


* varying the time that the cross-linking reaction is allowed to proceed, ranging from
10 minutes to 2 hours, and


* varying the temperature from 2°C to room temperature at - 39°C intervals. 
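

The tube layout of Steps 3-6 can be written down before starting. The Python sketch below assumes that the 13 tubes of Step 3 correspond to four cross-linkers, each tested at three stock concentrations, plus one control without cross-linker; this reading is an interpretation of the protocol, and the names used are illustrative. It also gives the cross-linker concentration in each tube after 3 µl of stock is added to 26 µl of sample.

```python
SAMPLE_UL = 26.0          # protein complex per tube (Step 3)
CROSS_LINKER_UL = 3.0     # cross-linker solution added per tube (Step 6)
STOCKS_MM = [0.1, 1.0, 10.0]
CROSS_LINKERS = ["EDC", "BS3", "BMH", "SMCC"]

def final_concentration_uM(stock_mM):
    """Cross-linker concentration (µM) in the tube after mixing sample and stock."""
    return stock_mM * 1000.0 * CROSS_LINKER_UL / (SAMPLE_UL + CROSS_LINKER_UL)

def tube_plan():
    """One control tube plus one tube per cross-linker and stock concentration."""
    plan = [("control", None)]
    plan += [(name, stock) for name in CROSS_LINKERS for stock in STOCKS_MM]
    return plan

for name, stock in tube_plan():
    if stock is None:
        print(f"{name}: no cross-linker")
    else:
        print(f"{name}: {stock} mM stock, about {final_concentration_uM(stock):.0f} µM final")
```

The final concentrations come out close to 10 µM, 100 µM, and 1 mM, so each stock is diluted roughly tenfold in the reaction.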


Isolation of Cross-linked Products and Further Analysis 


12. When the optimum cross-linking conditions have been established, prepare additional
sample of protein complex, if necessary, according to Steps 1 and 2.


13. Add the chosen cross-linker to the protein complex sample, and incubate the mixture
under the optimized reaction conditions.


IMPORTANT: Remember to include a control tube containing the protein complex but no
cross-linking reagent.


14. Repeat Steps 8-10.


A number of points must be considered in preparing the SDS-PAGE gel if planning subsequent
analysis by mass spectrometry. All solutions must be free of dust to avoid keratin
contamination. Ideally, use freshly filtered solutions. Take special care at those steps where the use of
gloves is necessary because electrostatic charging increases the risk of contamination. The sen- 
sitivity of the mass spectrometric analysis is best if the final protein sample is in as small a vol- 
ume of acrylamide gel as possible. Measures to reduce the gel volume include the use of 1-mm 
spacers, employing conditions to obtain well-resolved bands (e.g., possibly precast gels), and 
cutting the bands very precisely in Step 16. 


15. Visualize the proteins by staining (see Chapter 2, Protocols 2-7). Identify the bands
containing cross-linked proteins by comparison of the cross-linked material to the untreated
complex on the SDS-PAGE gel.


16. Excise the bands of interest from the gel and analyze them by mass spectrometry (see
Chapter 8, Protocol 1).


Although the proteins being analyzed are cross-linked, no special precautions must be taken,
other than keeping the volume of buffer used in the digestion (Chapter 7) as small as possible
to ensure a sufficient concentration of peptides in the supernatant. This eliminates the need for
extraction procedures prior to mass spectrometric analysis, which often result in the loss of
some peptide species.


The individual proteins within mixtures of proteins can be difficult to identify unambigu- 
ously. Therefore, it is sometimes advisable to perform the database searches with the peptide 
mass data on a custom database that contains only the known components of the investi- 
gated complex. This analysis can also be done manually by comparing the measured peptide 
masses with a list of calculated peptide masses if sophisticated software is not at hand. 
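

The manual comparison described above can be illustrated with a short Python sketch. It digests each sequence of a small custom database in silico with trypsin specificity (cleavage after K or R, but not before P), calculates monoisotopic [M+H]+ masses, and lists the measured masses that match within a tolerance. The function names, the toy sequence, and the default 50 ppm tolerance are assumptions made for illustration; they are not part of GPMAW or of any search program mentioned in this chapter, and modifications such as cysteine alkylation are ignored.

```python
import re

# Monoisotopic residue masses (Da)
RESIDUE_MASS = {
    "G": 57.02146, "A": 71.03711, "S": 87.03203, "P": 97.05276, "V": 99.06841,
    "T": 101.04768, "C": 103.00919, "L": 113.08406, "I": 113.08406, "N": 114.04293,
    "D": 115.02694, "Q": 128.05858, "K": 128.09496, "E": 129.04259, "M": 131.04049,
    "H": 137.05891, "F": 147.06841, "R": 156.10111, "Y": 163.06333, "W": 186.07931,
}
WATER = 18.01056
PROTON = 1.00728

def tryptic_peptides(sequence, missed_cleavages=0):
    """Cleave after K or R unless followed by P (zero-width split needs Python 3.7+)."""
    fragments = [f for f in re.split(r"(?<=[KR])(?!P)", sequence) if f]
    peptides = []
    for i in range(len(fragments)):
        last = min(i + missed_cleavages, len(fragments) - 1)
        for j in range(i, last + 1):
            peptides.append("".join(fragments[i:j + 1]))
    return peptides

def peptide_mh(peptide):
    """Monoisotopic mass of the singly protonated, unmodified peptide."""
    return sum(RESIDUE_MASS[aa] for aa in peptide) + WATER + PROTON

def match_masses(measured_masses, database, tol_ppm=50.0, missed_cleavages=1):
    """For each protein, list (measured mass, peptide) pairs that agree within tol_ppm."""
    hits = {}
    for name, sequence in database.items():
        calculated = {p: peptide_mh(p) for p in tryptic_peptides(sequence, missed_cleavages)}
        hits[name] = [
            (m, p)
            for m in measured_masses
            for p, mass in calculated.items()
            if abs(m - mass) / mass * 1e6 <= tol_ppm
        ]
    return hits

# Toy example with a made-up sequence; a real database would hold the known complex members
toy_database = {"toy_protein": "AKGGR"}
toy_hits = match_masses([289.1619], toy_database)
```

Counting the matched peptides per protein gives a simple ranking of the complex members present in a band.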
Case Study: TOPOLOGY OF THE Nup84p COMPLEX


The Nup84p complex is a six-member subcomplex of the yeast nuclear pore complex 
(Siniossoglou et al. 1996). To study the topology of the Nup84p complex, it was purified 
and cross-linked (Rappsilber et al. 2000). The products were separated by SDS-PAGE and 
identified using MS. For the purification, one of the components (Nup85p) was tagged 
with the Protein A tag which binds to the heavy chain of IgG. When the Nup84p complex 
formed, it incorporated the Nup85p-Protein A fusion protein, enabling the entire complex 
to be purified using IgG immobilized on Sepharose beads. The bound complex was eluted 
by cleaving the protease site in the linker between Protein A and Nup85p using TEV pro- 
tease in a buffer that was compatible with cross-linking. The volume of the eluate was suf- 
ficiently small that concentration was unnecessary. An aliquot was compared with a con- 
trol purification of a metabolic enzyme to determine the background of the purification 
(Figure 10.20). Based on the intensity of the Coomassie-stained bands, the concentration 
of the complex was estimated to be ~1 pmole in 26 µl. The low concentration of the purified
Nup84p complex minimized the likelihood of cross-links between complexes.

To find a suitable cross-linker, five different reagents with reactivity toward sulfhydryl
and/or amino groups and one nonspecific linker (BASED) were investigated. The cross-linkers
SMCC, BS³, DSS, and BMH resulted in the appearance of different new bands on
the polyacrylamide gel at a concentration of 10 µM (~300-fold excess over the complex).
SMCC and BMH gave the best yield of products in both number and amount. BS³ and DSS
resulted in one major product at about the size of band II (Figure 10.21) obtained by
SMCC. With BASED, the intensity of nonlinked material on the gel decreased; however,
surprisingly, no bands corresponding to cross-linked species were observed. DMA did not
result in any apparent reaction.

To refine the reaction conditions, cross-linker concentration, temperature, and reaction
time were varied for SMCC and BS³, the most promising of the cross-linkers that were
initially screened. Cross-linking with 100 µM SMCC for 1 hour at 2°C gave the largest
diversity of products for this complex, as judged from SDS-PAGE and silver staining.
Consequently, these conditions were chosen for the further analysis of the Nup84p
complex (Figure 10.21).

The optimal molar ratio of complex to reagent was found to be ~1:3000. A ratio of 1:300 
resulted in low yields of cross-linked products, whereas ratios of 1:10,000 or higher prac- 
tically linked all of the members of Nup84p complex to one another. 
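
The molar ratios quoted in this case study follow from the estimated amount of complex (~1 pmole in 26 µl) and the cross-linker concentration. A minimal Python sketch of the calculation is given below; it neglects the small volume change on adding the cross-linker, and the function name is illustrative.

```python
def molar_excess(cross_linker_uM, complex_pmol, volume_ul):
    """Molar ratio of cross-linker to complex; pmol per µl is numerically equal to µM."""
    complex_uM = complex_pmol / volume_ul
    return cross_linker_uM / complex_uM

excess_10uM = molar_excess(10.0, 1.0, 26.0)      # about 260-fold
excess_100uM = molar_excess(100.0, 1.0, 26.0)    # about 2600-fold
excess_1mM = molar_excess(1000.0, 1.0, 26.0)     # about 26,000-fold
```

These values agree with the rounded ratios of ~1:300 and ~1:3000 given in the text, bearing in mind that the amount of complex was itself only estimated from stained bands.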

Establishing standard conditions for the cross-linking reaction was difficult because the
Nup84p complex could only be purified in relatively small amounts (a few hundred
fmoles). The quantitation of cross-linking reactions also proved to be difficult because
the complex concentration could only be estimated by comparison of stained bands.
Time-course experiments, performed at 2°C, 24°C, and 30°C, using SMCC, BS³, and
BMH revealed that the reaction was essentially complete after 30-60 minutes. Higher
temperatures only increased the staining background on the gel but did not increase
cross-linking efficiencies.

The cross-linked products were excised from the final gel and processed under standard
conditions to obtain peptides for mass spectrometric analysis. The gel slices were washed,
and the proteins were reduced, alkylated, and digested with trypsin (Shevchenko et al.
1996; also see Chapter 7). An aliquot of the supernatant was analyzed on a thin-layer
preparation with α-cyano-4-hydroxycinnamic acid mixed with nitrocellulose as the
matrix (Vorm et al. 1994). Mass spectra were obtained as the sum of 200-250 selected
measurements on a Bruker REFLEX IL and spectra were internally calibrated using
matrix-related signals and trypsin autolysis signals. Peaks were assigned in LaserOne
software, developed in-house. Using the program PeptideSearch (developed in-house), the
peptide masses were searched against a nonredundant database containing all known
protein sequences. The identification of a protein by PeptideSearch is based on the
comparison of matching peptides with the matches of the other proteins in the database. The four
large subunits of the Nup84p complex could be identified in one or the other cross-linked
product. The two small subunits, Sec13p and Seh1p, could also be identified when the
search was confined to yeast proteins below 120 kD. This shows that, in principle, the
analysis of the cross-linked products could have yielded the identities of all six complex
components without prior knowledge of the complex. This information could be then
used for a topological analysis of the protein complex. The number of matching peptides
was determined by comparing the list of measured masses with the tryptic peptides of the
six known complex members. This analysis permitted identification of the proteins present
in their respective bands (see Figures 10.22 and 10.23 and Table 10.4).


FIGURE 10.20. Schematic outline of the approach used to obtain topological information on a
protein complex. A fraction of the purified complex is cross-linked, and the products are
separated by 1D SDS-PAGE. In parallel, the noncross-linked complex is subjected to gel
electrophoresis in order to recognize the cross-link products. Relevant bands are excised,
digested with trypsin, and measured by MALDI-MS. The peptide mass maps obtained are searched
against a nonredundant database or a complex-specific database. Based on proteins identified
in cross-link products, a model of the complex topology can be constructed. Since there are
many possibilities for the masses of cross-linked peptides, the observed masses that
correspond to cross-linked peptides are not conclusive even at high mass accuracy.
Interpretation of the site of contact based on these peptides requires further analysis,
i.e., sequencing. (Courtesy of Juri Rappsilber and Matthias Mann, University of Southern
Denmark.)


FIGURE 10.21. Silver-stained SDS-PAGE gel (10%) of the Nup84p complex (lane 3). As a control,
DHFR, which has no known protein-binding partners, was purified in parallel to show the
background of the purification procedure (lane 2). (Lane 1) Total yeast extract. (Courtesy of
Juri Rappsilber and Matthias Mann, University of Southern Denmark.)


Band III of Figure 10.21 contains Nup85p and Nup145p. The added masses of Nup85p
(86 kD) and the carboxy-terminal part of Nup145p (~85 kD, as Nup145p is posttranslationally
cleaved into two proteins with different function) are in agreement with the apparent
mass of Band III (168 kD). Therefore, we conclude that Nup85p and Nup145p are
direct neighbors. Band II contained Nup85p, Nup145p, and Seh1p. The apparent molecular
mass of the band is 138 kD, which agrees, within the range of uncertainty introduced by
the unknown migration behavior of cross-linked protein pairs, with the protein pairs
Nup85p + Seh1p (86 kD + 40 kD) and Nup145p + Seh1p (85 kD + 40 kD). From these
data, we conclude that Seh1p is a direct neighbor of both Nup85p and Nup145p. This is
further confirmed by the analysis of Band V, which migrates at an apparent mass of 185 kD
and contains Seh1p, Nup85p, and Nup145p, which have an added mass of 203 kD. This
band therefore contains all three proteins cross-linked to one another. Similar reasoning on
the results of the identifications in Band IV shows that Nup120p is a direct neighbor of
Nup85p and Nup145p. The higher-molecular-mass Bands VI, VII, and VIII all contain four
or more components cross-linked to each other and are therefore less informative. These
bands, however, all contain Nup85p and Nup145p, thus supporting a central role for these
two proteins. The remaining band (Band I) migrates at an apparent mass of 122 kD and
was found to contain all six members of the complex. Considering the apparent mass of
the band, each of the four large Nup proteins is potentially linked to either one of the small
proteins, Sec13p or Seh1p. Nup85p linked to Seh1p was identified in Band II; therefore, the
combination in Band I is likely to be Nup85p + Sec13p. Equally, Nup145p + Seh1p was
identified in Band II, suggesting the combination of Nup145p + Sec13p in Band I.
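
The mass bookkeeping used in this reasoning can be sketched in Python. The subunit masses are those given in the header of Table 10.4; the tolerance and the function name are assumptions chosen for illustration, since the migration of cross-linked products on SDS-PAGE is not accurately predictable.

```python
from itertools import combinations

# Subunit masses in kD, as listed in Table 10.4
SUBUNIT_KD = {
    "Sec13p": 32, "Seh1p": 40, "Nup84p": 84,
    "Nup85p": 86, "Nup145p": 85, "Nup120p": 121,
}

def candidate_products(apparent_kd, tolerance_kd, max_members=3):
    """Subunit combinations whose summed mass lies within tolerance_kd of the band."""
    candidates = []
    for n in range(2, max_members + 1):
        for combo in combinations(sorted(SUBUNIT_KD), n):
            total = sum(SUBUNIT_KD[s] for s in combo)
            if abs(total - apparent_kd) <= tolerance_kd:
                candidates.append((combo, total))
    # Closest mass first
    return sorted(candidates, key=lambda item: abs(item[1] - apparent_kd))

# Band III migrates at an apparent mass of 168 kD; 5 kD is a hypothetical tolerance
band_iii = candidate_products(168, 5)
```

For Band III, three pairs of large subunits fall within the tolerance (169, 170, and 171 kD), which illustrates why the apparent mass alone cannot assign a band and why the mass spectrometric identification of the proteins in each band is required.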

The failure to observe certain protein linkages can also suggest topological information.
For example, Sec13p was not found linked to Seh1p (the corresponding mass range is not
shown on the gel in Figure 10.21), and Nup84p was not found directly linked to Nup120p.
This suggests that Sec13p is not directly adjacent to Seh1p and that Nup84p does not have
direct contact to Nup120p. In addition, for each of the two larger proteins, Nup84p and
Nup120p, there is only one band indicating a link to one of the smaller proteins, Sec13p and
Seh1p (Band I). This is consistent with Nup120p being directly adjacent to either one of the
small proteins and Nup84p to the other one, with both pairs being separate from each other.

Taken together, the data suggest a model for the Nup84p complex in which Nup85p and
Nup145p form a central pair with two opposing pairs at the side, one composed of Nup84p
with either Sec13p or Seh1p and the other being Nup120p with either Seh1p or Sec13p. At
present, the orientation of the two opposing pairs, cis or trans, remains unclear (Figure
10.24). This model satisfies all the constraints derived from the data in Table 10.4 and was
validated by deletion analysis (Siniossoglou et al. 2000).
FIGURE 10.22. Silver-stained SDS-PAGE gel (6%) of the purified Nup84p complex either
noncross-linked (lane 1, -) or cross-linked (lane 2, +) loaded in a ratio of 1:3. At least
eight new bands appear after addition of cross-linker (labeled I-VIII). Background bands are
denoted with an asterisk. (Courtesy of Juri Rappsilber and Matthias Mann, University of
Southern Denmark.)


TABLE 10.4. Identification of cross-linked proteins


Band  MW     Sec13p  Seh1p   Nup84p  Nup85p  Nup145p  Nup120p   Linked proteins                         MW
      (kD)   32 kD   40 kD   84 kD   86 kD   85 kD(a) 121 kD(b)                                         (kD)

I     122    6/25%   6/29%   16/25%  11/23%  14/14%   33/35%    Nup120p/(Sec13p or Seh1p)               153/161
                                                                Nup84p/(Sec13p or Seh1p)                116/124
                                                                Nup85p/Sec13p                           118
                                                                Nup145p/Sec13p                          117
II    138    -       8/32%   -       [?]     9/9%     -         Nup85p/Seh1p                            126
                                                                Nup145p/Seh1p                           125
III   168    -       -       -       8/11%   11/8%    -         Nup85p/Nup145p                          171
IV    175    -       -       -       8/19%   6/7%     [?]       Nup85p/Nup120p                          207
                                                                Nup145p/Nup120p                         206
V     185    9/34%   -       -       7/15%   6/6%     -         Nup85p/Nup145p/Sec13p                   203
VI    >200   -       [?]     -       1/15%   [?]      11/14%    Nup85p/Nup145p/Nup120p/Seh1p            332
VII   >200   -       -       9/17%   8/14%   16/15%   10/11%    Nup85p/Nup145p/Nup120p/Nup84p           376
VIII  >200   8/40%   -       8/15%   9/23%   23/24%   11/4%     Nup85p/Nup145p/Nup120p/Nup84p/Sec13p    408


Table courtesy of Juri Rappsilber and Matthias Mann (University of Southern Denmark).
Entries are x/y, where x is the number of peptides and y is the sequence coverage observed for
the respective proteins.
(a) In mature Nup145p, the amino-terminal third of the protein sequence is cleaved so that
Nup145p migrates faster than Nup85p on SDS-PAGE.
(b) Nup120p runs at an apparent molecular mass of 100 kD.
FIGURE 10.23. MALDI peptide mass map obtained after in-gel tryptic digestion of Band VII. Four
proteins were identified by iterative searches in the nonredundant database (NRDB) with greater
than 50 ppm mass accuracy. The proteins were identified in the order: Nup145p, Nup120p, Nup84p,
and Nup85p. (Courtesy of Juri Rappsilber and Matthias Mann, University of Southern Denmark.)


FIGURE 10.24. Proposed topology model for the Nup84p complex. The two small complex members may
be on the same side of the complex (cis) or on opposing sides (trans). Sec13p may be close to
Nup120p and Seh1p may be close to Nup84p or vice versa. (Courtesy of Juri Rappsilber and
Matthias Mann, University of Southern Denmark.)


PROTOCOL 6 


Subsecond Photo-cross-linking: Use of Water-soluble Metal
Complexes to Monitor Protein-Protein Interactions


THIS PROTOCOL WAS CONTRIBUTED BY David A. Fancy (University of Texas Southwestern
Medical Center, Dallas, Texas).

Understanding how the vast arrays of protein-protein interactions are responsible for cellular
function is one of the fundamental quests in biology. Over the decades, we have seen
increasing complexity in the organization of protein complexes, such as in the proteasome and
RNA polymerase II holoenzyme, and we have been challenged to follow the intricate cascades
of signal transduction to the cell nucleus. To understand these systems, the investigator is faced
with at least two fundamental questions: (1) What is the organization of the proteins in
question? (2) How do these interactions change during a catalytic cycle? Of the variety of methods
invented to answer these questions, chemical cross-linking has proven to be very useful.

This protocol provides details on using photo-induced cross-linking of unmodified proteins
(PICUP), which is based on radical coupling reactions. The proteins of interest are mixed
with water-soluble metal complexes, palladium(II) tetrakis(4-N-methylpyridyl)porphyrin
(K. Kim et al. 1999) or Tris-bipyridyl ruthenium(II) dication, and ammonium persulfate
(Fancy et al. 1999). The mixture is photolyzed with visible light (>400 nm) to photo-oxidize
the metal complex via electron transfer to ammonium persulfate (see Figure 10.4) (Bolletta et
al. 1980; Aboul-Enein and Schulte-Frohlinde 1988; Nickel et al. 1994). The activated
metal-centered radical cation extracts an electron from an aromatic amino acid, resulting in a
protein-centered radical species that can attack a wide variety of other groups. This leads to
covalent cross-linking between components of the protein complex. Irradiation times range
from subsecond flashes, when a 150 W Xe lamp is used, to multiple seconds, when a
hand-held flashlight is used. Typical yields of cross-linked product range from 30% to 90%.


MATERIALS


» Reagents


CAUTION: See Appendix 3 for appropriate handling of materials marked with <!>. 


Ammonium persulfate <!> 
4x Gel-loading buffer 
0.2 M Tris 
8% SDS <!> 
2.9 M B-mercaptoethanol <!> 
40% glycerol 
0.4% xylene cyanol <!> 
0.4% bromophenol blue <!>
Methanol <!>
PdCl₂ <!>
5x Reaction buffer
75 mM sodium phosphate (pH 7.5)
750 mM NaCl
TMPyP (5,10,15,20-tetrakis[1-methyl-4-pyridinio]porphyrin tetra[p-toluenesulfonate], Sigma)
Trifluoroacetic acid (TFA) <!>
Tris-bipyridyl ruthenium(II) dication (Ru[bpy]₃²⁺)


» Equipment


Heating block preset to 95°C
Photo-cross-linking apparatus 
Xenon high-power lamp (150 W "arc lamp," Thermo Oriel) <!>
Cut-on filter (380-2500-nm; Thermo Oriel) 
Body of a manual single-lens reflex camera with the lens and back removed 
Cardboard barrier 


Sample microfuge tube holder 
For an illustration of the photo-cross-linking apparatus, see Figure 10.25, and for details, see the 
panel on GENERAL CONSIDERATIONS on the following page. 


SepPak C₁₈ cartridge (Waters)
Spectrophotometer, visible
Syringe filter (45 µm) and syringe


» Biological Sample


Protein sample of interest
Prepare the protein sample in a total reaction volume of 20 µl containing 125 µM TMPyP-Pd(II)
or Ru(bpy)₃²⁺ in a 1.7-ml microfuge tube.
See the panel on GENERAL CONSIDERATIONS.


» Additional Reagents 


Step 5 (of the photo-cross-linking procedure) requires the reagents and equipment listed in 
Chapter 2, Protocols 1 and 2. 


FIGURE 10.25. Typical light source setup for a high-power lamp (150 W Xe). Light is first filtered through
10 cm of distilled H₂O and then through a 380-2500-nm cut-on filter. Exposure time (to 1/1000 second) is
controlled using a manual single-lens reflex (SLR) camera with the lens and back removed. The sample
is placed 50 cm from the lamp. (Courtesy of David A. Fancy, University of Texas Southwestern
Medical Center.)


METHOD


GENERAL CONSIDERATIONS 


* Easily oxidized components such as β-mercaptoethanol or dithiothreitol (DTT), which
quench cross-linking, should be removed prior to the experiment.

* Protein concentrations can vary depending on the experiment, and thus the amount of
metal complex should be determined empirically. For these conditions, 1-10 µM protein
is typically used.

* Stock solutions of 30 mM metal complex in H₂O can be stored indefinitely frozen and in
the dark. The ammonium persulfate stock should be made fresh before each series of
experiments and not used if more than 1 hour old.

* Cross-linking reactions are carried out in 1.7-ml microfuge tubes, which maximizes the
irradiation surface, to achieve the highest quantum yield during a flash irradiation.

* After the addition of the metal complex, the protein solution should be kept in the dark
because ambient light can cause molecular oxygen-mediated cross-linking of the proteins
(Fancy et al. 2000). Ammonium persulfate is added to a final concentration of 2.5
mM, in the dark, just before photolysis.

* It is important to keep a protective barrier between the camera and the light source until
just before shutter opening because a continuous direct beam of intense light can burn
through the delicate fabric that makes up many camera shutters.

* Open microfuge sample tubes are held horizontally (parallel to the beam of light) in a
mounted rack (i.e., the sample microfuge tube holder) placed 50 cm from the 150 W
lamp.

* Care should be taken to focus the beam at the point where the liquid sample is to be
placed.

* If a hand-held flashlight is used for these experiments, it should be of the highest power
conveniently obtainable and focusable. In our experiments, we have successfully used an
aluminum body Brinkman flashlight that takes four D-cell batteries. In this case, the
distance between the sample and light source should be reduced to no more than 10 cm.


Metallation of TMPyP with Pd(II)


1. Add 0.34 mg (1.9 µmoles) of PdCl₂ to a microfuge tube containing 1 mg (1.2 µmoles) of
TMPyP in 1.4 ml of H₂O. Heat this mixture for 2 hours in a 95°C heating block.


EXPERIMENTAL TIP: Metallation can be monitored by the collapse of the four Q band peaks
at 518, 554, 584, and 639 nm into two peaks at 525 and 558 nm by visible spectroscopy.


2. Cool the sample and filter it through a 45-µm syringe filter. Add TFA to 1%.


3. Inject the solution onto a SepPak C₁₈ cartridge previously equilibrated in 1% TFA. Wash
the cartridge with 5 ml of 1% TFA.


4. Elute TMPyP-Pd(II) with 0.5 ml of methanol into a preweighed microfuge tube.


5. After evaporation of the methanol, dissolve TMPyP-Pd(II) in 1 ml of H₂O. Determine its
concentration by its extinction coefficient at 417 nm (ε = 158 × 10³ M⁻¹ cm⁻¹).


This reagent can be stored frozen and in the dark indefinitely. 
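

The concentration of the TMPyP-Pd(II) stock follows from the Beer-Lambert law, c = A/(ε × l). The Python sketch below takes the extinction coefficient at 417 nm as 1.58 × 10⁵ M⁻¹ cm⁻¹, which is an assumed value; substitute the coefficient established for your own preparation. The absorbance reading and dilution factor in the example are hypothetical.

```python
EPSILON_417 = 1.58e5   # M^-1 cm^-1; assumed value, check against your own reference

def concentration_uM(a417, dilution_factor=1.0, path_cm=1.0, epsilon=EPSILON_417):
    """Stock concentration (µM) from the absorbance at 417 nm of a diluted aliquot."""
    return a417 / (epsilon * path_cm) * dilution_factor * 1e6

# Hypothetical reading: A417 = 0.79 measured on a 1:100 dilution in a 1-cm cuvette
stock_uM = concentration_uM(0.79, dilution_factor=100.0)   # about 500 µM
```

Dilute the stock so that the reading falls within the linear range of the spectrophotometer before applying the calculation.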


Photo-cross-linking Proteins 


WARNING: Wear safety glasses with tinted lenses to protect the eyes when using the photo-cross- 
linking apparatus with a xenon lamp. 


1. Prepare the protein sample in a total reaction volume of 20 µl, containing protein, 1x
reaction buffer, and 125 µM TMPyP-Pd(II) in a 1.7-ml microfuge tube. Protect the reaction
tube from light.


2. Add 5 µl of freshly made ammonium persulfate to the solution immediately before photolysis.
Secure the sample tube horizontally (so it is parallel to the beam of light) in the
sample holder.


3. Remove the barrier between the SLR camera and the light source, and open the camera
shutter for 0.5 second by a cable release or, if so equipped, the camera's own autotimer.
After the shutter is closed, replace the solid barrier between the camera and the lamp.


4. Immediately add 7 µl of a 4x gel-loading buffer to the sample to quench the reaction.


5. Analyze the photo-cross-linking reaction using SDS-PAGE followed by Coomassie Blue 
staining or western blotting. 


An example of a typical cross-linking experiment is shown in Figure 10.26. In this experiment,
UvsY was cross-linked using the above protocol. One can see in lane 6 that only in the case
where the metal complex and ammonium persulfate are present is cross-linking efficient during
flash photolysis. Also apparent, in lane 3, is the oxygen-mediated cross-linking when long
exposures are taken in the absence of ammonium persulfate.
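

The volumes used in Steps 1-4 fix several concentrations that are worth checking before the experiment. The Python sketch below assumes that the 2.5 mM ammonium persulfate given under GENERAL CONSIDERATIONS refers to the 25-µl mixture present at photolysis; whether the 125 µM metal complex refers to the 20-µl or the 25-µl volume is not stated, so the sketch simply reports the value after dilution. Function names are illustrative.

```python
def stock_needed(final_conc, added_ul, start_ul):
    """Stock concentration that gives final_conc after adding added_ul to start_ul."""
    return final_conc * (start_ul + added_ul) / added_ul

def diluted(conc, start_ul, added_ul):
    """Concentration of a component already present in start_ul after added_ul more is added."""
    return conc * start_ul / (start_ul + added_ul)

REACTION_UL = 20.0        # Step 1
APS_UL = 5.0              # Step 2
LOADING_BUFFER_UL = 7.0   # Step 4

# Ammonium persulfate stock (mM) that gives 2.5 mM in the 25-µl reaction
aps_stock_mM = stock_needed(2.5, APS_UL, REACTION_UL)

# Metal complex (µM) at the moment of photolysis if 125 µM was set up in 20 µl
metal_uM_at_photolysis = diluted(125.0, REACTION_UL, APS_UL)

# Strength of the 4x gel-loading buffer after it is added to the 25-µl reaction
loading_buffer_x = 4.0 * LOADING_BUFFER_UL / (REACTION_UL + APS_UL + LOADING_BUFFER_UL)
```

Under these assumptions, the ammonium persulfate stock is 12.5 mM, the metal complex is 100 µM during the flash, and the loading buffer ends up slightly below 1x (about 0.9x).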


FIGURE 10.26. UvsY is efficiently cross-linked in the presence of Ru(bpy)₃²⁺ and APS (lane 6) during a
0.5-second exposure. A competing pathway is shown in lane 3, but it takes a considerably longer exposure time.
Thus, reactions must remain in the dark until flash photolysis is initiated. (Courtesy of David A. Fancy,
University of Texas Southwestern Medical Center.)


OPTIMIZATION AND PRECAUTIONS


As with all experimental procedures, a bit of optimization is required to achieve satisfactory
results. Too much cross-linking is often the result of a high concentration of protein,
metal complex, or ammonium persulfate, or overexposure of the sample to the light. If
decreasing the shutter time or reducing the amount of metal complex and ammonium persulfate
does not work, then the protein concentration must be lowered. Interestingly, for
some proteins at high concentrations (>1 µM), this technique can pick up protein collisions
and thus link proteins that should be strictly monomeric or are unrelated. An example of
this is shown in Figure 10.27 (lane 5). UvsY (a filamentous protein involved in T4 homologous
recombination) was mixed with maltose-binding protein (MBP), and the two were nonspecifically
cross-linked. Unfortunately, lowering the protein concentration can be problematic if the
method of protein detection is not sensitive. Instead, to eliminate nonspecific cross-linking,
an exogenous amino acid, such as tyrosine or histidine, can be added (to 60 µM) to the
solution (Figure 10.27, lane 6). Presumably, under these circumstances, long-lasting radical
cations produced on the surface of the oxidized protein would interact with an
electron-rich histidine before nonspecifically cross-linking to another protein diffusing
through the solution.

If too little or no cross-linking is observed, it is often due to poor detection or a lack of 
exposed aromatic amino acids on the surface of the protein. At this stage, the investigator 
may need to prepare a western blot to detect cross-linked products reliably. We have 
noticed that some monoclonal antibodies detect cross-linked products poorly if irradiation 
times are considerably longer than 5 seconds (Fancy et al. 2000). This is most likely due to 
the oxidation of the epitope needed for antibody binding. 


FIGURE 10.27. Adding a small amount of tyrosine to a cross-linking reaction can eliminate some of the
nonspecific cross-linking. UvsY and MBP can cross-link to each other (lane 2), reaffirming the suspicion
that PICUP can trap transient protein-protein interactions. Adding a threefold excess of tyrosine over
protein can decrease nonspecific cross-linking without compromising yield. (Courtesy of David A. Fancy,
University of Texas Southwestern Medical Center.)




Case Study: SUBSECOND PHOTO-CROSS-LINKING


Mapping protein-protein contacts in large multiprotein complexes can be a daunting task, 
and it is thus important to develop new chemical methods to aid in this pursuit. Affinity 
modification approaches, in which a reactive tag is appended to a specific residue or agent, 
have been valuable tools used to probe the immediate environment around the point of 
attachment (Fancy 2000). Current methods rely heavily on bifunctional cross-linkers or 
UV-light-activatable photo probes that often result in low yields and protein degradation. 

We set out to show that an affinity approach could also be employed using the water-soluble,
long-wavelength, light-harvesting metal complexes exploited by PICUP. In the following
experiment, we show that Pd(II)-metallated protoporphyrin IX (PPIX-Pd[II]) (Aldrich) can be
selectively delivered to the antibody α-HA (or monoclonal antibody 12CA5) (Sigma) by attaching
it to the epitope peptide of the influenza hemagglutinin protein, NH₂-YPYDVPDYA (HA). PPIX has
two carboxyl groups that allow easy coupling to the peptide using
1-ethyl-3-(3-dimethylaminopropyl)carbodiimide (EDC). In this experiment, we incubated various
ratios of HA-PPIX-Pd(II) with α-HA or with an antibody that it does not bind, α-glutathione
S-transferase (α-GST) (Sigma). Only if selective delivery of the cross-linking reagent is
achieved should one observe a high level of cross-linking between α-HA and HA-PPIX-Pd(II) and
not α-GST (Figure 10.28). Protein levels and peptide conjugate levels were kept low (<10 ng) to
avoid nonspecific protein cross-linking, which may be attributable to the exogenous addition of
HA-PPIX-Pd(II).

Results of this experiment are shown in Figure 10.29. It is clear that upon addition of
ammonium persulfate (S₂O₈²⁻) and subsequent photolysis, cross-linking occurs specifically at
α-HA. Interestingly, even a 1000:1 ratio of HA-PPIX-Pd(II) to antibody did not induce coupling
of α-GST. Also of interest is that addition of ammonium persulfate was not essential to
initiate efficient cross-linking of α-HA. This is presumably due to the ability of oxygen to
sensitize cross-linking via a potentially different mechanism.


[Schematic for Figure 10.28: HA-PPIX-Pd(II) incubated with anti-HA gives a cross-link after APS/light; with anti-GST, no cross-link forms.]


FIGURE 10.28. Selective delivery of HA-PPIX-Pd(II) to α-HA and cross-linking. The epitope for α-HA (NH₂-YPYDVPDYA) can
be selectively tethered to PPIX-Pd(II) and then incubated with α-HA (which recognizes it) or with α-GST (which does not).
Cross-linking should occur efficiently only when selective recognition is achieved. (Adapted, with permission, from
Aboul-Enein and Schulte-Frohlinde 1988 [© American Society for Photobiology].)

FIGURE 10.29. Selective delivery of HA-PPIX-Pd(II) to α-HA and cross-linking. Cross-linking is
observed only when a specific complex between α-HA and HA-PPIX-Pd(II) is formed (lanes 2-5). When an
antibody that does not recognize the HA epitope is employed, there is no cross-linking. For detection,
proteins were transferred to PVDF membrane and probed with α-mouse antibodies conjugated to horseradish
peroxidase. Detection was achieved through enhanced chemiluminescence (ECL). (Courtesy of David
A. Fancy, University of Texas Southwestern Medical Center.)


PROTOCOL 7 


Site-specific Protein-DNA Photo-cross-linking:
Analysis of Structural Organization of Protein-DNA 
and Multiprotein-DNA Complexes 


THIS PROTOCOL WAS CONTRIBUTED BY Nikolai Naryshkin, Andrey Revyakin, and Richard H.
Ebright (Howard Hughes Medical Institute, Waksman Institute, and Department of
Chemistry, Rutgers University, Piscataway, New Jersey).

Processes of fundamental biological importance, including replication, repair, recombi- 
nation, and transcription, proceed via assembly of large multiprotein-DNA complexes. 
Understanding such processes requires information regarding the structural organization of 
the complexes and information regarding the pathways and kinetics of assembly of the com- 
plexes. Site-specific protein-DNA photo-cross-linking is able to define positions of proteins 
relative to DNA within large multiprotein-DNA complexes, permitting the analysis of structural
organization of complexes (Bartholomew et al. 1990; Bell and Stillman 1992; Lagrange
et al. 1996). In conjunction with rapid mixing and laser flash photolysis, site-specific
protein-DNA photo-cross-linking also is able to define kinetics of formation of individual
protein-DNA interactions within large multiprotein-DNA complexes, and thus also allows analysis of
pathways and kinetics of assembly of complexes (S. Druzhinin et al., unpubl.). 

Several procedures have been developed for site-specific protein-DNA photo-cross-link- 
ing within large multiprotein-DNA complexes (Bartholomew et al. 1990; Bell and Stillman 
1992; Lagrange et al. 1996). The most effective procedure involves the following four parts 
(see Figure 10.5): 

• Part 1: Chemical (Fidanza et al. 1992; Yang and Nash 1994; Mayer and Barany 1995) and
enzymatic (Sambrook and Russell 2001) reactions are used to prepare a DNA fragment
containing a phenyl-azide photoactivatible cross-linking agent and an adjacent radiolabel
incorporated at a single, defined DNA phosphate (with a 9.7 Å linker between the photoreactive
atom of the cross-linking agent and the phosphorus atom of the phosphate, and
with an ~11 Å maximum "reach" between potential cross-linking targets and the phosphorus
atom of the phosphate).

• Part 2: The multiprotein-DNA complex of interest is formed using the site-specifically
derivatized DNA fragment, and the multiprotein-DNA complex is UV-irradiated, initiating
covalent cross-linking with proteins in direct physical proximity to the cross-linking
agent.

• Part 3: Extensive nuclease digestion is performed, eliminating uncross-linked DNA and
converting cross-linked DNA to a cross-linked, radiolabeled three- to five-nucleotide "tag."

• Part 4: The "tagged" proteins are identified, usually by denaturing polyacrylamide gel
electrophoresis followed by autoradiography.

The procedure is performed in a systematic fashion, with preparation and analysis of at
least ten derivatized DNA fragments, each having the photoactivatible cross-linking agent
incorporated at a single, defined DNA phosphate (typically, every second DNA phosphate
[each 12 Å] on each DNA strand spanning the region of interest) (Lagrange et al. 1996, 1998;
Kim et al. 1997, 2000a; Wang and Stumph 1998; Bartlett et al. 2000; Naryshkin et al. 2000).
The results of the procedure define the translational positions of proteins relative to the DNA
sequence. Plotted on a three-dimensional representation of a DNA helix, the results also
define the rotational orientations of proteins relative to the DNA helix axis and the groove 
orientations of proteins relative to the DNA major and minor grooves (Lagrange et al. 1996, 
1998; Kim et al. 1997, 2000a; Wang and Stumph 1998; Naryshkin et al. 2000). 
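
The systematic design described above can be sketched as a short planning script. This is only an illustration: the function name, the strand labels, and the example coordinates are assumptions introduced here and are not part of the protocol.

```python
from typing import List, Tuple


def derivatization_sites(first: int, last: int, step: int = 2) -> List[Tuple[str, int]]:
    """List (strand, phosphate position) pairs for derivatized fragments.

    Takes every `step`-th phosphate between `first` and `last` (inclusive)
    on each of the two DNA strands. Position numbering is hypothetical.
    """
    if step < 1 or last < first:
        raise ValueError("need step >= 1 and last >= first")
    positions = range(first, last + 1, step)
    # Strand labels are illustrative; use whatever naming fits the complex studied.
    return [(strand, p) for strand in ("strand 1", "strand 2") for p in positions]


# Hypothetical 20-nucleotide region of interest, every second phosphate.
sites = derivatization_sites(-40, -21)
print(len(sites), "derivatized DNA fragments to prepare")
```

One fragment is prepared per listed site, so the length of the list gives the number of parallel preparations to plan for.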

Elsewhere, we have presented detailed protocols for cross-linking of protein-DNA com- 
plexes in solution (“in-solution” cross-linking; Kim et al. 2000b; Naryshkin et al. 2001) or in 
polyacrylamide gel matrices (“in-gel” cross-linking; Kim et al. 2000b; Naryshkin et al. 2001). 
Here, we present a detailed protocol for the cross-linking of protein-DNA complexes immo- 
bilized on streptavidin-coated paramagnetic beads (“on-bead” cross-linking; see Naryshkin 
et al. 2000). On-bead cross-linking offers three important advantages over the other two 
methods: 


• The ability to perform stringent washing to eliminate nonspecific and nonproductive
complexes (Kim et al. 2000a; Naryshkin et al. 2000).
• The ability to change reaction media rapidly (e.g., to add substrates, inhibitors, or allosteric
effectors) (Kim et al. 2000a; Naryshkin et al. 2000).
• The ability to change reaction temperatures rapidly.

In view of these advantages, our laboratory now uses on-bead cross-linking in
all photo-cross-linking experiments not requiring rapid mixing and laser flash photolysis
(Naryshkin et al. 2000; Y. Kim et al., unpubl.).

PART 1: Design and Preparation of Biotin-labeled Site-specifically
Derivatized DNA Fragment 


MATERIALS 


▶ Reagents


CAUTION: See Appendix 3 for appropriate handling of materials marked with <!>. 


10x Annealing buffer 
400 mM Tris-HCl (pH 7.9) 
500 mM NaCl 
100 mM MgCl₂ <!>
ATP (100 mM)
p-Azidophenacyl bromide <!>
Biotin-11-dATP, biotin-11-dCTP, or biotin-16-dUTP (1 mM)


The choice of biotin-labeled dNTP is determined by the choices of restriction endonucleases (see
Biological Molecules, below).

C₁₈ HPLC solvent A (50 mM triethylammonium acetate [pH 7.0], 5% acetonitrile <!>)
C₁₈ HPLC solvent B (100% acetonitrile)
Chloroform <!>
Controlled-pore glass (CPG) oligonucleotide synthesis supports (1 µmole, 500 Å) (Applied
Biosystems)
dA, dC, dG, T β-cyanoethylphosphoramidites (Applied Biosystems) <!>
dNTPs (10 mM) 
Three dNTPs are required to complement the biotin-labeled dNTP in Step 36. 
Denaturing loading buffer 
0.3% bromophenol blue <!> 
0.3% xylene cyanol <!>
12 mM EDTA, in formamide <!>
The formamide may have to be deionized prior to its addition to the loading buffer. If any yellow
color is present, deionize the formamide by stirring on a magnetic stirrer with Dowex XG8 mixed
bed resin for 1 hour and filtering it twice through Whatman No. 1 paper. Store the deionized
formamide in small aliquots under nitrogen at -70°C.


DNA polymerization mix (25 mM each dATP, dGTP, dCTP, and TTP) 
EDTA (0.5 M, pH 8.0) 
Elution buffer
0.5 M ammonium acetate <!>
10 mM magnesium acetate (pH 7.5)
1 mM EDTA
Ethanol (70% and absolute) <!>
Store at -20°C.
Formamide <!>
[γ-³²P]ATP (10 mCi/ml, 6000 Ci/mmole) <!>
Low-EDTA TE
10 mM Tris-HCl (pH 8.0)
0.1 mM EDTA
Methanol <!>
Nondenaturing polyacrylamide slab gel (5%)
29:1 acrylamide:bisacrylamide
0.5x TBE (10 x 7 x 0.15 cm)
Nondenaturing loading buffer
0.3% bromophenol blue
0.3% xylene cyanol
30% glycerol, in H₂O
Oligodeoxyribonucleotide synthesis reagents (e.g., Applied Biosystems)
10x Phosphorylation buffer
500 mM Tris-HCl (pH 7.6)
100 mM MgCl₂
15 mM β-mercaptoethanol <!>
PicoGreen dsDNA quantitation kit (Molecular Probes)
Polyacrylamide slab gel (5%)
29:1 acrylamide:bisacrylamide <!>
8 M urea <!>
0.5x TBE (10 x 7 x 0.075 cm)
Polyacrylamide slab gel (5%)
29:1 acrylamide:bisacrylamide
0.5x TBE (10 x 7 x 0.15 cm)
Polyacrylamide slab gel (12%)
19:1 acrylamide:bisacrylamide
8 M urea <!>
0.5x TBE (10 x 7 x 0.075 cm)

0.5x TBE (45 mM Tris-borate [pH 8.3] and 1 mM EDTA) is usually prepared from a 5x stock. Dilute the
stock buffer just before use and make the gel solution and the electrophoresis buffer from the same
stock solution. The 5x stock solution is prepared by combining 54 g of Tris base, 27.5 g of boric acid,
and 20 ml of 0.5 M EDTA (pH 8.0) to a final volume of 1 liter with H₂O.

Potassium phosphate (1 M, pH 7.0)
10x Restriction endonuclease digestion buffers
The choice of digestion buffer(s) is determined by the choices of restriction endonucleases (see
Biological Molecules, below).

SDS (10%) <!>
Sodium acetate (3 M, pH 5.2)
TE
10 mM Tris-HCl (pH 7.6)
1 mM EDTA
Tetraethylthiuram disulfide/acetonitrile (Applied Biosystems)
Triethylammonium acetate (50 mM, pH 7.0) (Prime Synthesis)


▶ Equipment

Autoradiogram markers
Autoradiography intensifying screen 
Beaker (600 ml) 
Benchtop centrifuge
CHROMA SPIN+TE-10 and SPIN+TE-100 spin columns (BD Biosciences CLONTECH)
DNA/RNA synthesizer (e.g., ABI392, Applied Biosystems)
Fixed-angle microfuge
Germicidal lamp (254 nm) <!>
HPLC system, including
HPLC pump (e.g., L-7100, Hitachi)
HPLC UV detector (e.g., L-3000, Hitachi)
C₁₈ RP-HPLC column (5 µm) (e.g., LiChrospher 100 RP-18 column, Merck KGaA)
Intensifying screen
Microfuge tubes, siliconized polypropylene
Oligonucleotide purification cartridge (OPC) (Applied Biosystems)
Phosphorimaging system
Polypropylene round-bottomed tubes (6 ml)
Scalpels (disposable)
Spin-X centrifuge filter (0.22 µm, cellulose acetate)
UV spectrophotometer
Vacuum evaporator (SpeedVac, Thermo Savant, or equivalent)
Water baths or heating blocks preset to 37°C, 55°C, 65°C, and 90°C


▶ Biological Molecules

DNA polymerase I Klenow fragment (10 units/µl)

Restriction endonucleases A and B

Choose restriction endonucleases A and B using the following criteria:
• Endonuclease A should have a unique (or nearly unique) cleavage site 100-150 nucleotides 5'
to the DNA site of interest (see Figure 10.5).
• Endonuclease B should have a unique (or nearly unique) cleavage site 100-150 nucleotides 3'
to the DNA site of interest (see Figure 10.5).
• One, and only one, restriction endonuclease (either A or B) should yield cleavage products with
5' overhanging ends containing dA, dG, or T.
It is important to use restriction endonucleases with cleavage sites relatively distant (100-150
nucleotides) from the DNA site of interest and thus to prepare DNA fragments having relatively
long DNA sequences flanking the DNA site of interest. This ensures that in the event a protein binds
nonspecifically at the ends of the DNA fragment (see, e.g., Melancon et al. 1983), it will not
interfere with binding of protein(s) at the DNA site of interest, and it will not yield cross-links that
overlap those of protein(s) at the DNA site of interest (Naryshkin et al. 2000).

Single-stranded DNA-binding protein (Stratagene) 

Single-stranded DNA template 
The single-stranded DNA template is a single-stranded DNA of M13 derivative (or, less preferably, 
phagemid derivative) carrying a DNA site for the protein(s) of interest (prepared as in Sambrook 
and Russell 2001). 

T4 DNA ligase (5 units/µl)

T4 DNA polymerase (3 units/µl)

T4 polynucleotide kinase (10 units/µl)

T7 DNA polymerase (10 units/µl)

Upstream primer
The upstream primer is a 25-nucleotide oligodeoxyribonucleotide complementary to the region of the
single-stranded DNA template ~50 nucleotides 5' to the cleavage site for restriction endonuclease A (see
Figure 10.5).

METHOD


Chemical Reactions 


1. Perform 24 standard cycles of solid-phase β-cyanoethylphosphoramidite
oligodeoxyribonucleotide synthesis to prepare CPG-linked precursor containing residues 3-26 of the
desired oligodeoxyribonucleotide. Use the following settings:

Cycle: 1.0 µmole CE
DMT: on 
End procedure: manual 


2. Replace the iodine/H₂O/pyridine/tetrahydrofuran solution (bottle 15) with a
tetraethylthiuram disulfide/acetonitrile solution. Perform one modified cycle of solid-phase
β-cyanoethylphosphoramidite oligodeoxyribonucleotide synthesis to add residue 2 and the
phosphorothioate linkage. Use the following settings:

Cycle: 1.0 µmole sulfur
DMT: on 
End procedure: manual 


3. Replace the tetraethylthiuram disulfide/acetonitrile solution (bottle 15) with the
iodine/H₂O/pyridine/tetrahydrofuran solution. Place a collecting vial on the DNA
synthesizer. Perform one standard cycle of solid-phase β-cyanoethylphosphoramidite
oligodeoxyribonucleotide synthesis to add residue 1. Use the following settings:

Cycle: 1.0 µmole CE
DMT: on
End procedure: automatic CE

Steps 1-3 yield an oligodeoxyribonucleotide 26 nucleotides in length with a phosphorothioate
linkage between nucleosides 2 and 3. Occasionally (rarely), a low yield is observed in the
primer-extension reaction (see Steps 28-31). In this case, prepare an oligodeoxyribonucleotide
with 1-10 additional nucleotides at the 3' end, and preferably with dG or dC at the 3' end.


4. Remove the collecting vial and screw the cap on tightly. Incubate for 10 hours at 55°C.


5. Transfer the sample to a 6-ml polypropylene round-bottomed tube, place the tube in a
vacuum evaporator, and spin for 30 minutes with the vacuum evaporator lid ajar and
with no vacuum (to allow evaporation of ammonia).


6. Close the vacuum evaporator lid, apply vacuum, and dry.


7. Remove the trityl group and purify ~0.1 µmole of oligodeoxyribonucleotide on OPC
according to the supplier's protocol.


8. Dry in the vacuum evaporator. 


9. Resuspend in 100 µl of TE. Remove a 2-µl aliquot and dilute with 748 µl of TE.
Determine the concentration from UV absorbance at 260 nm (molar extinction
coefficient ~240,000 M⁻¹cm⁻¹).
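
The concentration calculation in Step 9 follows the Beer-Lambert law. The sketch below is a minimal illustration, assuming a 1-cm path length; the function name and the example absorbance reading are hypothetical.

```python
def stock_concentration_uM(a260: float,
                           epsilon: float = 240_000.0,  # M^-1 cm^-1, from Step 9
                           aliquot_ul: float = 2.0,     # volume removed, from Step 9
                           diluent_ul: float = 748.0,   # TE added, from Step 9
                           path_cm: float = 1.0) -> float:  # assumed cuvette path length
    """Return the undiluted oligodeoxyribonucleotide concentration in micromolar."""
    dilution_factor = (aliquot_ul + diluent_ul) / aliquot_ul  # 375-fold for Step 9
    diluted_molar = a260 / (epsilon * path_cm)                # Beer-Lambert: c = A / (eps * l)
    return diluted_molar * dilution_factor * 1e6


# Hypothetical reading of A260 = 0.64 on the diluted sample.
conc = stock_concentration_uM(0.64)
print(f"stock concentration: {conc:.0f} uM")
```

Multiplying the stock concentration (µM) by the remaining volume (µl) gives the amount in pmoles, which is useful when dividing the sample into 50-nmole aliquots in Step 12.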


10. To check the purity of the oligodeoxyribonucleotide, mix an aliquot containing 1 nmole
of oligodeoxyribonucleotide with an equal volume of formamide. Apply this mixture to
a 12% polyacrylamide TBE slab gel. To monitor the progress of the electrophoresis, load
5 µl of denaturing loading buffer in an adjacent lane. Allow electrophoresis to proceed for
30 minutes at 25 V/cm.


11. Remove the gel, place it on an intensifying screen, and view it in the dark using a 254-nm
germicidal lamp. The oligodeoxyribonucleotide should appear as a dark shadow against
a green background and should migrate more slowly than bromophenol blue. If the purity
is ≥95%, proceed to Step 12.


12. Divide the remainder of the sample into 50-nmole aliquots, transfer them to 1.5-ml
siliconized polypropylene microfuge tubes, dry them in a vacuum evaporator, and store
them at -20°C. The oligodeoxyribonucleotide is stable for at least 3 years.


13. Dissolve 10 mg (42 µmole) of p-azidophenacyl bromide in 1 ml of chloroform. Transfer
100-µl aliquots (4.2 µmole) to 1.5-ml siliconized polypropylene microfuge tubes, and dry
them in a vacuum evaporator. Wrap the tubes with aluminum foil, and store desiccated
at 4°C (stable indefinitely).


IMPORTANT: This and all subsequent steps should be carried out under subdued lighting.
Fluorescent light and daylight must be excluded. Low-to-moderate levels of incandescent light
(e.g., from a single task lamp with 60 W tungsten bulb) are acceptable.


14. Dissolve a 50-nmole aliquot of phosphorothioate oligodeoxyribonucleotide (from Step
12) in 50 µl of H₂O, and resuspend a 4.2-µmole aliquot of p-azidophenacyl bromide
(from Step 13) in 220 µl of methanol.


15. In a 1.5-ml siliconized polypropylene microfuge tube, mix:

50 µl (50 nmoles) of the phosphorothioate oligodeoxyribonucleotide solution
5 µl of 1 M potassium phosphate (pH 7.0)
55 µl (1 µmole) of the p-azidophenacyl bromide solution

Incubate the reaction for 3 hours at 37°C.
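
The reagent quantities in Steps 13-15 can be checked with a few lines of arithmetic. This is a sketch only: the molecular weight is an assumption computed here from the formula C8H6BrN3O and is not stated in the protocol, and the variable names are illustrative.

```python
# Assumed molecular weight of p-azidophenacyl bromide (C8H6BrN3O), g/mol.
MW_AZIDOPHENACYL_BROMIDE = 240.06


def micromoles(mass_mg: float, mw_g_per_mol: float) -> float:
    """Convert a mass in milligrams to micromoles."""
    return mass_mg / mw_g_per_mol * 1000.0


total_umol = micromoles(10.0, MW_AZIDOPHENACYL_BROMIDE)  # 10 mg dissolved in 1 ml (Step 13)
aliquot_umol = total_umol * (100.0 / 1000.0)             # each 100-ul aliquot (Step 13)
reaction_umol = aliquot_umol * (55.0 / 220.0)            # 55 ul of the 220-ul methanol solution (Steps 14-15)
molar_excess = reaction_umol * 1000.0 / 50.0             # relative to 50 nmoles of oligodeoxyribonucleotide

print(f"total: {total_umol:.1f} umol, per aliquot: {aliquot_umol:.2f} umol")
print(f"in reaction: {reaction_umol:.2f} umol, about {molar_excess:.0f}-fold excess over oligo")
```

Under these assumptions the reagent is present in roughly a 20-fold molar excess over the phosphorothioate oligodeoxyribonucleotide.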


16. Precipitate the derivatized oligodeoxyribonucleotide by adding 11 µl of 3 M sodium
acetate (pH 5.2) and 275 µl of ice-cold absolute ethanol. Invert the tube several times, and
store it for 30 minutes at -20°C.


17. Centrifuge the tube at 13,000g for 5 minutes at 4°C.


18. Remove the supernatant, and wash the pellet with ice-cold 70% ethanol. Air-dry the
pellet for 15 minutes at room temperature. Store at -20°C. The p-azidophenacyl-derivatized
oligodeoxyribonucleotide is stable for at least 1 year.


19. Resuspend the derivatized oligodeoxyribonucleotide (from Step 18) in 100 µl of 50 mM
triethylammonium acetate (pH 7.0).


20. Analyze a 5-µl aliquot by C₁₈ RP-HPLC to determine the efficiency of the derivatization
reaction.


a. Set the column flow rate to 1 ml/min.


b. Equilibrate the column with 10 column volumes of C₁₈ HPLC solvent A before
loading the sample.


c. After loading the sample, wash the column with 6 column volumes of C₁₈ HPLC
solvent A.


d. Elute with a 50-minute gradient of 0-70% C₁₈ HPLC solvent B in C₁₈ HPLC solvent A.


Derivatized and underivatized oligodeoxyribonucleotides elute at ~25% solvent B and
~16% solvent B, respectively.
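
To anticipate where the peaks should appear in the chromatogram, the linear gradient of Step 20d can be converted between time and solvent composition. The sketch below assumes an ideal linear gradient and ignores the system dwell volume and column dead time, so real retention times will be somewhat later; the function names are illustrative.

```python
def percent_b(t_min: float, start: float = 0.0, end: float = 70.0,
              duration_min: float = 50.0) -> float:
    """Programmed %B at time t for a linear gradient (values from Step 20d)."""
    if t_min <= 0:
        return start
    if t_min >= duration_min:
        return end
    return start + (end - start) * t_min / duration_min


def time_at_percent_b(pct: float, start: float = 0.0, end: float = 70.0,
                      duration_min: float = 50.0) -> float:
    """Gradient time (minutes) at which the programmed composition reaches pct %B."""
    if not start <= pct <= end:
        raise ValueError("composition lies outside the gradient range")
    return (pct - start) / (end - start) * duration_min


for label, pct in (("underivatized", 16.0), ("derivatized", 25.0)):
    print(f"{label}: ~{pct:.0f}% B reached at {time_at_percent_b(pct):.1f} min into the gradient")
```

With these settings the two species are programmed to elute roughly 6 minutes apart, which is the window in which the derivatized peak fractions are collected.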


21. If derivatization efficiency is ≥80%, purify the remainder of the sample using the
procedure in Step 20, collecting derivatized oligodeoxyribonucleotide peak fractions.


22. Pool the peak fractions, divide them into 1-ml aliquots, and dry them in a vacuum evap- 
orator. Store the fractions desiccated at -20°C in the dark. The purified derivatized 
oligodeoxyribonucleotide is stable for at least 1 year. 


23. Resuspend one aliquot of the pooled and dried fraction in 100 µl of TE. Remove 5 µl,
dilute with 495 µl of H₂O, and determine the concentration from UV absorbance at 260
nm (molar extinction coefficient ~242,000 M⁻¹cm⁻¹).


24. Divide the remainder of the derivatized oligodeoxyribonucleotide/TE solution from Step 
23 into 20 5-pmole aliquots and one larger aliquot. Dry them in a vacuum evaporator, 
and store desiccated at -20°C in the dark. 


The derivatization procedure yields two diastereomers in an approximate ratio of 1 to 1: one in
which azidophenacyl is incorporated at the sulfur atom corresponding to the phosphate O1P,
and one in which azidophenacyl is incorporated at the sulfur atom corresponding to the
phosphate O2P (see Fidanza et al. 1992; Mayer and Barany 1995). Depending on the
oligodeoxyribonucleotide sequence and HPLC conditions, the two diastereomers may elute as a single peak,
or as two peaks (e.g., at 24% and 25% solvent B). In most cases, no effort is made to resolve the
two diastereomers, and experiments are performed using the unresolved diastereomeric
mixture. This permits simultaneous probing of protein-DNA interactions in the DNA minor
groove (probed by the O1P-derivatized diastereomer) and the DNA major groove (probed by
the O2P-derivatized diastereomer) (Lagrange et al. 1996, 1998; Kim et al. 1997, 2000a; Wang
and Stumph 1998; Naryshkin et al. 2000).


Enzymatic Reactions 


25. Resuspend 5 pmoles of derivatized oligodeoxyribonucleotide in 12 µl of H₂O. To the
oligodeoxyribonucleotide add:


2 µl of 10x phosphorylation buffer
5 µl of [γ-³²P]ATP (50 µCi)
1 µl (10 units) of T4 polynucleotide kinase


Incubate for 30 minutes at 37°C. 


IMPORTANT: This and all subsequent steps should be carried out under subdued lighting. 
Fluorescent light and daylight must be excluded. Low-to-moderate levels of incandescent light 
(e.g., a single-task lamp with 60 W tungsten bulb) are acceptable. 


26. Add 20 µl of H₂O and mix thoroughly.


27. Desalt the radiophosphorylated derivatized oligodeoxyribonucleotide into TE using a 
CHROMA SPIN+TE-10 spin column according to the manufacturer’s protocol. 

Immediately proceed to the next step, or, if necessary, store the radiophosphorylated
derivatized oligodeoxyribonucleotide solution at -20°C in the dark. The solution is
stable for up to 24 hours.


28. In a 1.5-ml siliconized polypropylene microfuge tube, mix:

40 µl of radiophosphorylated derivatized oligodeoxyribonucleotide
1 µl of 10 µM upstream primer
4 µl of 0.5 µM single-stranded DNA template
5 µl of 10x annealing buffer

Heat the mixture for 5 minutes at 65°C.
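
The final composition of the annealing mixture in Step 28 follows from simple dilution arithmetic. The sketch below is illustrative only; the helper name and dictionary layout are assumptions, and the oligodeoxyribonucleotide is entered without a concentration because its recovery from the spin column is not specified.

```python
from typing import Dict, Optional, Tuple


def final_concentrations(
    components: Dict[str, Tuple[float, Optional[float]]]
) -> Tuple[float, Dict[str, float]]:
    """Return total volume (ul) and final concentrations after mixing.

    `components` maps a name to (volume in ul, stock concentration or None).
    Final concentrations are in the same unit as the corresponding stock.
    """
    total_ul = sum(volume for volume, _ in components.values())
    finals = {
        name: volume * stock / total_ul
        for name, (volume, stock) in components.items()
        if stock is not None
    }
    return total_ul, finals


annealing_mix = {
    "derivatized oligo": (40.0, None),        # concentration not specified here
    "upstream primer (uM)": (1.0, 10.0),
    "ssDNA template (uM)": (4.0, 0.5),
    "annealing buffer (x)": (5.0, 10.0),
}
total_ul, finals = final_concentrations(annealing_mix)
print(f"total volume: {total_ul:.0f} ul")
for name, value in finals.items():
    print(f"{name}: {value:g}")
```

The same helper can be applied to the restriction digests in Step 34 to confirm that the digestion buffer ends up close to 1x.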


29. Transfer the tube to a 600-ml beaker containing 400 ml of H₂O at 65°C. Incubate the
beaker at room temperature to permit slow cooling (65°C to 25°C in ~60 minutes).


30. Add:

2 µl of DNA polymerization mix (25 mM each dATP, dGTP, dCTP, and TTP)
1 µl of 100 mM ATP
0.5 µl (10 units) of T7 DNA polymerase
1 µl (3 units) of T4 DNA polymerase
1 µl (5 units) of T4 DNA ligase
1 µl (1 µg/ml) of single-stranded binding protein

Incubate the mixture for 5 minutes on ice, followed by 5 minutes at room temperature,
followed by 1 hour at 37°C. Perform a parallel reaction without ligase as a "no-ligase"
control.


31. Add 1 µl (5 units) of T4 DNA ligase and incubate for an additional 2 hours at 37°C.


32. Terminate the reaction by adding 1 µl of 10% SDS.


33. Desalt into TE using a CHROMA SPIN+TE-100 spin column according to the
manufacturer's protocol.


34. Perform a restriction digestion:


a. Where digestion buffers for restriction endonucleases A and B are compatible: In a
1.5-ml siliconized polypropylene microfuge tube, mix:

56 µl of site-specifically derivatized DNA/TE solution from Step 33
6.2 µl of 10x digestion buffer
1 µl (10 units) of restriction endonuclease A
1 µl (10 units) of restriction endonuclease B

Incubate for 1 hour at 37°C. Perform a parallel restriction digestion using 56 µl of
"no-ligase" control.


b. Where digestion buffers for restriction endonucleases A and B are incompatible: In a
1.5-ml siliconized polypropylene microfuge tube, mix:

56 µl of site-specifically derivatized DNA/TE solution from Step 33
6.2 µl of 10x digestion buffer having lower salt concentration
1 µl (10 units) of corresponding restriction endonuclease

Incubate for 1 hour at 37°C. To the digest, add:

7.1 µl of 10x digestion buffer having higher salt concentration
1 µl (10 units) of corresponding restriction endonuclease

Incubate for 1 hour at 37°C. Perform a parallel restriction digestion using 56 µl of
"no-ligase" control.


35. Estimate the efficiency of ligation.


a. Mix 3-µl aliquots of the restriction digest and of the "no-ligase" control restriction
digest each with 7 µl of denaturing loading buffer.

b. Heat the tubes for 5 minutes at 90°C.

c. Apply the reactions to a 5% polyacrylamide TBE slab gel.

d. Run the gel for 30 minutes at 25 V/cm.

e. Dry the gel and subject it to phosphorimaging. Estimate the ligation efficiency by
comparing the reaction and "no-ligase" control lanes. If the ligation efficiency is
≥90%, proceed to the next step. If not, repeat the preparation, beginning with Step 25.


36. To the remainder of the restriction digest (from Step 34), add:

2.5 µl of 1 mM biotin-labeled dNTP
0.25 µl of 10 mM each of other dNTPs
0.3 µl (3 units) of DNA polymerase I Klenow fragment

Incubate for 20 minutes at 25°C.


37. In order, add:

3 µl of 0.5 M EDTA (pH 8.0)
10 µl of nondenaturing loading buffer


38. Apply the entire reaction to a nondenaturing 5% polyacrylamide TBE slab gel. Run the gel
at 25 V/cm until the bromophenol blue has completely run off the gel.


39. Retrieve the derivatized DNA fragment from the gel.


a. Remove one glass plate and cover the gel with plastic wrap. Attach two autoradiogram
markers to the gel.

b. Expose X-ray film to the gel for 30-60 seconds at room temperature, and process the
film.

c. Cut out the portion of the film corresponding to the derivatized DNA fragment.

d. Using white paper as background, superimpose the cut-out film on the gel, using the
autoradiogram markers as alignment reference points.

e. Use a disposable scalpel to excise the portion of the gel corresponding to the
derivatized DNA fragment.


40. Place the excised gel slice in a 1.5-ml siliconized polypropylene microfuge tube and crush
it with a 1-ml pipette tip. Add 400 µl of elution buffer, vortex briefly, and rock the tube
for 12 hours at 37°C.


41. Transfer the supernatant to a Spin-X centrifuge filter. Centrifuge in a fixed-angle
microfuge at 13,000g for 1 minute at room temperature.


42. Transfer the filtrate to a 1.5-ml siliconized polypropylene microfuge tube. Precipitate the
derivatized DNA fragment by addition of 1 ml of ice-cold absolute ethanol. Invert the
tube several times, and store it for 30 minutes at -20°C.


43. Centrifuge the tube in a fixed-angle microfuge at 13,000g for 5 minutes at 4°C.


44. Remove and dispose of the supernatant, wash the pellet with 500 µl of ice-cold 70%
ethanol, and air-dry the pellet for 10 minutes at room temperature.


45. Resuspend the pellet in 30 µl of low-EDTA TE.


46. Determine the amount of radioactivity in the sample by Cerenkov counting.


47. Remove a 1-µl aliquot of the sample and determine its DNA concentration using a
PicoGreen double-stranded DNA quantitation kit according to the manufacturer's
protocol. Calculate the specific activity (expected specific activity ~5000 Ci/mmole).
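
The specific-activity calculation in Step 47 is a unit conversion. The sketch below assumes that the Cerenkov counts have already been converted to microcuries (the counting efficiency is instrument-specific and is not given here) and that the DNA concentration has been expressed in molar terms; the function names and example values are hypothetical.

```python
def dna_pmol(concentration_nM: float, volume_ul: float) -> float:
    """Amount of DNA in pmoles (nM x ul gives fmoles; divide by 1000)."""
    return concentration_nM * volume_ul / 1000.0


def specific_activity_ci_per_mmol(activity_uCi: float, amount_pmol: float) -> float:
    """Specific activity in Ci/mmole (1 uCi/pmole equals 1000 Ci/mmole)."""
    if amount_pmol <= 0:
        raise ValueError("DNA amount must be positive")
    return activity_uCi / amount_pmol * 1000.0


# Hypothetical example: 30 ul of a 60 nM fragment carrying 9 uCi in total.
amount = dna_pmol(60.0, 30.0)
sa = specific_activity_ci_per_mmol(9.0, amount)
print(f"{amount:.1f} pmol DNA, specific activity ~{sa:.0f} Ci/mmole")
```

A value far below the expected ~5000 Ci/mmole would suggest inefficient radiophosphorylation or the presence of unlabeled DNA in the preparation.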


48. Store the derivatized DNA fragment at -20°C in the dark. The fragment is stable for ~1
week.

PART 2: Preparation and UV Irradiation of Protein-DNA Complex


MATERIALS 


CAUTION: See Appendix 3 for appropriate handling of materials marked with <!>. 


▶ Reagents

2x Binding and washing buffer
10 mM Tris-HCl (pH 7.5)
2.0 M NaCl
1 mM EDTA

Dynabeads M-280 streptavidin-coated paramagnetic beads (10 mg/ml) (Dynal)

1x Protein-binding buffer
Protein-binding buffer should contain components required for complex formation by protein(s)
of interest and also should contain 50 µg/ml of acetylated bovine serum albumin (BSA), which
reduces nonspecific adsorption of protein(s) onto beads and surface, and 1x protease inhibitor
cocktail (Sigma), which reduces protease degradation of protein(s) of interest.

3x Quench solution
3 M urea <!>
0.3 M Nal <!>
0.3 M β-mercaptoethanol <!>


▶ Equipment

Borosilicate glass culture tube (13 x 100 mm)
Dry bath and heating blocks
Optional: For complex formation at temperatures above ambient, see Step 13.
Magnetic particle concentrator (Dynal)
Microfuge tubes (siliconized polypropylene)
Photochemical reactor (Rayonet RPR-100)
The reactor should be equipped with an RMA400 sample holder and 16 RPR-3500 Å tubes (Southern
New England Ultraviolet).
Photochemical reactors should be available at most chemistry departments. With appropriate
modifications to the protocol, other photon sources can be substituted, including 305-310-nm UV
lasers (S. Druzhinin et al., unpubl.), long-wave cross-linking units (Wang and Stumph 1998), and
long-wave germicidal lamps (Bartlett et al. 2000) (the latter two photon sources being only
marginally satisfactory, due to the requirement for relatively long UV-irradiation times).
Polystyrene microfuge tubes (04-978-145, Fisher Scientific)
Radioactivity survey meter (for ³²P, Ludlum Measurements)
Refrigerated bath
Optional: For complex formation at temperatures below ambient, see Step 13.


▶ Biological Molecules

Biotin-labeled, site-specifically derivatized DNA fragment (60 nM in low-EDTA TE)
From Part 1, Step 48.
Protein(s) of interest


METHOD 

1. Resuspend the paramagnetic beads by gently shaking the storage vial. Transfer a 3-µl
aliquot into a 1.5-ml siliconized polypropylene microfuge tube, and centrifuge the tube
briefly in a microfuge.


2. Place the tube into a magnetic particle concentrator, wait 30 seconds, and remove the
supernatant.


3. Resuspend the beads in 30 µl of 1x binding and washing buffer, and mix them gently
using a micropipette. Incubate the beads for 2 minutes at room temperature.


4. Centrifuge the tube briefly in a microfuge.


5. Place the tube in a magnetic particle concentrator, wait for 30 seconds, and then remove
the supernatant. Repeat Steps 3-5.


6. Resuspend the bead pellet in 10 µl of 2x binding and washing buffer. Add 2 µl of 60 nM
biotin-labeled, site-specifically derivatized DNA fragment in low-EDTA TE (from Part 1,
Step 48). Incubate the tube for 10 minutes at 25°C, mixing occasionally by gently tapping
the tube.


7. Place the tube into a magnetic particle concentrator for 30 seconds. Transfer the
supernatant to a new 1.5-ml siliconized polypropylene microfuge tube.


8. Estimate the binding efficiency of the biotin-labeled, site-specifically derivatized DNA
fragment by comparing the radioactivity of the bead pellet and supernatant. If the binding
efficiency is ≥80%, proceed to the next step. If not, repeat the preparation beginning
with Part 1, Step 25.
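
The binding-efficiency check in Step 8 compares two radioactivity readings. A minimal sketch follows, assuming both readings are taken with the same instrument and geometry; the function names and example counts are hypothetical, while the 80% threshold is the one given in Step 8.

```python
def fraction_bound(bead_signal: float, supernatant_signal: float) -> float:
    """Fraction of total radioactivity retained on the bead pellet."""
    total = bead_signal + supernatant_signal
    if total <= 0:
        raise ValueError("total signal must be positive")
    return bead_signal / total


def binding_acceptable(bead_signal: float, supernatant_signal: float,
                       threshold: float = 0.80) -> bool:
    """True if the bound fraction meets the threshold (80% in Step 8)."""
    return fraction_bound(bead_signal, supernatant_signal) >= threshold


# Hypothetical survey-meter readings (arbitrary units).
print(f"bound fraction: {fraction_bound(8500, 1500):.2f}")
print("proceed" if binding_acceptable(8500, 1500) else "repeat from Part 1, Step 25")
```

The same ratio-against-threshold logic applies to the derivatization (≥80%) and ligation (≥90%) checks in Part 1, with the appropriate signals and thresholds substituted.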


IMPORTANT: This and all subsequent steps should be carried out under subdued lighting. 
Fluorescent light and daylight must be excluded. Low-to-moderate levels of incandescent light 
(e.g., from single task lamp with 60 W tungsten bulb) are acceptable. 


9. Resuspend the bead pellet in 30 µl of 1x binding and washing buffer. Centrifuge the beads
briefly in a microfuge.


10. Place the tube into a magnetic particle concentrator, wait for 30 seconds, and then
remove the supernatant.


11. Resuspend the bead pellet in 30 µl of 1x protein-binding buffer and centrifuge the beads
briefly in a microfuge.


12. Place the tube into a magnetic particle concentrator, wait 30 seconds, and remove the
supernatant.
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13. 


14. 


15. 


17. 


18. 


19. 


20. 


21. 


Resuspend the bead pellet in 1x protein-binding buffer to the desired DNA concentra- 
tion. Add the protein(s) to the desired concentration, Incubate the mixture at the desired 
temperature for the desired time, mixing occasionally by gently tapping the side of the 
tube. 


In this and subsequent steps, DNA and protein concentrations, reaction temperature, and time 
are determined empirically and/or are based on previous knowledge about the interactions of 
the components within the complex. 


14. Centrifuge the reaction briefly in a microfuge.


15. Place the tube into a magnetic particle concentrator, wait for 10 seconds, and remove the
supernatant.


16. Resuspend the beads in 50 µl of 1x protein-binding buffer preincubated at the desired
temperature. Incubate for 2 minutes at the desired temperature.


17. Place the tube into a magnetic particle concentrator, wait 30 seconds, and remove the
supernatant.


Optionally, in this step, add challengers, inhibitors, or effectors (see Kim et al. 2000a; Naryshkin 
et al. 2000). 


18. Resuspend the bead pellet in 20 µl of 1x protein-binding buffer preincubated at the
desired temperature and transfer the mixture to a polystyrene microfuge tube. Insert the
polystyrene tube inside a 13 x 100-mm borosilicate glass culture tube preincubated at the
desired temperature.


19. Place the tube in the photochemical reactor. UV-irradiate the protein-DNA sample for 20
seconds (11 mJ/mm² at 350 nm).


If the desired reaction temperature is at least 10°C higher or lower than the ambient
temperature, fill the borosilicate glass culture tube with H2O, and preincubate it until the H2O
reaches the desired reaction temperature.


20. Transfer the suspension to a 1.5-ml siliconized polypropylene microfuge tube. Add 7 µl
of quench solution, vortex briefly, and spin the tube briefly in a microfuge.


21. Place the tube into a magnetic particle concentrator, wait for 10 seconds, and remove 17
µl of the supernatant. Immediately proceed to Part 3.
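
The binding-efficiency check in Step 8 of this part can be sketched as a small calculation. This is only an illustrative sketch: the function names are hypothetical, the inputs are assumed to be background-corrected radioactivity counts (e.g., cpm) for the bead pellet and the supernatant, and the default threshold of 0.80 reflects one reading of Step 8 and should be adjusted as needed.

```python
def binding_efficiency(pellet_counts, supernatant_counts):
    """Fraction of labeled DNA retained on the beads.

    Both arguments are assumed to be background-corrected counts
    measured on the same instrument with the same settings.
    """
    total = pellet_counts + supernatant_counts
    if total <= 0:
        raise ValueError("total counts must be positive")
    return pellet_counts / total


def should_proceed(pellet_counts, supernatant_counts, threshold=0.80):
    """Return True if the bound fraction meets the (assumed) threshold."""
    return binding_efficiency(pellet_counts, supernatant_counts) >= threshold


if __name__ == "__main__":
    # Hypothetical counts, for illustration only.
    print(binding_efficiency(9000, 1000))
    print(should_proceed(7000, 3000))
```

Expressing the result as pellet / (pellet + supernatant) keeps the estimate independent of the absolute amount of label used in a given preparation.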
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PART 3: Nuclease Digestion 


MATERIALS 


CAUTION: See Appendix 3 for appropriate handling of materials marked with <!>.


Reagents

CaCl2 (55 mM) <!>

5x SDS-loading buffer
300 mM Tris-HCl (pH 8.0)
10% SDS <!>
24 mM EDTA
25% β-mercaptoethanol <!>
0.1% bromophenol blue <!>
50% glycerol


Equipment
Heating block preset to 70°C 


Biological Molecules

DNase I (240 units/µl)
DNase I should be diluted to 10 units/µl with 20 mM sodium acetate (pH 6.5), 5 mM CaCl2, 0.1 mM
PMSF, and 50% glycerol.

Micrococcal nuclease
Micrococcal nuclease should be dissolved and diluted to 10 units/µl with 5 mM CaCl2, 0.1 mM
PMSF, and 50% glycerol.

UV-irradiated protein(s)-DNA complex of interest (from Part 2, Step 21)


METHOD 


1. To 10 µl of the bead suspension (from Part 2, Step 21), add:


1 µl of 55 mM CaCl2
0.5 µl (5 units) of micrococcal nuclease
0.5 µl (5 units) of DNase I


Incubate for 5 minutes at 37°C.
2. Terminate the reaction by adding 3 µl of 5x SDS-loading buffer and heating the tube for
3 minutes at 70°C.


Occasionally (rarely), the digestion reaction yields doublets of bands (resulting from
incomplete digestion). In such cases, perform an alternative digestion procedure.


a. To 10 µl of the bead suspension (from Part 2, Step 21), add 1 µl of 55 mM CaCl2 and 0.5
µl (5 units) of DNase I. Incubate for 10 minutes at 37°C.


b. Terminate the reaction by adding 1 µl of 10% SDS and heating for 5 minutes at 65°C.


c. After the sample has cooled to room temperature, add 1 µl of 25 mM ZnCl2, 1 µl of 1 M
acetic acid <!>, and 1 µl (30 units) of S1 nuclease. Incubate for 10 minutes at 37°C.


d. Terminate the reaction by adding 3 µl of 5x SDS-loading buffer, and heating for 3
minutes at 70°C.
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PART 4: Identification of Cross-linked Proteins 


MATERIALS


CAUTION: See Appendix 3 for appropriate handling of materials marked with <!>.


Reagents

Polyacrylamide (37.5:1 acrylamide:bisacrylamide <!>) slab gel
Use a gel with a polyacrylamide concentration (or polyacrylamide-concentration gradient) that
results in maximal resolution in the molecular-weight range of protein(s) of interest.

Prestained protein molecular-mass markers (7-210 kD) (e.g., Bio-Rad)

1x SDS running buffer
25 mM Tris
250 mM glycine (pH 8.3) <!>
0.1% SDS <!>


Equipment

Apparatus for autoradiography or phosphorimaging
Gel dryer


Biological Molecules

Nuclease-digested protein(s)-DNA complex of interest (from Part 3, Step 2)


METHOD


1. Apply the entire sample (16 µl, from Part 3, Step 2) to a polyacrylamide slab gel. Load 5
µl of prestained protein molecular-mass markers into an adjacent lane.


2. Run the gel in SDS running buffer at 25 V/cm until the bromophenol blue runs off the 
gel. 


3. Dry the gel, and perform autoradiography or phosphorimaging. 


CASE STUDY: ANALYSIS OF ESCHERICHIA COLI RNA POLYMERASE TRANSCRIPTION INITIATION
COMPLEXES


The procedure described in this protocol has been successfully applied to analysis of 
Escherichia coli RNA polymerase transcription initiation complexes (Naryshkin et al. 2000, 
2001, and unpubl.). The results have permitted analysis of the structural organization of 
the RNA polymerase-promoter open complex (RPo; Naryshkin et al. 2000), analysis of the 
structural organization of a putative intermediate in formation of the RNA polymerase- 
promoter open complex (RP,s.c3 N. Naryshkin et al., unpubl.), and analysis of the effects of 
a transcriptional activator on the structural organization of RPo and RP sc (Naryshkin et 
al. 2000 and unpubl.).


PROTOCOL 8 


Analysis of Protein Interactions In Vivo with FRET 


THIS PROTOCOL WAS CONTRIBUTED BY Marc Damelin and Pamela Silver (Dana-Farber Cancer
Institute, Boston, Massachusetts).

The protocol presented here focuses on a standard microscope-based assay for budding
yeast, but many aspects are applicable to other systems and assays. The Experimental Design
and Data Analysis procedures can be extended to FRET experiments in any system,
including mammalian cell lines and other model organisms. The Strain Construction procedure
can be applied to studies in yeast using fluorimetry instead of microscopy (Overton and
Blumer 2000). Although the data acquisition and analysis described below can be a daunting
task, careful quantitative analysis is absolutely necessary due to a high degree of cross-talk
and many unknown parameters that vary from cell to cell.


MATERIALS


Reagents

Molecular biology reagents
Use standard molecular biology reagents as described by Ausubel et al. (1997) and Sambrook and
Russell (2001).

Synthetic complete (SC) liquid media with 2% glucose
Use for the growth of yeast cells to be analyzed by FRET. The media should be less than 2 months
old.

Yeast manipulation reagents
Use the reagents described by Guthrie and Fink (2002).


Equipment

Filter sets

CFP filter set to visualize CFP (Donor)

YFP filter set to visualize YFP (Acceptor)

FRET filter set to visualize "sensitized emission," which is YFP emission resulting
indirectly from CFP excitation.
Each set is composed of an excitation filter, a dichroic mirror, and an emission filter. Filters that
have been optimized for CFP and YFP are available from Chroma Technology Corp. (or
alternatively from Omega Optical). The filter specifications are shown in Table 10.5. Depending on
the microscope being used, the filters can be installed either in three individual filter cubes or on
filter wheels (which can eliminate duplicates). It is important that the user be able to switch
among filter sets easily and quickly.

TABLE 10.5. Filters required for CFP-YFP FRET

Filter set    Excitation      Dichroic     Emission
CFP           440 nm/20 nm    455 nm LP    480 nm/30 nm
YFP           500 nm/25 nm    525 nm LP    545 nm/35 nm
FRET          440 nm/20 nm    455 nm LP    535 nm/25 nm

Table courtesy of Marc Damelin and Pamela Silver (Dana-Farber Cancer Institute).


Image analysis software
The software must be able to operate the shutter, allowing precise exposures, and to analyze images
by calculating the average pixel intensity within a specified region.

Microscope equipment
The microscope must be outfitted with a camera, an electronic shutter, specialized filters, and
appropriate light source. See the panel on EQUIPMENT FOR IN VIVO FRET ANALYSIS below.


Standard equipment for molecular biology and yeast manipulations 


Biological Molecules

α-GFP antibody
The antibody is used to check expression of the fusion proteins on western blots.


DNA encoding CFP and YFP
It is often most convenient to procure yeast vectors designed for FRET from various laboratories
(Damelin and Silver 2000, 2002; Overton and Blumer 2000), but the genes are also available
commercially (e.g., CLONTECH). For details on the vectors and their use, see Step 1.


DNA fragments encoding portions of the endogenous proteins of interest 


Cells

Yeast strains
A wild-type haploid ura3 trp1 strain, such as FY23 (Winston et al. 1995), serves as the parent strain.
It is strongly preferable that the strain be cream-colored (not pink or red, e.g., in ade2 mutants),
because the pigment obscures FRET signals. Additional yeast strains may be needed to test the
functionality of the fusion proteins. For each tagged protein, conditions in which the protein is
essential for viability should be identified. If viability cannot be assayed under specific growth
conditions, yeast strains with mutations that make the tagged protein essential for viability (synthetic
lethal mutations) should be obtained. Yeast cultures for microscopy are grown on a roller drum at
room temperature or at 25°C.


EQUIPMENT FOR IN VIVO FRET ANALYSIS
Listed below is one example of an acceptable setup (Damelin and Silver 2000): 


Nikon Diaphot-300 inverted epifluorescence microscope with a 60 x 1.4 NA objective 
100-W mercury lamp <!> 

Photometrics 200-series liquid-cooled charge-coupled device (CCD) camera 

Uniblitz D122 shutter and shutter driver 

Appropriate filters 


Use of a confocal microscope may be considered but is not necessary for this assay in
yeast. Software such as the MetaMorph Imaging System (Universal Imaging Corp.) can
handle data acquisition and image analysis.
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METHOD 


Experimental Design 


Due to the complex nature of in vivo FRET experiments, the valid interpretation of the data 
requires experimental controls that satisfy several criteria. A detailed understanding of the 
issues surrounding these experiments is extremely useful in the planning stage, as well as
during image acquisition and data analysis. For an elaborate and very helpful discussion, see
Gordon et al. (1998). The symbols introduced by these authors have been adopted in this 
protocol (see Table 10.6 and the panel on AN EXPLANATION OF SYMBOLS USED FOR ANALYSIS OF 
FRET DATA). For applications of this protocol, see Damelin and Silver (2000, 2002). 

In its simplest form, the FRET assay is used to investigate a potential interaction between
two proteins, Target1 and Target2. Other endogenous proteins (Control1, Control2, ...) that do
not interact with Target1 serve as the negative controls. As described below in Data Analysis
and in Gordon et al. (1998), in vivo FRET measurements are not absolute, and they are
meaningful only when evaluated statistically relative to similar measurements (i.e., interacting pair
vs. noninteracting pair). A negative control is therefore essential to establish a basis for
comparison. Using more than one Control protein is preferable, because the additional data create
a broader framework for interpretation. Each Control protein must satisfy three requirements:


1. It should not interact with Target1.

2. It should colocalize with Target2.

3. Its expression level should be comparable to that of Target2.


Control proteins are typically other endogenous proteins, but there are possible
variations. A mutant Target2 protein may serve as a control, provided its expression level and
localization are comparable to wild-type Target2. However, the use of at least one Control
protein besides Target1 and Target2 is encouraged.

In the experimental design shown in Figure 10.30, Target1 is fused to CFP, and Target2
and all Control proteins are fused to YFP. This scheme derives from the fact that CFP gener- 
ates most of the cross-talk in the FRET filter set. Consequently, only pairs with the same CFP 
fusion protein should be compared directly. 


FIGURE 10.30. Standard in vivo FRET experimental design. An endogenous protein (Target1),
tagged with CFP, is paired with each of several other endogenous proteins (Target2, Control1,
Control2) tagged with YFP. Because in vivo FRET measurements are not absolute, results can be
evaluated only by the direct comparison of measurements for an interacting pair (top panel) and
noninteracting pairs (middle and bottom panels). (Courtesy of Marc Damelin and Pamela Silver,
Dana-Farber Cancer Institute.)
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AN EXPLANATION OF SYMBOLS USED FOR ANALYSIS OF FRET DATA 


The FRET measurements discussed in this protocol use three filter sets, which are termed 
the Donor, FRET, and Acceptor (or CFP, FRET, and YFP, respectively). These filter sets are 
designed to isolate and maximize three signals: the donor fluorescence, the acceptor fluo- 
rescence due to FRET, and the directly excited acceptor fluorescence, respectively. The exci- 
tation filters for the donor filter set and the FRET filter set are either the same filter or two 
matched filters. Neutral density filters may be used to match the signals from the three fil- 
ter sets to the dynamic range of the detector. Defined below are the FRET symbols used in 
this protocol which represent the signals using the type of filter set (Donor, FRET or 
Acceptor), the fluorochromes present in the sample (donor only, acceptor only, or both 
donor and acceptor), and the signal from either just the donor or acceptor when both are 
present in the sample. Each symbol starts with an uppercase letter representing the filter set: 


D for the Donor (CFP) filter set 
F for the FRET filter set 
A for the Acceptor (YFP) filter set 


The second letter is lowercase and indicates which fluorochromes are present in the
specimen:


d for donor only (CFP) 
a for acceptor only (YFP) 
f for both donor and acceptor present (so FRET is possible) (CFP and YFP) 


TABLE 10.6. Some FRET symbols and their interpretation

Symbol  Filter set  Fluorochromes present  Meaning
Dd      donor       donor                  signal from a donor-only specimen using the donor filter cube
Fd      FRET        donor                  signal from a donor-only specimen using the FRET filter set
Ad      acceptor    donor                  signal from a donor-only specimen using the acceptor filter set
Da      donor       acceptor               signal from an acceptor-only specimen using the donor filter cube
Fa      FRET        acceptor               signal from an acceptor-only specimen using the FRET filter set
Aa      acceptor    acceptor               signal from an acceptor-only specimen using the acceptor filter set
Df      donor       donor and acceptor     signal from a FRET specimen using the donor filter cube
Ff      FRET        donor and acceptor     signal from a FRET specimen using the FRET filter set
Af      acceptor    donor and acceptor     signal from a FRET specimen using the acceptor filter set

Adapted, with permission, from Gordon et al. (1998).
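
The two-letter naming scheme described in this panel can be expressed as a short lookup. The sketch below is illustrative only: the dictionary and function names are hypothetical, and the wording of the descriptions is a paraphrase of the panel rather than text from Gordon et al. (1998).

```python
# First letter (uppercase): filter set used to acquire the image.
FILTER_SETS = {"D": "donor (CFP)", "F": "FRET", "A": "acceptor (YFP)"}

# Second letter (lowercase): fluorochromes present in the specimen.
SPECIMENS = {"d": "donor only", "a": "acceptor only", "f": "donor and acceptor"}


def describe(symbol):
    """Expand a two-letter FRET symbol such as 'Fd' into words."""
    if len(symbol) != 2:
        raise ValueError("symbol must have exactly two letters")
    filter_letter, specimen_letter = symbol[0], symbol[1]
    if filter_letter not in FILTER_SETS or specimen_letter not in SPECIMENS:
        raise ValueError("unknown symbol: " + symbol)
    return "{} filter set, {} specimen".format(
        FILTER_SETS[filter_letter], SPECIMENS[specimen_letter]
    )


def all_symbols():
    """List the nine symbols formed by combining the two letters."""
    return [f + s for f in FILTER_SETS for s in SPECIMENS]


if __name__ == "__main__":
    for sym in all_symbols():
        print(sym, "->", describe(sym))
```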


Strain Construction 


1. Construct vectors for tagging the endogenous proteins with CFP or YFP. 


Integrating (nonreplicating) vectors are best suited for this purpose, because the fusion replaces
the endogenous copy in the genome and is expressed at comparable levels from cell to cell. The
vectors have three primary components: a yeast selection marker such as TRP1 or URA3, an
in-frame fusion between the endogenous gene and CFP/YFP, and either a 3'-untranslated region
(for carboxy-terminal fusions only) or a promoter (for amino-terminal fusions only). All CFP
vectors should have one selection marker (e.g., TRP1) and all YFP vectors should have another
marker (e.g., URA3).

Only a short fragment of the endogenous gene needs to be used because its sole purpose is to
target the integration; the genomic copy provides most of the coding sequence. Thus, a short
fragment of the gene corresponding to each endogenous protein can be amplified by PCR and
inserted into a cassette such as pPS1890 (CFP) or pPS1891 (YFP) (Damelin and Silver 2000).
These particular cassettes yield fusions with CFP/YFP at the carboxyl terminus of the
endogenous protein. Once integrated into the genome, these fusions are expressed under the
endogenous promoter.


EXPERIMENTAL TIP: For some Target proteins, it may be desirable to generate two fusions:
one at the amino terminus and one at the carboxyl terminus. Because the FRET signal is
highly dependent on the interfluorophore distance, the presence of a fluorophore on one domain
of a protein may not yield a FRET signal if another domain is responsible for the
protein-protein interaction in question. Performing the experiment with different fusions increases the
chance of detecting FRET. Additionally, a fusion to one terminus of a particular target protein
may not be functional, in which case, the other fusion should be used.


When designing the integration vectors, it is imperative to ensure that the final vector has 
a unique restriction site within the DNA encoding the endogenous protein, so that the vec- 
tor can be linearized prior to the yeast transformation. There should be at least 100 bp of 
genomic DNA on each side of the restriction site to allow efficient recombination. Ideally, 
the linearization site will be absent in both TRP1 and URA3, as subsequent experiments
may involve swapping the fluorophore tags. 

As an alternative to vectors, DNA encoding the fluorescent protein and the selection mark- 
er can be amplified by PCR with primers designed for integration at the desired site. This 
method may be less efficient since there is less genomic DNA to target the recombination. 


2. For each fusion protein, transform a wild-type yeast strain (described under Materials) 
with the linearized vector. Isolate several transformants for further analysis. 


3. Grow the transformants in liquid media (SC or drop-out) to log phase, and check expres- 
sion and localization of the protein fusion under the microscope. 


4. For transformants expressing the protein fusion, perform western analysis on whole-cell
lysates, probing with α-GFP antibody, to confirm the expression of the full-length fusion.


5. Determine the functionality of each fusion protein by assaying cell viability, expression of 
the fusion protein, and correct localization of the fusion protein, under conditions in 
which the fusion protein is required.


For instance, in one study, Nup82p-YFP was assayed in wild-type cells because NUP82 is
essential for growth, whereas Nup188p-YFP was assayed in nup170Δ cells because NUP188 is
essential in that genetic background (Damelin and Silver 2000).


6. If the fusion proteins are functional, generate double-labeled strains for the FRET exper- 
iments, for example, by transforming the Target1-CFP strain with each of the YFP con- 
structs. Repeat Steps 3 and 4 to verify coexpression of the fusions. 


7. Prepare glycerol stocks of the yeast strains selected for further analysis, and store them at -80°C.


Exposure Settings Calibration 


The exposure settings for a given filter set must be the same during image acquisition for all
strains with the same Target-CFP fusion. The exposure settings for the three filter sets can be
different from one another.


8. To prepare the samples, grow the cells to log phase in SC media at room temperature or
at 25°C.


9. Acquire sample images of the double-labeled strains in the following order, to minimize
the effects of photobleaching:


• FRET filter set
• YFP filter set
• CFP filter set


10. Determine the optimal exposure settings by adjusting the hardware and software para- 
meters, most notably the exposure times. 


Exposures should be as short as possible to minimize effects of photobleaching or photodam- 
age. However, the resulting images must have adequate signal intensity over background and 
must be amenable to analysis (e.g., check that the desired region of the cell can be selected and 
analyzed with the software program). Many software applications have a "Scale Image" feature 
that can be used to change the visualization of images without changing the actual data, and 
thus may allow shorter exposures. 


11. Pipette 2 ul of the sample onto a standard glass slide, gently place a coverslip on the slide, 
and examine immediately. 


The coverslips must match the objective; typically, use a No. 1.5 coverslip over the sample. 


EXPERIMENTAL TIP: Slides should not be prepared in advance and should not be used for 
more than 5-10 minutes. 


The exposure settings depend on the equipment used as well as the expression level of the
protein fusions. Accordingly, in independent sets of experiments where Target1 represents distinct
endogenous proteins, it may be necessary to have different exposure settings. The settings must
be the same only for experiments within which the data are directly compared.


Image Acquisition 


The most critical aspects of image acquisition are maintaining the same settings and order of 
exposures for each cell. Acquiring images with all three filter sets records all possible infor- 
mation and permits many different levels of analyses, as described below and in Gordon et 
al. (1998). If the software allows, it is convenient to use a macro for acquiring three exposures 
(FRET, then YFP, then CFP) of a cell with the optimal settings determined above in Exposure 
Settings Calibration. Delays between exposures may be necessary for changing filter sets. For
many analysis methods, it is easiest to have images containing only one cell. Therefore, either 
the region of image acquisition should be limited to a small box in the center of the field or 
small subregions of larger images should be saved separately. 

CFP and YFP are extremely sensitive to photobleaching, and certain precautions should
be taken to minimize the effects. Cells should not be visualized by fluorescence before images
are acquired, because significant photobleaching occurs even after seconds. Thus, white light
(as opposed to the fluorescence lamp) should be used to find and focus on each cell. Similarly, 
each slide should be used to image only four to five cells, located in different regions of the 
slide. 

The number of cells per strain needed for analysis depends on the specific conditions of 
the experiment and may be difficult to determine beforehand. A good starting point is to 
image 20 cells per strain; this includes only cells that coexpress both fusion proteins and pro- 
vide images amenable to analysis. 
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Data Analysis 


12. Acquire images with all filter sets of empty fields to measure background levels.


13. Acquire images with all filter sets of cells expressing only Target1-CFP, to determine Fd,
Dd, and Ad (see Gordon et al. 1998, Table 10.6, and the panel above on AN EXPLANATION
OF SYMBOLS USED FOR ANALYSIS OF FRET DATA). Prepare the samples as described in Steps
8-11.


In most cases for CFP, when using the filter sets described above, Ad ≈ 0.


14. Acquire images with all filter sets of cells expressing only Target2-YFP, to determine Fa,
Da, and Aa. These values depend on the fluorophore and the experimental conditions but
not the strain, so these measurements do not need to be repeated for all strains
expressing a YFP fusion.


In most cases for YFP, when using the filter sets described above, Da ≈ 0.


15. Acquire images with all filter sets for each of the double-labeled strains.


IMPORTANT: Images must be analyzed on a cell-by-cell basis. 


16. For a given cell, open the three images corresponding to the FRET, YFP, and CFP filter
sets.


17. Select a region of the cell for analysis. The regions in all three images should be
comparable with respect to region area, as determined by the software and confirmed by eye.


Several functions available on imaging software applications may be suitable for this purpose.
Examples include a function that allows a region to be selected according to pixel intensity
("Threshold") and a function that allows a region to be drawn manually on one image and
copied to other images.


18. Log (to a text file or spreadsheet file) the average pixel intensity, for the selected region, in
each image.


19. Repeat Steps 16-18 for each cell. To facilitate the spreadsheet analysis, log the values for
FRET, YFP, and CFP images in the same order for each cell.


20. For each background image (i.e., acquired in Step 12), log the average intensity of a box
in the center of the image.


21. Open the data file in a spreadsheet application.


22. Determine the background signal in each filter set by calculating the average value from
all background images in that filter set.


23. Subtract the background signal from all the cell measurements (i.e., data from Steps
13-15) in the respective filter sets. Use these background-corrected cell measurements in all
subsequent calculations.


24. Determine Fd/Dd from the CFP strain by calculating the average Fd/Dd ratio. This
average value can be treated as a constant in Step 26.


25. Determine Fa/Aa from the YFP strain by calculating the average Fa/Aa ratio. This
average value can be treated as a constant in Step 26.
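
The background subtraction and cross-talk ratio calculations described above are normally done in a spreadsheet, but they can also be sketched in a few lines of code. The following is a minimal illustration under stated assumptions: the function names and the dictionary layout (one mean intensity per filter set, keyed "FRET", "YFP", and "CFP") are hypothetical, and the numbers are invented for demonstration.

```python
from statistics import mean


def background_levels(background_images):
    """Average the logged background intensity for each filter set.

    background_images: list of dicts, one per empty-field acquisition,
    mapping a filter-set name to the mean intensity of the central box.
    """
    filter_sets = background_images[0].keys()
    return {fs: mean(img[fs] for img in background_images) for fs in filter_sets}


def subtract_background(cell, background):
    """Return background-corrected intensities for one cell."""
    return {fs: value - background[fs] for fs, value in cell.items()}


def average_ratio(cells, numerator, denominator):
    """Average of per-cell ratios, e.g. Fd/Dd for a CFP-only strain."""
    return mean(c[numerator] / c[denominator] for c in cells)


if __name__ == "__main__":
    # Invented example values, for illustration only.
    backgrounds = background_levels([
        {"FRET": 100, "YFP": 90, "CFP": 200},
        {"FRET": 102, "YFP": 92, "CFP": 200},
    ])
    cfp_only_cells = [
        {"FRET": 401, "YFP": 91, "CFP": 800},
        {"FRET": 251, "YFP": 91, "CFP": 500},
    ]
    corrected = [subtract_background(c, backgrounds) for c in cfp_only_cells]
    # For a CFP-only strain, the FRET image gives Fd and the CFP image gives Dd.
    print(average_ratio(corrected, "FRET", "CFP"))
```

The same `average_ratio` call applied to the YFP-only strain, with the FRET and YFP images, would give Fa/Aa.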
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26. For each cell in each of the double-labeled strains, calculate some of the measures of FRET 
described previously, such as Ff/Df, Fc/Df, and FRET2 (Gordon et al. 1998). Cells with
outlying or aberrant (e.g., negative) values should be excluded from further analysis.


Values obtained from these calculations do not provide an absolute measure of FRET, because
they depend on experimental parameters such as exposure time and because in vivo FRET
experiments involve many unknowns. The values are meaningful only when compared among
the different strains. Moreover, values for the various calculations (Ff/Df, Fc/Df, and FRET2) for
a given cell can be very different from one another.


Fc represents a calculated FRET value termed "corrected FRET" (Gordon et al. 1998; Youvan
et al. 1997).


Fc = Ff - Df(Fd/Dd) - Af(Fa/Aa)


Fc separates the FRET signal from the non-FRET signal if Ad and Da are effectively zero.
However, if Ad and Da are not zero, then Fc does not correct for the resulting cross-talk, and
the FRET versus non-FRET signals are not separated. Ad and Da can often be made
effectively zero by appropriate choice of fluorochromes and filter sets. Fc can be approximately
normalized for the concentration of donor by using Fc/Df.


FRET2 is a calculated FRET value, which is beyond the scope of this protocol to derive. For 
details, see Gordon et al. (1998) and Damelin and Silver (2002). 
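
As a worked illustration of the corrected-FRET calculation in Step 26, the sketch below computes, for a single cell, the corrected FRET value (written here as Fc) together with the donor-normalized ratios Ff/Df and Fc/Df. It assumes background-corrected intensities and previously determined constants Fd/Dd and Fa/Aa; the function names and numerical values are hypothetical, and FRET2 is deliberately not implemented because its derivation is outside this protocol.

```python
def corrected_fret(Ff, Df, Af, fd_over_dd, fa_over_aa):
    """Fc = Ff - Df*(Fd/Dd) - Af*(Fa/Aa).

    Ff, Df, Af: background-corrected intensities of one double-labeled
    cell in the FRET, donor (CFP), and acceptor (YFP) filter sets.
    fd_over_dd, fa_over_aa: average cross-talk ratios from the
    donor-only and acceptor-only strains.
    """
    return Ff - Df * fd_over_dd - Af * fa_over_aa


def fret_measures(Ff, Df, Af, fd_over_dd, fa_over_aa):
    """Return the simple per-cell measures used for strain comparison."""
    fc = corrected_fret(Ff, Df, Af, fd_over_dd, fa_over_aa)
    return {"Ff/Df": Ff / Df, "Fc": fc, "Fc/Df": fc / Df}


def is_usable(measures):
    """Flag cells with aberrant (negative) values for exclusion."""
    return all(value >= 0 for value in measures.values())


if __name__ == "__main__":
    # Invented intensities and ratios, for illustration only.
    m = fret_measures(Ff=1000.0, Df=800.0, Af=500.0,
                      fd_over_dd=0.5, fa_over_aa=0.2)
    print(m, is_usable(m))
```

This correction is valid only under the condition stated above, namely that Ad and Da are effectively zero for the chosen filter sets.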


27. Compare values for the cells in different strains. Only values for a given calculation (Ff/Df,
Fc/Df, or FRET2) can be compared directly.


• For a first approximation: Calculate the mean and standard deviation for Ff/Df, Fc/Df,
and FRET2 for each strain.


• For statistical analysis: Compare the values for two given strains using the Rank Sum
Test, which compares the data sets about their medians and is appropriate for small
samples in which the values are not necessarily distributed evenly about their mean.
The standard threshold for the Rank Sum Test is p < 0.05, but a more stringent threshold
(e.g., p < 0.02) may be used.
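
The rank sum comparison of two strains can be sketched as follows. This is a minimal, standard-library-only illustration of the Wilcoxon rank-sum (Mann-Whitney) test using the normal approximation with tie and continuity corrections; it is an assumption that this approximation is adequate for the sample sizes at hand (for very small samples an exact test from a statistics package is preferable). The function name and the example values are hypothetical.

```python
import math


def rank_sum_test(x, y):
    """Two-sided rank-sum test; returns (U statistic for x, p value)."""
    n1, n2 = len(x), len(y)
    pooled = sorted([(v, 0) for v in x] + [(v, 1) for v in y])

    # Assign 1-based ranks, giving tied values their average rank.
    ranks = [0.0] * len(pooled)
    tie_term = 0.0
    i = 0
    while i < len(pooled):
        j = i
        while j + 1 < len(pooled) and pooled[j + 1][0] == pooled[i][0]:
            j += 1
        average_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[k] = average_rank
        t = j - i + 1
        tie_term += t ** 3 - t
        i = j + 1

    rank_sum_x = sum(r for r, (_, group) in zip(ranks, pooled) if group == 0)
    u = rank_sum_x - n1 * (n1 + 1) / 2

    n = n1 + n2
    mu = n1 * n2 / 2
    variance = n1 * n2 / 12 * ((n + 1) - tie_term / (n * (n - 1)))
    if variance == 0:
        return u, 1.0

    # Continuity-corrected z score; two-sided p from the normal distribution.
    z = max((abs(u - mu) - 0.5) / math.sqrt(variance), 0.0)
    p = math.erfc(z / math.sqrt(2))
    return u, p


if __name__ == "__main__":
    # Invented per-cell Fc/Df values for two strains, for illustration only.
    target_pair = [1.1, 1.2, 1.3, 1.4, 1.5, 1.6, 1.7, 1.8]
    control_pair = [0.1, 0.2, 0.3, 0.4, 0.5, 0.6, 0.7, 0.8]
    u, p = rank_sum_test(target_pair, control_pair)
    print(u, p, p < 0.05)
```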


If the values for Target1-CFP/Target2-YFP are significantly greater than those for
Target1-CFP/Control-YFP, there is then a FRET interaction between Target1-CFP and
Target2-YFP. Such a conclusion is justified only when the proper controls are used (see
Experimental Design above). The existence of a FRET interaction implies that the
Target proteins are very close to each other in space, because CFP and YFP must be
separated by <30 Å. However, the FRET experiment alone does not prove a direct
interaction between the Targets.


There are no "rules" for deciding which calculation (Ff/Df, Fc/Df, or FRET2) should be
ultimately adopted, and the most reasonable choice often depends on the specific
experiment. The influence of many aspects of an experiment on the different calculations has
been discussed (Gordon et al. 1998). Ideally, the results do not depend on the
calculation, as shown explicitly in some cases (Gordon et al. 1998). In other cases, one
calculation may be preferable due to the characteristics of those particular experiments; thus,
it is often instructive to compare the results for the various calculations. In general,
when applicable, the simpler calculations involving fewer measurements are preferable.


In addition to the controls built into the original experiment, the demonstration that an
observed FRET signal is maintained upon swapping fluorophores between the target
proteins further substantiates the result. In other words, if Target1-CFP and Target2-YFP
yield a FRET interaction, so should Target2-CFP and Target1-YFP in most cases.
Different control proteins may be required for this experiment if Target1 and Target2
differ in expression level.


One of the goals of proteomics is the analysis of large multiprotein complexes (e.g., see
Damelin and Silver 2002). The FRET protocol is quite useful in this regard, because the
noninteracting proteins within the complex can serve as Control proteins. In such a
study, as many proteins as possible in a protein complex are tagged with CFP and YFP.
The FRET assay is performed on the double-labeled strains, and as in this protocol, only
strains with the same CFP fusion are compared to one another. The signals from certain
strains may significantly exceed the signals from other strains in the panel, thus
constituting FRET interactions. Implicit in this setup is the idea that the identities of the
Target2 and Control proteins for each Target-CFP fusion may not be known
beforehand, but will be revealed during the course of the experiment. A given protein may
serve as Target2 in one case and as a Control in another. Moreover, in some cases, it is
possible that more than one FRET interaction will be observed for one Target-CFP
fusion.
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Monitoring Protein-Protein Interactions 
with GFP Chimeras 


THESE PROTOCOLS WERE CONTRIBUTED BY Sang-Hyun Park (Department of Cellular and
Molecular Pharmacology, University of California, San Francisco, California) and Ronald T. 
Raines (Department of Biochemistry, University of Wisconsin, Madison, Wisconsin). 

Green fluorescent protein (GFP) from the jellyfish Aequorea victoria has exceptional
physical and chemical properties besides its spontaneous fluorescence. These properties,
which include high thermal stability and resistance to detergents, organic solvents, and
proteases, endow GFP with enormous potential for biotechnical applications. Since the cDNA of
GFP was cloned (Prasher et al. 1992), a variety of GFP variants have been generated that
broaden the spectrum of its application (Cubitt et al. 1995; Delagrave et al. 1995; Ehrig et al.
1995; Heim et al. 1995; Crameri et al. 1996; Ward et al. 2000). Among those variants, S65T
GFP is unique in having increased fluorescence intensity and faster fluorophore formation
than that of the wild-type protein, as well as altered excitation and emission spectra (Heim
et al. 1995; Ormö et al. 1996). Moreover, the wavelengths of the excitation and emission
maxima of S65T GFP (490 and 510 nm, respectively) are close to those of fluorescein. The
fluorescein-like spectral characteristics of S65T GFP enable its use with instrumentation that had
been designed specifically for use with fluorescein.

We describe here the use of S65T GFP to probe protein-protein interactions in vitro
(Park and Raines 1997, 2000). This method requires fusing S65T GFP to a target protein (X)
to create a GFP chimera (X~GFP[S65T]). The target protein is fused to the amino terminus
of GFP, rather than to the carboxyl terminus; otherwise, heterogeneity can occur (Park and
Raines 1997, 2000). The interaction of the fusion protein with another protein (Y) can be
analyzed with common instrumentation by two distinct methods.

The first method is a fluorescence gel-retardation assay. The gel-retardation assay has
been used widely to study protein-DNA interactions (Carey 1991). This assay is based on the
electrophoretic mobility of a protein-DNA complex being less than that of either molecule
alone. In a fluorescence gel-retardation assay, electrophoretic mobility is detected by the
fluorescence of S65T GFP. This assay is a rapid method to demonstrate the existence of a
protein-protein interaction and to estimate the equilibrium dissociation constant (Kd) of the
resulting complex.

The second method is a fluorescence polarization assay, which provides an accurate
means to evaluate Kd in a specified homogeneous solution. Fluorescence polarization assays
usually rely on fluorescein as an exogenous fluorophore. S65T GFP can likewise serve in this
role. Furthermore, the fluorescence polarization assay can be adapted for the high-throughput
screening of protein or peptide libraries.
METHOD 1: Fluorescence Gel-retardation Assay


MATERIALS


CAUTION: See Appendix 3 for appropriate handling of materials marked with <!>.


» Reagents
Native polyacrylamide gel (6% w/v) <!>
Tris-HCl buffer (10 mM, pH 7.5) containing glycerol (5% v/v)


» Equipment
Fluorescence imaging system (e.g., FluorImager System, Amersham Biosciences)
Software for quantifying fluorescence (ImageQuant, Amersham Biosciences)
UV/visible spectrophotometer
Vertical gel electrophoresis system


» Biological Molecules
Purified protein of interest (Y)
Purified S65T GFP chimera (X~GFP[S65T]) (CLONTECH)
For details on the construction and purification of the chimera, see Park and Raines (1997, 2000).


METHOD


1. Estimate the concentration of purified X~GFP(S65T) using the extinction coefficient
(ε = 39.2 mM^-1 cm^-1 at 490 nm) of S65T GFP (Heim et al. 1995).


2. To begin the gel-retardation assay, mix 1.0 µM of X~GFP(S65T) with varying amounts of
Y in 10 µl of 10 mM Tris-HCl buffer (pH 7.5) containing 5% glycerol.


3. Incubate the mixtures for 20 minutes at 20°C. Load the mixtures onto a 6% native
polyacrylamide gel, and subject the loaded gel to electrophoresis at 10 V/cm for 30 minutes at
4°C.


4. Immediately after electrophoresis, scan the gel with a fluorescence imaging system at 700
V using a built-in filter set (490 nm for excitation and ≥515 nm for emission).


5. Quantify the fluorescence intensities of bound and free X~GFP(S65T) using appropriate
software, such as ImageQuant.


6. Determine the value of R (= fluorescence intensity of bound X~GFP[S65T]/total
fluorescence intensity) for each gel lane from the measured fluorescence intensities.


7. Calculate the value of Kd for each lane with Equation 1, and average those values.


$$K_d = \frac{1-R}{R}\left([\mathrm{Y}]_{\mathrm{total}} - R\,[\mathrm{X{\sim}GFP}]_{\mathrm{total}}\right) \qquad (1)$$
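
Steps 1 and 2 rest on two small calculations: a Beer-Lambert estimate of the chimera concentration, and a dilution to reach 1.0 µM in the 10-µl binding mixture. The sketch below is illustrative only. The function names, the 1-cm path length, and the example absorbance reading are assumptions, not part of the protocol.

```python
# Extinction coefficient of S65T GFP at 490 nm, as quoted in Step 1.
EPSILON_PER_MM_CM = 39.2  # mM^-1 cm^-1


def chimera_conc_uM(a490, path_cm=1.0, dilution=1.0):
    """Beer-Lambert estimate of the X~GFP(S65T) stock concentration in uM.

    a490     : absorbance of the (possibly diluted) sample at 490 nm
    path_cm  : cuvette path length in cm (assumed to be 1 cm by default)
    dilution : fold dilution applied before the reading
    """
    conc_mM = a490 / (EPSILON_PER_MM_CM * path_cm)
    return conc_mM * 1000.0 * dilution


def stock_volume_ul(stock_uM, final_uM=1.0, final_volume_ul=10.0):
    """Volume of stock for the Step 2 mixture, from C1*V1 = C2*V2."""
    return final_uM * final_volume_ul / stock_uM


# Hypothetical absorbance reading, for illustration only.
stock = chimera_conc_uM(0.392)
print(f"stock: {stock:.2f} uM; add {stock_volume_ul(stock):.2f} ul per 10-ul mixture")
```

If the stock is too concentrated to pipette accurately at this scale, an intermediate dilution in the same Tris-HCl/glycerol buffer keeps the final buffer composition unchanged.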
CASE STUDY: S65T GFP CHIMERAS


To demonstrate the potential of S65T GFP chimeras in the exploration of protein-protein
interactions, we use as a model system the well-characterized interaction of the S-peptide
and S-protein fragments of bovine pancreatic ribonuclease (RNase A; EC 3.1.27.5) (Raines
1998; Raines et al. 2000). (GFP chimeras have also been used to explore the interaction of
[1] the BALF-1 protein and Bax/Bak [Marshall et al. 1999]; [2] CREB and importins
[Forwood et al. 2001]; and [3] a cyclophilin and the capsid protein p24 of HIV-1 [Kiessig
et al. 2001].) Subtilisin treatment of RNase A yields two tightly associated polypeptide
chains: S peptide (residues 1-20) and S protein (residues 21-124) (Richards 1955).
Residues 1-15 of S peptide (S15) are necessary and sufficient to form a complex with S
protein (Potts et al. 1963).

The fluorescence gel-retardation assay is used to quantify the interaction between S
protein and S15. A GFP chimera, S15~GFP(S65T)~His6, is produced by standard recombinant
DNA techniques and purified by Ni2+-affinity chromatography (Park and Raines 1997,
2000). A fixed quantity of S15~GFP(S65T)~His6 is incubated with a varying quantity of S
protein prior to electrophoresis in a native polyacrylamide gel. After electrophoresis, the gel
is scanned with a fluorescence imaging system, and the fluorescence intensities of bound
and free S15~GFP(S65T)~His6 are quantified (Figure 10.31, top). The value of Kd for the
complex formed in the presence of different S-protein concentrations is calculated from
the values of R and the total concentrations of S protein and S15~GFP(S65T)~His6
(Equation 1). The average (±SE) value of Kd is (6 ± 3) x 10^-8 M.

A competition assay is used to probe the specificity of the interaction between
S15~GFP(S65T)~His6 and S protein. S15~GFP(S65T)~His6 and S protein are incubated to
allow for complex formation. Varying amounts of S peptide are added, and the resulting
mixture is incubated further (20 minutes) and then subjected to native gel electrophoresis.
The addition of S peptide converts bound S15~GFP(S65T)~His6 to the free state
(Figure 10.31, bottom). Thus, S15~GFP(S65T)~His6 and S peptide bind to the same region
of S protein.


FIGURE 10.31. Gel-retardation assay of a protein-protein interaction. (Top, lanes 1-9) 1 µM
S15~GFP(S65T)~His6 and 0, 0.2, 0.3, 0.4, 0.5, 0.6, 0.7, 0.8, and 0.9 µM S protein, respectively.
The relative mobilities of free and bound S15~GFP(S65T)~His6 are 0.72 and 0.47,
respectively. The value of R is obtained for each lane (values printed beneath the gel: 0.18,
0.26, 0.40, 0.48, 0.54, 0.62, 0.64, and 0.70), and values of Kd are calculated by using
Equation 1, with the average being Kd = (6 ± 3) x 10^-8 M. (Bottom) Gel-retardation assay
demonstrating that S peptide competes with S15~GFP(S65T)~His6 for interaction with S
protein. (Lane 1) 1 µM S15~GFP(S65T)~His6 and no S protein or S peptide; (lanes 2-6) 1 µM
S15~GFP(S65T)~His6, 1 µM S protein, and 0, 0.3, 1.0, 3.0, and 10 µM S peptide, respectively.
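
The per-lane arithmetic behind the averaged Kd can be sketched as follows, using the R values printed with Figure 10.31. Pairing the eight R values with the eight lanes that contain S protein (0.2 to 0.9 µM) is an assumption made for this illustration, as are the variable and function names. The printed R values are rounded to two decimals, so individual lanes scatter widely and only the mean is meaningful.

```python
from statistics import mean

X_TOTAL_UM = 1.0  # total S15~GFP(S65T)~His6 in every lane, uM

# Assumed pairing of total S-protein concentration (uM) with the R value
# printed beneath each lane of Figure 10.31 (top).
Y_TOTAL_UM = [0.2, 0.3, 0.4, 0.5, 0.6, 0.7, 0.8, 0.9]
R_VALUES = [0.18, 0.26, 0.40, 0.48, 0.54, 0.62, 0.64, 0.70]


def kd_from_lane(r, y_total, x_total=X_TOTAL_UM):
    """Equation 1: Kd = ((1 - R) / R) * ([Y]total - R * [X~GFP]total).

    Concentrations are in uM, so the result is in uM.
    """
    return (1.0 - r) / r * (y_total - r * x_total)


kd_per_lane_uM = [kd_from_lane(r, y) for r, y in zip(R_VALUES, Y_TOTAL_UM)]
kd_mean_uM = mean(kd_per_lane_uM)

for y, r, kd in zip(Y_TOTAL_UM, R_VALUES, kd_per_lane_uM):
    print(f"[S protein] = {y:.1f} uM, R = {r:.2f}, Kd = {kd * 1e-6:.1e} M")
print(f"mean Kd = {kd_mean_uM * 1e-6:.1e} M")
```

Under these assumptions the mean falls in the range of 10^-8 M, in line with the value quoted in the case study.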
METHOD 2: Fluorescence Polarization Assay


The fluorescence gel-retardation assay described above is a convenient method to visualize a
protein-protein interaction as well as to estimate the value of Kd for the resulting complex.
Still, gel-retardation assays have an intrinsic limitation in evaluating equilibrium dissociation
constants. In a gel-retardation assay, it is assumed that a receptor-ligand interaction remains
at equilibrium during sample loading and electrophoresis. Yet, as samples are loaded and
migrate through a gel, complex dissociation is unavoidable and results in an underestimation
of the value of Kd. Moreover, if the conditions (e.g., pH, salt type, and concentration)
encountered during electrophoresis differ from those in the incubation, then the measured value of
Kd may not correspond to the true value.

In a fluorescence polarization assay, the formation of a complex is deduced from an
increase in fluorescence polarization, and the equilibrium dissociation constant is
determined in a homogeneous aqueous environment. Many fluorescence polarization assays have
used fluorescein as the fluorophore (LeTilly and Royer 1993; Heyduk et al. 1996; Malpeli et
al. 1996; Fisher et al. 1998). Like a free fluorescein-labeled ligand, a free S65T GFP chimera is
likely to rotate more rapidly and therefore to have a lower rotational correlation time than
does a bound chimera. An increase in rotational correlation time upon binding results in an
increase in fluorescence polarization, which can be used to assess complex formation
(Jameson and Sawyer 1995). In contrast to a gel-retardation assay, the fluorescence
polarization assay is performed in a homogeneous solution in which the conditions can be dictated
precisely.


MATERIALS


» Reagents
Tris-HCl buffer (20 mM, pH 8.0) containing NaCl (at various concentrations)


» Equipment
Fluorescence spectrometer capable of measuring polarization (e.g., the Beacon fluorescence
polarization system, PanVera, Madison, Wisconsin)
Software capable of nonlinear regression analysis (e.g., DeltaGraph, DeltaPoint, Monterey,
California)


» Biological Molecules
Purified protein of interest (Y)
Purified S65T GFP chimera (X~GFP[S65T]) (CLONTECH)
For details on the construction and purification of the chimera, see Park and Raines (1997, 2000).
METHOD


1. Mix 0.50-1.0 nM X~GFP(S65T) with various concentrations of Y in 1.0 ml of 20 mM
Tris-HCl buffer (pH 8.0), with or without NaCl, at 20°C.


Conditions such as buffer, pH, temperature, and salt can be varied to test their effect on the
binding interaction.


2. Immediately after mixing, make five to seven polarization measurements at each
concentration of Y using a fluorescence spectrometer capable of measuring polarization. For a
blank measurement, use a mixture that contains the same components but lacks
X~GFP(S65T). Fluorescence polarization (P) is defined as:


$$P = \frac{I_{\parallel} - I_{\perp}}{I_{\parallel} + I_{\perp}} \qquad (2)$$


where I_∥ is the intensity of the emission light parallel to the excitation light plane and I_⊥
is the intensity of the emission light perpendicular to the excitation light plane. P, being
a ratio of light intensities, is a dimensionless number and has a maximum value of 0.5.


3. Determine values of Kd by using software, such as DeltaGraph, to fit the data to the
equation:


$$P = \frac{\Delta P \cdot F}{K_d + F} + P_{\min} \qquad (3)$$


where P is the measured polarization, ΔP (= P_max - P_min) is the total change in
polarization, and F is the concentration of free Y. The fraction of bound protein (f_B) is obtained
by using the equation:


$$f_B = \frac{P - P_{\min}}{\Delta P} = \frac{F}{K_d + F} \qquad (4)$$


The familiar binding isotherms are obtained by plotting f_B versus F.
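
The calculations in Steps 2 and 3 can be sketched without commercial software. The code below is a minimal illustration, not the DeltaGraph procedure: it scans trial Kd values on a logarithmic grid and, for each one, solves for ΔP and P_min by ordinary least squares, which works because Equation 3 is linear in those two parameters once Kd is fixed. All function names and the synthetic data are hypothetical. The free concentration F is taken to equal the total concentration of Y, an approximation that is reasonable only when Y is in large excess over the 0.5-1.0 nM chimera.

```python
def polarization(i_par, i_perp):
    """Equation 2: P = (I_par - I_perp) / (I_par + I_perp)."""
    return (i_par - i_perp) / (i_par + i_perp)


def model(f, kd, delta_p, p_min):
    """Equation 3: P = delta_p * F / (Kd + F) + P_min."""
    return delta_p * f / (kd + f) + p_min


def linear_fit(x, y):
    """Least squares for y = a*x + b; returns (a, b, sum of squared residuals)."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxx = sum((xi - mx) ** 2 for xi in x)
    sxy = sum((xi - mx) * (yi - my) for xi, yi in zip(x, y))
    a = sxy / sxx
    b = my - a * mx
    ssr = sum((yi - (a * xi + b)) ** 2 for xi, yi in zip(x, y))
    return a, b, ssr


def fit_kd(free_y, p_obs, kd_lo, kd_hi, steps=2000):
    """Grid search over Kd; delta_p and p_min come from a linear fit at each Kd."""
    best = None
    for k in range(steps + 1):
        kd = kd_lo * (kd_hi / kd_lo) ** (k / steps)
        x = [f / (kd + f) for f in free_y]
        delta_p, p_min, ssr = linear_fit(x, p_obs)
        if best is None or ssr < best[3]:
            best = (kd, delta_p, p_min, ssr)
    return best[:3]


# Synthetic, noise-free data in nM; the "true" parameters are made up.
true_kd, true_dp, true_pmin = 11.0, 0.10, 0.35
free_y = [1, 3, 10, 30, 100, 300, 1000, 20000]
p_obs = [model(f, true_kd, true_dp, true_pmin) for f in free_y]

kd_fit, dp_fit, pmin_fit = fit_kd(free_y, p_obs, kd_lo=0.1, kd_hi=1000.0)
fraction_bound = [(p - pmin_fit) / dp_fit for p in p_obs]  # Equation 4
print(f"Kd ~ {kd_fit:.1f} nM, delta P ~ {dp_fit:.3f}, P_min ~ {pmin_fit:.3f}")
```

With real data, each P value would be the mean of the five to seven blank-corrected readings from Step 2, and a dedicated nonlinear regression package would also supply parameter uncertainties, which this grid search does not.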
CASE STUDY: FLUORESCENCE POLARIZATION


Fluorescence polarization is used to determine the effect of salt concentration on the
formation of a complex between S protein and S15~GFP(S65T)~His6. The value of Kd
increases fourfold when NaCl is added to a final concentration of 0.10 M (see Figure
10.32). A similar salt dependence for the dissociation of RNase S had been observed
previously (Schreier and Baldwin 1977). The added salt is likely to disturb the water molecules
hydrating the hydrophobic patch in the complex between S peptide and S protein,
resulting in a decrease in the binding affinity (Baldwin 1996). Finally, the value of Kd = 4.2 x 10^-8
M observed in 20 mM Tris-HCl buffer (pH 8.0) containing NaCl (0.10 M) is similar to (i.e.,
threefold lower than) that obtained by titration calorimetry in 50 mM sodium acetate buffer
(pH 6.0) containing NaCl (0.10 M) (Connelly et al. 1990).


FIGURE 10.32. Fluorescence polarization assay of a protein-protein interaction. S protein (20 µM to 1.0
nM) is added to 20 mM Tris-HCl buffer (pH 8.0) containing S15~GFP(S65T)~His6 (0.50 nM) at 20°C in a
volume of 1.0 ml. Each data point is an average of 5-7 measurements. Binding isotherms are obtained by
fitting the data to Equations 3 and 4. The values of Kd in the presence of 0 and 0.10 M NaCl are 1.1 x 10^-8 M
and 4.2 x 10^-8 M, respectively. (Courtesy of Sang-Hyun Park, University of California, San Francisco, and
Ronald T. Raines, University of Wisconsin, Madison.)


PROTOCOL 10


Protein Arrays: Preparation of Microscope Slides


THIS PROTOCOL WAS CONTRIBUTED BY Gavin MacBeath (Harvard University, Cambridge,
Massachusetts).

Many vendors supply slides that can be used for the preparation of protein microarrays.
Typically, these slides come with protocols for arraying, blocking, and processing. The
following protocols are for two particularly useful substrates.


MATERIALS


CAUTION: See Appendix 3 for appropriate handling of materials marked with <!>.


» Reagents
3-Aminopropyltriethoxysilane (United Chemical Technologies, Inc., Bristol, Pennsylvania) <!>
Bovine serum albumin (BSA) (1% w/v) (Fraction V, minimum 98% pure)
N,N'-Disuccinimidyl carbonate
N,N-Diisopropylethylamine (stock is 5.73 M) <!>
N,N-Dimethylformamide (DMF), anhydrous <!>
Ethanol (absolute) <!>
Ethanol (95%)/5% H2O solution, adjusted to pH 4.5-5.5 with acetic acid <!>
HCl (1% v/v) aqueous solution <!>
Helium gas <!>
Nitrogen gas <!>
Phosphate-buffered saline (PBS) (400 ml, pH 7.5)
Sodium hydroxide (10% w/v), aqueous solution <!>
3-Triethoxysilylbutanal (United Chemical Technologies, Inc.)


» Equipment
Centrifuge with swinging bucket rotor and microplate carriers
Microscope aminopropyl-modified slides (Sigma-Aldrich, Corning, TeleChem International)
Microscope glass slides (VWRbrand Micro Slides-plain, VWR Scientific Products)
Slide rack, stainless steel (Wheaton Science Products, Millville, New Jersey)
Staining dish, glass rectangular (Wheaton Science Products)
Vacuum desiccator
METHOD


Fabrication of Aldehyde Slides


Aldehyde slides are glass microscope slides that display an aldehyde functionality on their
surface (see Figure 10.7). They may be purchased from commercial vendors such as
TeleChem International, Inc. (Sunnyvale, California) under the trade name SuperAldehyde
Substrates or fabricated more cost-effectively as described below.


1. Clean plain glass slides by immersing them in an aqueous solution of 10% NaOH (w/v) 
for 12 hours at room temperature. 


EXPERIMENTAL TIP: Up to 30 slides can be processed conveniently using a 30-slide stainless 
steel rack and glass rectangular staining dish. 


2. Rinse the slides in H2O, then in an aqueous solution of 1% HCl (v/v), and finally twice in
H2O.


3. Dry the slides by spinning the racks containing the slides in a centrifuge equipped with a
swinging bucket rotor and microplate carriers. Centrifuge at 200g for 30 seconds at room
temperature.


Drying by centrifugation prevents deposition of spots on the slides. 


4. Adjust a 95% ethanol/5% H2O solution to pH 4.5-5.5 with acetic acid (~0.1% acetic
acid).


5. Add 3-triethoxysilylbutanal while stirring to yield a 2% solution of silane (v/v).


6. Stir for 3 minutes at room temperature to allow for hydrolysis and silanol formation and
then immerse the slides in the solution. Incubate them for an additional 3 minutes at
room temperature.


7. Rinse the slides briefly in absolute ethanol and centrifuge them to dry. Cure them under
vacuum for 10 minutes at 110°C or for 24 hours at room temperature.


8. Wash the slides three times for 5 minutes each with 95% ethanol/5% H2O and dry them
by centrifugation. Store the slides at room temperature in a vacuum desiccator or under
inert gas.


BSA-NHS Slides


BSA-NHS slides are glass microscope slides that contain a layer of bovine serum albumin
(BSA) covalently attached to their surface. In addition, the BSA molecules are chemically
activated for covalent attachment of proteins to their surface (see Figure 10.8). BSA-NHS slides
are fabricated from aminopropyl-modified slides. These slides can be purchased from a
number of vendors (including Sigma-Aldrich, Corning, and TeleChem International) or
fabricated from plain glass slides. If purchasing aminopropyl-modified slides, skip to Step 5.


1. Clean plain glass slides as described in the previous section in Steps 1-3.


2. Stir a solution of 95% ethanol/5% H2O and add 3-aminopropyltriethoxysilane to yield a
3% solution of silane (v/v). (Note: Do not add acetic acid.) Stir for 10 minutes at room
temperature to allow for hydrolysis and silanol formation, and then immerse the slides in
the solution. Incubate them at room temperature for an additional 10 minutes.


3. Rinse the slides briefly in absolute ethanol and centrifuge them to dry. Cure them under
vacuum for 10 minutes at 110°C or for 24 hours at room temperature.


4. Wash the slides three times for 5 minutes each with 95% ethanol/5% H2O and dry them
by centrifugation. Store the slides at room temperature (preferably with a desiccant).


5. Dissolve 10.24 g of N,N'-disuccinimidyl carbonate and 6.96 ml of
N,N-diisopropylethylamine in 400 ml of anhydrous DMF.


Final concentrations of N,N'-disuccinimidyl carbonate and N,N-diisopropylethylamine are
100 mM each.


6. Immerse 30 aminopropyl-modified slides (from Step 4 or purchased commercially) in
the disuccinimidyl carbonate/diisopropylethylamine solution for 3 hours at room
temperature.


The solution may turn yellow during the 3-hour incubation.


7. Rinse the slides twice with 95% ethanol/5% H2O.


8. Immerse the slides in 400 ml of PBS (pH 7.5) containing 1% BSA (w/v) for 12 hours at
room temperature.


9. Rinse the slides twice with H2O, twice with 95% ethanol/5% H2O, and then dry them by
centrifugation.


10. Dissolve 10.24 g of N,N'-disuccinimidyl carbonate and 6.96 ml of
N,N-diisopropylethylamine in 400 ml of anhydrous DMF (final concentration 100 mM each).
Immerse the slides in this solution for 3 hours at room temperature.


11. Rinse the slides three times with 95% ethanol/5% H2O and dry by centrifugation.


The slides are now ready for use or can be stored in a desiccator under vacuum for up to 3
months at room temperature without noticeable loss of activity.
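
The stated 100 mM final concentrations of the activation solution (10.24 g of N,N'-disuccinimidyl carbonate and 6.96 ml of N,N-diisopropylethylamine in 400 ml of DMF) can be checked with a short calculation, sketched below. The molecular weight of 256.17 g/mol for N,N'-disuccinimidyl carbonate (C9H8N2O7) is supplied here rather than taken from the protocol, and the final volume is approximated as 400 ml, ignoring the small volume contributed by the reagents. The helper names are hypothetical.

```python
DSC_MW = 256.17        # g/mol, N,N'-disuccinimidyl carbonate (assumed value)
DIPEA_STOCK_M = 5.73   # mol/l, stock concentration given in the Reagents list
FINAL_VOLUME_ML = 400.0


def molarity_from_mass(mass_g, mw_g_per_mol, volume_ml):
    """Molar concentration (mol/l) of a weighed solid dissolved in volume_ml."""
    return mass_g / mw_g_per_mol / (volume_ml / 1000.0)


def molarity_from_stock(stock_molar, stock_ml, final_ml):
    """Molar concentration (mol/l) after diluting stock_ml of stock to final_ml."""
    return stock_molar * stock_ml / final_ml


dsc_mM = 1000.0 * molarity_from_mass(10.24, DSC_MW, FINAL_VOLUME_ML)
dipea_mM = 1000.0 * molarity_from_stock(DIPEA_STOCK_M, 6.96, FINAL_VOLUME_ML)
print(f"disuccinimidyl carbonate: {dsc_mM:.1f} mM; diisopropylethylamine: {dipea_mM:.1f} mM")
```

The same two helpers can be used to scale the recipe for a smaller staining dish while keeping both components at 100 mM.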
PROTOCOL 11


Protein Arrays: Labeling the Protein and Probing the Array
for Protein-Protein Interactions


THIS PROTOCOL WAS CONTRIBUTED BY Gavin MacBeath (Harvard University).

To detect protein-protein interactions, a protein microarray can be incubated with a
fluorescently labeled protein and the array washed; stable interactions can then be identified by
scanning the slide for fluorescent spots (see Figure 10.9). The probe proteins can be labeled
with any one of several different commercially available dyes. The method in this protocol uses
cyanine-3 (Cy3) or cyanine-5 (Cy5) monofunctionally activated dyes. As with arraying, care
must be taken to remove nucleophilic molecules from the protein solution prior to labeling.
Common nucleophilic buffer components include Tris buffer, ethanolamine, glycine,
dithiothreitol, β-mercaptoethanol, glutathione, and imidazole.


MATERIALS


» Reagents
Cy3 or Cy5 monofunctionally activated dyes (Amersham Biosciences)
These dyes come lyophilized in screw-cap tubes with enough dye to label the equivalent of 1 mg of
antibody (IgG).
Phosphate-buffered saline (PBS) (pH 7.5)
PBS containing 500 mM glycine (pH 8.0)
PBST (PBS containing 0.1% v/v Tween-20)
Probing buffer
See note to Step 5.


» Equipment
Affinity chromatography equipment
See Step 4.
Centrifuge
Dialysis equipment
See Step 4.
Fluorescence slide scanner (e.g., ArrayWoRx fluorescence slide scanner, Applied Precision,
Issaquah, Washington)
Humidity chamber
Incubation chambers (CoverWell, Grace Biolabs, Bend, Oregon)
Use these incubation chambers for processing only one array per slide (see Step 6). The CoverWell
PC500 accommodates a volume of ~500 µl.
Size-exclusion chromatography equipment
See Step 4.
Slide rotor
See Step 6.


» Biological Molecules
Proteins of interest to be fluorescently labeled
Slides printed with protein arrays
For details on printing protein arrays, see the section of the chapter introduction on Fabricating
Protein Arrays. For a summary of highlights, see the panel on GUIDELINES FOR PREPARING PROTEIN
ARRAYS below.


GUIDELINES FOR PREPARING PROTEIN ARRAYS

• Prepared proteins should be at a concentration of at least 100 µg/ml (20 µM for
peptides), in a buffer with a pH of ≥7, and with 40% glycerol included in the buffer.

• Unacceptable buffers include Tris and ethanolamine.

• Acceptable buffers include MES, PIPES, phosphate, MOPS, HEPES, and borate.

• Avoid the following common additives: dithiothreitol, β-mercaptoethanol, imidazole,
glutathione, glycine, aprotinin, and leupeptin.

• Acceptable common additives are TCEP, inorganic ions (Na+, K+, Ca2+, Mg2+, Mn2+, NH4+,
Cl-, SO4 2-), acetate, sodium azide, EDTA, EGTA, PMSF, and AEBSF.

• For most purposes, use a solution of 40% glycerol/60% PBS (pH 7.5).

• If relatively few proteins are being arrayed, print several arrays per slide (up to 16). While
the slide is still dry (after printing but before quenching), draw a grid on the slides with
a hydrophobic pen (e.g., PAP pen Liquid Blocker from Newcomer Supply, Middleton,
Wisconsin) to physically separate the arrays from each other.

• To avoid smearing the arrayed droplets during quenching, see the panel on ADDITIONAL
PROTOCOL: PREVENTING THE FORMATION OF COMETS at the end of this protocol.


METHOD 


Labeling the Protein 


1. Dissolve the dye in 250 µl of H2O, and add 5 µl of dye solution for every 10 µg of protein
to be labeled. Aim to maintain a protein concentration of at least 100 µg/ml.
Under these conditions, the extent of labeling is relatively independent of protein
concentration. Although the manufacturer recommends labeling at pH 9.3, we routinely label our
proteins in PBS at pH 7.5. Low pH favors conjugation of the dye to the α-amine of the protein,
whereas high pH results in relatively unselective conjugation to lysine residues.


2. Incubate the dye and protein mixture for 30 minutes at room temperature to allow the 
dye to react with the protein molecules. 


3. Quench the reaction by adding 0.1x volume of PBS containing 500 mM glycine (pH 8.0) 
and incubate the tube for an additional 30 minutes at room temperature. 
4. Remove unincorporated dye either by size-exclusion chromatography or by dialysis.


If the protein contains an affinity tag (such as a His6-tag or GST-tag), the unincorporated dye
is most conveniently removed by affinity chromatography using that tag.


Probing the Array


5. Prepare probing solution by combining probing buffer and fluorescently labeled protein.


The composition of the probing buffer depends on the labeled protein. In general, the buffer
should contain salt, detergent, and BSA to minimize nonspecific binding. In addition, the
buffer may contain reductants, metal ions, or other cofactors as required. As a starting point,
we recommend PBS (pH 7.5) containing 0.1% Tween-20 (v/v) and 1% BSA (w/v). The optimal
concentration of labeled protein varies from experiment to experiment. Begin with 1 µg/ml,
which corresponds to 20 nM for a 50-kD protein.


The volume of buffer to prepare depends on the experiment. Workable protein
concentrations range from 1 nM to 1 µM.


6. Probe the protein array with the fluorescently labeled protein.

Processing single-array slides:

a. Lay the CoverWell slide incubation chamber on the bench with the gasket side up.

b. Apply 550 µl of probing solution to the CoverWell.

c. Slowly lower the microarray slide, array side down, onto the CoverWell.

d. Invert the slide and press the slide gently but firmly to seal the CoverWell to the slide.

If done correctly, the solution should spread over the surface of the slide without
trapping air bubbles.


Processing multi-array slides:

a. With the slides right-side up, apply 10-30 µl of labeled protein to each array. Do not
cover the arrays with coverslips or a CoverWell.

b. Place the slides in a humid chamber.


7. Incubate the slides for 1 hour at room temperature.


8. Rinse the slides briefly with PBST, and wash them with PBST three times for 2-3 minutes
each.


9. Rinse the slides briefly with PBS (without Tween-20), and centrifuge the slides in a slide
rotor at 200g for 30 seconds to remove excess buffer.


The slides can be dried under a stream of nitrogen gas if an appropriate rotor is not
available.


10. Image the slides with a fluorescence slide scanner (see Reading and Interpreting the
Arrays).
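
The unit conversions used in this protocol (5 µl of dye solution per 10 µg of protein in Step 1, and 1 µg/ml corresponding to 20 nM for a 50-kD protein in the note to Step 5) can be sketched as below. The function names are hypothetical, and the 50-kD mass is only the example given in the protocol; substitute the mass of the probe protein actually used.

```python
def ug_per_ml_to_nM(conc_ug_per_ml, mass_kD):
    """Convert a mass concentration to nM.

    ug/ml divided by the molecular mass in kD gives uM; x1000 gives nM.
    """
    return conc_ug_per_ml / mass_kD * 1000.0


def nM_to_ug_per_ml(conc_nM, mass_kD):
    """Inverse conversion: nM to ug/ml for a protein of mass_kD."""
    return conc_nM * mass_kD / 1000.0


def dye_volume_ul(protein_ug):
    """Step 1: 5 ul of dissolved dye for every 10 ug of protein."""
    return 5.0 * protein_ug / 10.0


mass_kD = 50.0  # example mass from the note to Step 5
print(f"1 ug/ml = {ug_per_ml_to_nM(1.0, mass_kD):.0f} nM")
print(f"workable range (1 nM to 1 uM) = "
      f"{nM_to_ug_per_ml(1.0, mass_kD):.2f} to {nM_to_ug_per_ml(1000.0, mass_kD):.0f} ug/ml")
print(f"dye for 100 ug of protein: {dye_volume_ul(100.0):.0f} ul")
```

Because one tube of dye is dissolved in 250 µl, this ratio implies that a single tube covers roughly 500 µg of protein under the conditions of Step 1.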
ADDITIONAL PROTOCOL: PREVENTING THE FORMATION OF COMETS


Following the attachment of proteins to activated slides, if the arrays are quenched either by
pouring quenching solution over the slides or by plunging the slides into the quenching solution, the
nanodroplets smear across the surface of the slide and the resulting spots look like comets. Smearing
is particularly problematic with aldehyde slides because the reaction between the proteins and the
slides is so fast. The following protocol (provided by Gavin MacBeath, Harvard University) is
designed to prevent comet formation.


Additional Materials 
CAUTION: See Appendix 3 for appropriate handling of materials marked with <!>. 


Quenching solutions 
For aldehyde slides: PBS (pH 7.5) containing 1% BSA 
For BSA-NHS slides: PBS containing 500 mM glycine (pH 8.0) <!> 


Method 


1. Hold the slide upside down and carefully drop the slide onto the quenching solution from 
about a centimeter above the solution. The slide will float on the quenching solution due to sur- 
face tension. 


2. Wait ~10 seconds and then turn the slide right-side up and immerse it in the quenching solu- 
tion. 

3. Shake the slide in quenching solution for at least 1 hour at room temperature or for at least 4
hours at 4°C.


4. Arrays can be stored for several weeks at 4°C by immersing them in PBS containing 1% BSA
and 0.02% NaN3 <!>. For longer-term storage, add 50% glycerol and store at -20°C.


PROTOCOL 12


Protein Arrays: Labeling the Compounds and Probing the
Array for Protein-Small Molecule Interactions


THIS PROTOCOL WAS CONTRIBUTED BY Gavin MacBeath (Harvard University).

To detect protein interactions with small molecules, a protein microarray is incubated
with a fluorescently labeled compound, the array is washed, and stable interactions are
identified by scanning the slide for fluorescence (see Figure 10.10). It is often advantageous to
label the small molecules indirectly, by linking them covalently to a previously labeled
carrier molecule, such as BSA. As with the labeling of probe proteins (Protocol 11), BSA can be
labeled with any one of several commercially available dyes. This protocol describes the
labeling of BSA with Cy3 or Cy5, the linkage of the small molecule to the labeled BSA, and the
probing of protein microarrays with the fluorescently labeled conjugate.


MATERIALS


CAUTION: See Appendix 3 for appropriate handling of materials marked with <!>.


» Reagents
Cy3 and Cy5 monofunctionally activated dyes (Amersham Biosciences)
Disuccinimidyl glutarate (DSG)
N,N'-Disuccinimidyl carbonate
N-[γ-maleimidobutyryloxy]sulfosuccinimide ester (Sulfo-GMBS, Pierce Biotechnology)
Phosphate-buffered saline (PBS) (pH 7.5)
PBS containing 500 mM glycine (pH 8.0) <!>
PBST supplemented with 1% bovine serum albumin (BSA) (w/v)


» Equipment
Fluorescence slide scanner (e.g., ArrayWoRx fluorescence slide scanner, Applied Precision,
Issaquah, Washington)
Gel-filtration equipment (e.g., NAP-10 column, Amersham Biosciences)
Humidity chamber
See Step 7.
Incubation chambers (CoverWell, Grace Biolabs, Bend, Oregon)
Use these incubation chambers for processing only one array per slide (see Step 7). The CoverWell
PC500 accommodates a volume of ~500 µl.


» Biological Molecules
Bovine serum albumin (BSA) to be fluorescently labeled
Small molecules of interest
Slides printed with protein arrays
For details on printing protein arrays, see the section of the chapter introduction on Fabricating
Protein Arrays. For a summary of highlights, see the panel on GUIDELINES FOR PREPARING PROTEIN
ARRAYS in Protocol 11.


METHOD 


Labeling the BSA 


1. Dissolve 1 mg of BSA in 0.5 ml of PBS (pH 7.5). Add this solution to one tube of cyanine 
monofunctionally activated dye (either Cy3 or Cy5). 


2. Incubate the BSA and dye mixture for 30 minutes at room temperature. 


3. Quench the reaction by adding 100 µl of PBS containing 500 mM glycine (pH 8.0) and
incubate for an additional 30 minutes at room temperature.


4. Remove the unincorporated dye by gel filtration, using PBS (pH 7.5) as the elution buffer. 


Coupling Small Molecules to the BSA 


5. An appropriate strategy for coupling a small molecule to labeled BSA depends on which 
functional groups are available on the small molecule. 


It is beyond the scope of this protocol to provide detailed methods on how to deal with every
possible situation. For an excellent resource for information and protocols relating to
bioconjugate chemistry, see Hermanson (1996). The following recommendations have been
provided to serve as general guidelines. Specific details will vary depending on the nature of
the molecules under investigation.


If linking through a carboxylate on the small molecule:


a. Activate the carboxylate as an N-hydroxysuccinimidyl ester using
N,N'-disuccinimidyl carbonate under basic conditions.


b. Mix the activated compound with the labeled BSA in PBS (pH 7.5). Incubate for 1
hour at room temperature.


The appropriate ratio of small molecule to BSA varies depending on the compound, but
a good starting point is to use a 10:1 molar excess of small molecule.


c. Separate the conjugate from the free compound by gel filtration.


If linking through an amine group on the small molecule:


a. Activate the small molecule with at least a tenfold molar excess of DSG.


b. Separate the activated compound from the unreacted DSG and dimeric compound.


c. Mix the activated compound with the labeled BSA in PBS (pH 7.5) for 1 hour at room
temperature.


The appropriate ratio of small molecule to BSA varies depending on the compound, but
a good starting point is to use a 10:1 molar excess of small molecule.


d. Purify the conjugate by gel filtration.
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If linking through a thiol group on the small molecule: 


a. Activate the labeled BSA by treating it with sulfo-GMBS. 


Dissolve sulfo-GMBS in PBS (pH 7.5) at a concentration of 10 mg/ml. Add 50 equiva- 
lents of sulfo-GMBS to an ~1 mg/m] solution of cyanine-labeled BSA in PBS (pH 7.5). 
For example, add 29 ul of a 10 mg/ml solution of sulfo-GMBS to | ml of a 1 mg/ml 
solution of cyaninc-labeled BSA. Incubate for ] hour at room temperature. 


b. Quench the reaction by adding a 1/10 volume of PBS containing 500 mM glycine (pH 
8.0) and incubate for 30 minutes at room temperature. 


c. Remove unincorporated reagent by gel filtration. 


d. Add 100-fold molar excess of the thiol-containing small molecule to the cyanine- 
BSA-maleimide conjugate in PBS (pH 7.5). Incubate the reaction for 1 hour at room 
temperature. 


e. Purify the final conjugate by gel filtration. 


Dilute the doubly labeled BSA conjugates with PBST supplemented with 1% (w/v) BSA. 


We recommend starting with a concentration of 10 µg/ml, but higher and lower concentrations
(between 0.1 µg/ml and 100 µg/ml) should also be explored.
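
One simple way to explore the recommended range is a tenfold dilution series. The following sketch lists the concentrations and, for a single dilution, the volume of stock needed; the 1000 µg/ml stock concentration and the 550 µl final volume used in the example are assumptions chosen for illustration, and the function names are hypothetical.

```python
def tenfold_series(low, high):
    """Tenfold concentration steps from low to high (inclusive), in the same units."""
    series = []
    conc = low
    while conc <= high * (1 + 1e-9):   # small tolerance for floating-point rounding
        series.append(round(conc, 6))
        conc *= 10
    return series


def stock_volume_ul(final_conc, final_volume_ul, stock_conc):
    """Stock volume from C1 * V1 = C2 * V2 (concentrations in the same units)."""
    return final_conc * final_volume_ul / stock_conc


print(tenfold_series(0.1, 100))          # concentrations in ug/ml
print(stock_volume_ul(10, 550, 1000))    # ul of an assumed 1000 ug/ml stock
```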


Probe the protein array with the diluted doubly labeled BSA conjugates. 
Processing single-array slides: 

a. Lay the CoverWell on the bench with the gasket side up. 

b. Apply 550 µl of probing solution to the CoverWell.

c. Slowly lower the microarray slide, array side down, onto the CoverWell. 


d. Invert the slide and press it gently but firmly to seal the CoverWell to the slide. 


If done correctly, the solution should spread over the surface of the slide without trapping
air bubbles.


Processing multi-array slides: 


a. With the slides right-side up, apply 10-30 µl of labeled protein to each array. Do not
cover the arrays with coverslips or a CoverWell. 


b. Place the slides in a humid chamber. 


Incubate the slides for 1 hour at room temperature.


Rinse the slides with PBST, and then wash them with PBST three times for 2-3 minutes
each.


Rinse the slides with PBS (without Tween-20), and centrifuge the slides in a slide rotor at 
200g for 30 seconds to remove excess buffer. 


Image the slides with a fluorescence slide scanner (see Reading and Interpreting the Arrays).


The slides can be dried under a stream of nitrogen gas if an appropriate rotor is not available. 
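
Because the centrifugation step is specified as a relative centrifugal force (200g), the rotor speed must be derived from the rotor radius. The sketch below uses the standard relation RCF = 1.118 x 10^-5 x r(cm) x rpm^2; the 10-cm radius is an assumed example value, not a property of any particular slide rotor, and the function names are hypothetical.

```python
import math

RCF_CONSTANT = 1.118e-5  # standard factor for radius in cm and speed in rpm


def rpm_for_rcf(rcf, radius_cm):
    """Rotor speed (rpm) that produces the requested relative centrifugal force."""
    return math.sqrt(rcf / (RCF_CONSTANT * radius_cm))


def rcf_for_rpm(rpm, radius_cm):
    """Relative centrifugal force (x g) at a given rotor speed and radius."""
    return RCF_CONSTANT * radius_cm * rpm ** 2


# 200g with an assumed 10-cm rotor radius
print(round(rpm_for_rcf(200, 10.0)))
```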
Protein Arrays: Probing the Array and Detecting
Radioactive Spots for Kinase-Substrate Interactions


THIS PROTOCOL WAS CONTRIBUTED BY Gavin MacBeath (Harvard University).

Protein arrays can be probed for potential substrates of protein kinases. To identify
substrates of a kinase of interest, a protein microarray is incubated with that kinase and
[γ-?P]ATP (see Figure 10.11). Following an appropriate incubation, the array is washed,
coated with a photographic emulsion, and incubated further; the emulsion is developed, and
then imaged with a scanning light microscope.


MATERIALS


CAUTION: See Appendix 3 for appropriate handling of materials marked with <!>.


» Reagents

ATP
Dektol developer (Eastman Kodak Company) <!>
Desiccant
[γ-?P]ATP (20 µCi) <!>
Kinase buffer

50 mM Tris-Cl (pH 7.5)

10 mM MgCl2 <!>

1 mM dithiothreitol (DTT) <!>
Kodak fixer, photographic (Eastman Kodak Company) <!>
NTB-2 autoradiography emulsion (Eastman Kodak Company)
Wash buffer

20 mM Tris-Cl (pH 7.5)

150 mM NaCl

10 mM EDTA

1 mM EGTA

0.1% Triton X-100 <!>


» Equipment

β-Radiation box

Centrifuge

Incubation chamber (PC200 CoverWell, Grace Biolabs, Bend, Oregon)
Scanning light microscope


» Biological Molecules

Appropriate kinase(s)
Proteins of interest
Slides printed with protein arrays


METHOD 


For details on printing protein arrays, see the section of the chapter introduction on Fabricating
Protein Arrays. For a summary of highlights, see the panel on GUIDELINES FOR PREPARING PROTEIN
ARRAYS in Protocol 11.
1. Wash the arrayed slides with wash buffer three times for 10 minutes each.

2. Wash the slides once with kinase buffer for 10 minutes.

3. Incubate the slides with kinase buffer containing 100 µM ATP for 10 minutes.

4. Wash the slides with kinase buffer for an additional 10 minutes.

5. Dilute each kinase into 200 µl of kinase buffer containing 100 µM ATP and 20 µCi
[γ-?P]ATP.

6. Apply the kinase solution to each array using a PC200 CoverWell incubation chamber
(see Protocol 11, Step 6) and incubate the slides for 1 hour at room temperature.

7. Wash the slides with wash buffer six times for 5 minutes each.

8. Wash the slides with wash buffer lacking Triton X-100 twice for 5 minutes each.

9. Wash the slides with H2O three times for 3 minutes each.

10. Centrifuge the slides at 200g for 30 seconds to remove excess water.


The slides can be dried under a stream of nitrogen if an appropriate rotor is not available.


Detecting Radioactive Spots 


11. To visualize the radioactive decay, melt NTB-2 autoradiography emulsion for 45 minutes
at 45°C in a dark room.

12. Working in the dark, dip the slides in the emulsion for 3 seconds and allow them to dry
vertically for 4 hours at room temperature.


The use of a slide-dipping chamber simplifies this step.


13. Seal the slides in a β-radiation box with a desiccant and incubate in the dark for 4-10 days
at 4°C.


Bring the slides to room temperature before developing because the emulsion has a tendency to
bubble and/or slide off if the box is opened while the slides are cold.


EXPERIMENTAL TIP: To avoid contaminating the slides with dust from the desiccant, place
the desiccant in a permeable container or wrap it in several Kimwipes.


14. To develop the slides, immerse them in:

Dektol developer for 2 minutes
H2O for 10 seconds
fixer for 5 minutes
H2O for 5 minutes

15. Detect the radioactive spots with a scanning light microscope.


PROTOCOL 14 
Protein Arrays: Sandwich Approach for Analyzing
Complex Solutions


THIS PROTOCOL WAS CONTRIBUTED BY Gavin MacBeath (Harvard University).

Protein arrays can be used to quantitate the abundance and modification states of pro- 
teins in complex mixtures. For this application, it is necessary to array capture reagents, such 
as antibodies, on the slides and then use these molecules to specifically capture their cognate 
antigens from solution. The investigator has a choice in either labeling the proteins that are 
under study (see Figure 10.12) or detecting the captured proteins by an indirect labeling 
method. In the latter case, which is detailed in this protocol, each protein is captured by one 
antibody (or other capture reagent) and then detected in a second step with a second reagent 
(see Figure 10.13). The second reagent, which in this case is an antibody, recognizes the pro- 
tein at a site that does not overlap with the recognition site of the capture reagent. In this 
sandwich approach, the second reagent is labeled and so provides the signal. The advantage 
of this approach is that it does not require labeling the proteins themselves. Moreover,
additional specificity is gained by using two different reagents for each protein. The
disadvantage, however, is that it is more difficult to assemble matched pairs of antibodies
for each protein of interest.


MATERIALS 


CAUTION: See Appendix 3 for appropriate handling of materials marked with <!>. 


» Reagents
Lysis buffer
20 mM Tris-Cl (pH 7.5)
150 mM NaCl
1% BSA (w/v)
1 mM EDTA
1 mM EGTA
1% Triton X-100 <!>
0.5% Nonidet P-40
1 mM β-glycerolphosphate
1 mM sodium orthovanadate (Na3VO4) <!>
1 mM PMSF <!>
1 µg/ml leupeptin <!>
1 µg/ml pepstatin <!>
Phosphate-buffered saline (PBS) (pH 7.5) 
PBS containing 40% glycerol (v/v) 
PBS containing 1% (w/v) BSA

PBS containing 500 mM glycine (pH 8.0) <!>

PBS containing 1% BSA (w/v) and 0.02% (w/v) NaN3 <!>
PBS containing 0.1% Tween (PBST) 

PBST containing 1% BSA 


» Equipment


Affinity chromatography equipment with protein G or protein A beads 
BSA-NHS slides 


Prepare as in Protocol 10. 

Centrifuge 

Fluorescence slide scanner (e.g., ArrayWoRx fluorescence slide scanner, Applied Precision, 
Issaquah, Washington) 


Incubation chambers (CoverWell, Grace Biolabs, Bend, Oregon) 
CoverWell models accommodate the following volumes:
PC200: 200 µl
PC500: 550 µl


Sonicator 


» Biological Molecules
Capture antibodies 


Detection cocktail 
This cocktail consists of each fluorescently labeled detection antibody present at a concentration of 
100-500 ng/ml in PBST containing 1% BSA.


» Cells 


Mammalian cells 


METHOD 


Printing Antibody Arrays 


For additional details on printing protein arrays, see Fabricating Protein Arrays in the
introduction to this chapter. For a summary of highlights, see the panel on GUIDELINES FOR
PREPARING PROTEIN ARRAYS in Protocol 11.

1. Prepare the capture antibodies at a concentration of 0.5 mg/ml in PBS (pH 7.5) contain- 
ing 40% glycerol (v/v). The buffer must be free of nucleophilic components (such as 
Tris). 

Often, antibodies come formulated with BSA, which will compete for binding to the slide. BSA
can be removed by affinity chromatography using protein G or protein A beads.


2. Allow the antibodies to bind to the slides by incubating the slides together for 2 hours at 
room temperature after arraying the antibodies. 


3. Quench the slides with PBS containing 500 mM glycine (pH 8.0) for 1 hour at room
temperature.

4. Block the slides for an additional 30-60 minutes with PBS containing 1% BSA (w/v)
immediately before use. Alternatively, store the arrays at 4°C immersed in PBS supplemented
with 1% BSA (w/v) and 0.02% NaN3 (w/v).


Antibody arrays remain stable for 1-2 months under these conditions. 


Processing Antibody Arrays 


5. Resuspend mammalian cells on ice at a concentration of 10° cells/ml in lysis buffer.

6. Lyse the cells by sonication and centrifuge them at 15,000-20,000g for 10 minutes at 4°C
to remove insoluble material.

7. Wash the antibody-coated slides (from Step 4) twice with PBS.

8. Apply lysate to the slides and incubate them in a humid chamber for 2-3 hours at room
temperature.


The CoverWell PC200 accommodates a volume of 200 µl of lysate and the PC500, 550 µl.
Alternatively, 10-30 µl of lysate can be used if the array is small, but first draw a border
around it with a hydrophobic pen.


In general, simply cover the array with lysate. The exact volume is not important.


9. Wash the slides three times for 1 minute each with PBST and three times for 1 minute
each with PBS.

10. Add detection cocktail to the slides, and incubate them for 1 hour at room temperature.

11. Wash the slides twice for 1 minute each with PBST and twice for 1 minute each with PBS.

12. Centrifuge the slides at 200g for 30 seconds to remove excess buffer.

13. Image the slides with a fluorescence slide scanner (see Reading and Interpreting the
Arrays in the introduction to this chapter).
http://binddb.org                            BIND: The Biomolecular Interaction Network database

http://cbm.bio.uniroma2.it/mint/index.html   Molecular INTeraction database

http://curatools.curagen.com                 CuraTools for gene sequence and protein data mining and analysis. CuraGen

http://dip.doe-mbi.ucla.edu                  Database of Interacting Proteins

FIGURE 11.1. Flowchart illustrating the steps for finding a protein's structure, function, and interaction partners.


A TYPICAL PROTEOMICS EXPERIMENT MIGHT BEGIN WITH CELLS grown under some specified set 
of conditions. A subset of the cellular proteins is purified (see Chapter 1) through subcellu- 
lar fractionation (see Chapter 3) or the use of affinity tags or protein interactions (see 
Chapter 10). These proteins are then identified or quantified by some combination of one- 
or two-dimensional gel electrophoresis (Chapters 2 and 4), high-performance liquid chro- 
matography (Chapter 5), amino acid sequencing (Chapter 6), proteolysis (Chapter 7), and/or 
mass spectrometry (Chapter 8). A diverse set of bioinformatics analyses is required both to 
perform these experimental steps and to interpret the resulting data. Beyond the initial step 
of identifying peptides from their sequences or mass spectra, which is discussed in Chapters 
6-8, a proteomics investigator would often like to perform analyses to 


* identify intact proteins from their constituent peptides,

* find related proteins in multiple species,

* find genes corresponding to the proteins,

* find coexpressed genes or proteins,

* find or test candidate protein interaction partners,

* validate or compare proteomics measurements of posttranslational modifications or protein
subcellular localization with computational predictions of the same properties,

* predict structures or domains of proteins, and

* find the functions of proteins discovered in proteomics experiments.


This chapter provides advice and protocols for each of these bioinformatics tasks. More
generally, given a protein identified from an experiment, this chapter addresses the essential
problem of learning as much as possible, computationally, about the protein's function. The 
chapter is divided into three sections. 


* Section 1: Identification of a Gene and Its Homologs. Addressed in this section is the
problem of identifying the complete gene and its homologs from a peptide or short protein
sequence.


* Section 2: Predicting Protein Structure and Functions. Methods are presented to determine
the domain and secondary structure of a protein, and where possible, the tertiary
structure and biochemical function.


* Section 3: Matching Proteins to Pathways and Identifying the Cellular Roles of Proteins.
Methods to discover a gene's cellular role are introduced, including protocols for assigning
genes to pathways and for identifying potential interaction partners.


In each section, protocols and freely available resources, such as applicable Web servers, 
are presented as an aid to the analyses. The approaches we suggest are diagrammed in Figure 
11.1. It is worth noting that the methods described in this chapter can be applied to predic- 
tion and discovery as well as to validation. For example, given experimentally identified pro- 
tein interactions, investigators have a choice: They can extend the experimental interactions 
by the addition of computationally identified interactions, as the combined interaction sets 
are likely to be far more informative, or they can test the experimental interactions by
comparison with the computational predictions. Often, computational data can be incorporated
intuitively into proteomics experiments, such as by explicitly verifying that experimentally
observed membrane proteins are computationally predicted to contain transmembrane segments.
The wise investigator will incorporate computational data wherever possible, but set
certain classes of bioinformatics analyses aside to provide independent validation of the
results. For additional resources on bioinformatics, especially for the novice, see the panel
on ADDITIONAL RESOURCES at the end of this chapter.
SECTION 1: Identification of a Gene and Its Homologs


PROGRAMS SUCH AS SEQUEST, DETAILED ELSEWHERE IN THIS VOLUME (Dongre et al. 1997 and
Chapter 8), predict a peptide's sequence from a mass spectrum trace. Although these pro- 
grams often successfully search sequence databases and attach a peptide sequence to a gene, 
manual searching can often be revealing, as it is able to take into account the individual inves- 
tigator's experience with a protein and with the experiments in which it was discovered. In 
addition, it is often of value to know the near and remote homologs of a protein of interest 
when predicting the function. This section provides suggestions for getting the most infor- 
mation from sequence database searches. 


SEQUENCE DATABASE SEARCHING WITH GAPPED-BLAST AND PSI-BLAST 


Background and Methodology 


Many biologists conduct searches of sequence databases routinely using one of three popular
programs: FASTA, Gapped-Basic Local Alignment Search Tool (Gapped-BLAST), and
Position-Specific Iterated-BLAST (PSI-BLAST). FASTA is generally considered more sensitive
for DNA-DNA comparisons. However, BLAST is usually faster. Needleman-Wunsch
(NW) and Smith-Waterman (SW) are two other methods commonly used for sequence
comparison. Although NW and SW are more sensitive than either Gapped-BLAST or FASTA,
they are not generally used for searching sequence databases because of their prohibitive time
requirements. This section focuses on the Gapped-BLAST and PSI-BLAST family of programs.
Specific programs in this family are described in Table 11.1. For detailed descriptions
of the mechanics of these programs, see Altschul et al. (1990, 1997). In addition, two recent 
reviews (Altschul and Koonin 1998; Jones and Swindells 2002) provide helpful tips for effec- 
tively running these programs. 

The Gapped-BLAST suite compares sequences using a substitution matrix, such as
BLOSUM62, and mostly identifies obvious homologs of a query protein. In contrast,
PSI-BLAST identifies remote homologs of a query protein by sitting on top of Gapped-BLAST
and defining probable substitutions at each position of a query sequence from the results of
previous searches. PSI-BLAST is a complex, computation-intensive program and should
only be used for finding remote homologs.


TABLE 11.1. The five programs of the Gapped-BLAST family


Program    Query        Database     Comparison method

BLASTP     protein      protein      direct comparison against database

BLASTN     nucleotide   nucleotide   direct comparison against database

BLASTX     nucleotide   protein      translates query in all six reading frames and
                                     then performs comparison against database

TBLASTN    protein      nucleotide   translates database in all six reading frames
                                     and then performs comparison of query

TBLASTX    nucleotide   nucleotide   translates both the query and the database in
                                     all six reading frames into proteins and then
                                     compares the generated proteins
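
The "six reading frames" referred to in Table 11.1 are the three codon offsets on the given strand plus the three offsets on its reverse complement. The sketch below illustrates the idea with the standard genetic code; it is a minimal illustration under that assumption, not a description of how the BLAST programs are implemented, and the function names are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code in TCAG order (NCBI translation table 1).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def translate(dna):
    """Translate a DNA string codon by codon; unrecognized codons become 'X'."""
    codons = (dna[i:i + 3] for i in range(0, len(dna) - 2, 3))
    return "".join(CODON_TABLE.get(codon, "X") for codon in codons)


def six_frame_translations(dna):
    """Return the six conceptual translations keyed by frame (+1 to +3, -1 to -3)."""
    dna = dna.upper()
    reverse = dna.translate(COMPLEMENT)[::-1]   # reverse complement
    frames = {}
    for offset in range(3):
        frames[f"+{offset + 1}"] = translate(dna[offset:])
        frames[f"-{offset + 1}"] = translate(reverse[offset:])
    return frames


for frame, protein in six_frame_translations("ATGGCCTAA").items():
    print(frame, protein)
```

Stop codons are shown as `*`; a search program would typically split or score around them rather than discard the frame.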


PROTOCOL 1 


Simple Sequence Search with a Long Peptide Using 
Gapped-BLAST 


TWO PURPOSES OF THIS PROTOCOL ARE:


* Identification of the gene from which a peptide fragment is derived. Knowing the gene
sequence from which a peptide is derived vastly increases the potential to learn about the
structure, function, and interactions of the parent protein, its homologs, and its interaction
partners.


* Identification of genes that code for homologs to the peptide (or its parent protein).
Knowing a protein's homologs provides insight into structurally and functionally important
residues in the protein and enables better identification of remote homologs, secondary
structure, tertiary structure, etc.


It is also important to consider from which organism the query sequence was derived. 
When performing a sequence search, two simple questions must first be answered: What is 
the appropriate query sequence? What is the appropriate sequence database to search? The 
possible answers to the first question are discussed in Section 3. For the moment, we assume 
that the entire known sequence is the query. 

Some of the very large number of sequence databases are listed in Table 11.2. A few com- 
mon databases are described to illustrate the importance of thinking about which database 
to search, rather than simply choosing the largest. 

One popular database, GenPept, is gigantic and nonspecific, containing translated coding
regions of all publicly available DNA sequences in addition to proteins that have been
specifically deposited. Other databases are smaller and specific to a single genome or a single
protein family. For example, the igBLAST database contains exclusively immunoglobulin
proteins. The proteins within these specialty databases are not always contained in GenPept.
In addition, sequencing centers will often provide users with the ability to search incompletely
sequenced genomes. It is generally best to search smaller, more specific databases
whenever possible. The searches are less computationally expensive and are often likely to yield
useful results. For example, if an investigator were trying to determine which E. coli protein
a peptide is derived from, it would make the most sense to initially search an E. coli database.
The NCBI site has a good set of tutorials describing the adjustable input parameters for
BLAST. Readers are encouraged to visit:

http://www.ncbi.nlm.nih.gov/Education/BLASTinfo/information3.html


INPUTS 


Query sequence (sample): YLGKITRNDAEVLLKKPTVRDGHFLVRQCESSPGEFSISVRFQDSVQHFKVLRDQNGKYY
Database (sample): C. elegans
Starting Web Address: http://www.ncbi.nlm.nih.gov/BLAST/
Additional Web Sites for BLAST Searches
http://blast.wustl.edu/
http://genome.wustl.edu/Blast/client.pl
http://www.sanger.ac.uk/DataSearch

Web Sites for sequence searches with small peptides
http://prowl.rockefeller.edu/cgi-bin/ProFound
http://www.expasy.ch/tools/
http://pir.georgetown.edu/


TABLE 11.2. Databases available on BLAST Web server


Database            Description


A. Peptide sequence databases

nr                  translations of GenBank DNA sequences with redundancies removed,
                    PDB, SwissProt, PIR, and PRF
month               new or revised entries or updates to nr in the previous 30 days
swissprot (a)       latest release of the SwissProt protein sequence database
Drosophila genome   provided by Celera and Berkeley Drosophila genome project
yeast               yeast (Saccharomyces cerevisiae) protein sequences
E. coli             E. coli genomic coding sequence translations
pdb                 sequences of proteins of known three-dimensional structure
                    from the Brookhaven Protein Data Bank
kabat [kabatpro]    Kabat's database of sequences of immunological interest
alu                 translations of select Alu repeats from REPBASE,
                    a database of sequence repeats


B. Nucleotide sequence databases

nt                  GenBank, EMBL, DDBJ, and PDB sequences with redundancies removed
                    (EST, STS, GSS, and HTGS sequences excluded)
month               new or revised entries or updates to nr in the previous 30 days
dbest (b)           EST sequences from GenBank, EMBL, and DDBJ with redundancies
                    removed
dbsts (b)           STS sequences from GenBank, EMBL, and DDBJ with redundancies
                    removed
htgs (b)            high-throughput genomic sequences
yeast               yeast (Saccharomyces cerevisiae) genomic sequences
E. coli             E. coli genomic sequences
kabat [kabatnuc]    Kabat's database of sequences of immunological interest
vector              vector subset of GenBank
mito                database of mitochondrial sequences
alu                 select Alu repeats from REPBASE, a database of sequence repeats;
                    suitable for masking Alu repeats from query sequences
epd                 eukaryotic promoter database
gss (b)             genome survey sequences, includes single-pass genomic data,
                    exon-trapped sequences, and Alu PCR sequences


Reprinted, with permission, from Mount (2001).

(a) The SwissProt database is carefully curated but not always up to date because updates are released after longer
intervals. SwissProt and PIR are the preferred protein databases for searches because the nr protein database is a
composite of several databases and has duplicates of many sequences. Unfortunately, PIR is not provided as a
separate choice on the database menu.

(b) Databases containing sequences that may have been less accurately determined.
METHOD


Inputting the Peptide Sequence


1. Go to http://www.ncbi.nlm.nih.gov/BLAST/.


2. In the section denoted Protein BLAST, click Standard protein-protein BLAST
[blastp].


3. Enter the query sequence into the Search box.


4. In the select box denoted Limit by entrez query, choose Caenorhabditis elegans.


Choosing C. elegans at this stage specifies the precise database to search. Neglecting this step 
results in searching GenPept. 


5. At the bottom of the page, click the button marked BLAST! 
6. Wait a few moments and then click the button marked Format! 


7. The results page will appear. A description of how to interpret the results (the output) is 
given next. 
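
For readers who prefer to script the search rather than use the Web form, the same choices (program, database, query, organism restriction) can be expressed as parameters of the NCBI BLAST URL interface. The sketch below only builds the encoded submission and does not contact the server. The server address shown is the present-day NCBI address rather than the one printed in this protocol, and the function name and the default database are assumptions made for illustration.

```python
from urllib.parse import urlencode, parse_qs

# Present-day NCBI BLAST address (an assumption relative to the address in the text).
BLAST_URL = "https://blast.ncbi.nlm.nih.gov/Blast.cgi"


def build_blastp_submission(query, organism, database="nr"):
    """Encode a blastp search restricted to one organism (mirrors Steps 2-4)."""
    params = {
        "CMD": "Put",                              # submit a new search
        "PROGRAM": "blastp",                       # protein query vs. protein database
        "DATABASE": database,
        "QUERY": query,
        "ENTREZ_QUERY": f"{organism}[Organism]",   # equivalent of "Limit by entrez query"
    }
    return urlencode(params)


body = build_blastp_submission(
    "YLGKITRNDAEVLLKKPTVRDGHFLVRQCESSPGEFSISVRFQDSVQHFKVLRDQNGKYY",
    "Caenorhabditis elegans",
)
print(BLAST_URL)
print(body)
```

Submitting this body returns a request identifier that is then polled for results; that exchange is omitted here to keep the sketch free of network access.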


Interpreting BLAST Output 


BLAST commonly returns output in several sections, some of which are not always returned 
(depending on whether the output is to be e-mailed or displayed). The first section of output 
describes the query just performed, listing the description of the query sequence (if one was 
provided), and also the database searched against. Regardless of any restrictions placed on the 
search database, BLAST will often list the database as being the complete database available. 
For example: 


Query= gi|2501594|sp|Q57997|Y577_METJA PROTEIN MJ0577 (162 letters)
Database: Non-redundant GenBank CDS translations+PDB+SwissProt+SPupdate+PIR
437,713 sequences; 134,605,311 total letters

The second section (optionally returned) provides a graphical image map overview of 
the score of database matches and where they map (align) onto the query sequence. Each line
is clickable. Clicking a given line hyperlinks to the alignment between the query protein 
sequence and the protein identified in the search. A glance at the graphical overview reveals 
several high-scoring sequences that are highly related to the query. The top bar (pink) is a 
match to the protein from which the fragment was drawn. The next several full-length bars 
(blue) are probable distant homologs. Partial length bars reflect similarity between a portion 
of the query and a database entry. These regions of similarity may be a shared domain or 
motif. The nature of these lower-scoring hits can be further explored by examining the cor- 
responding alignment. 

The third section of output provides descriptions of each significant alignment. The 
score and E-value or P-value are listed at the end of each line. 
WHAT IS THE BLAST SCORE?


P-values and E-values are different ways of representing the significance of the raw score 
of a sequence alignment. The raw score of an alignment is the sum of substitution and gap 
scores, which measure imperfect amino acid substitutions and the size of insertions/dele- 
tions, respectively, in the sequence alignment. Substitution scores S are given by a look-up 
table (see PAM, BLOSUM). Gap scores are typically calculated as the sum of a gap opening
penalty G and a gap extension penalty L. For a gap of length n, the gap cost would be
G + Ln. The choice of gap penalties, G and L, is empirical, but it is customary to choose a
high value for G (10-15) and a low value for L (1-2).
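
As a small worked illustration of the scoring scheme just described, the sketch below computes a gap cost G + Ln and a raw score as the sum of substitution scores minus the gap costs. The default penalties (G = 11, L = 1) are assumed values within the customary ranges quoted above, and the substitution scores in the example are invented for illustration rather than taken from a real alignment.

```python
def gap_cost(length, gap_open=11, gap_extend=1):
    """Cost of one gap of the given length: G + L * n."""
    return gap_open + gap_extend * length


def raw_score(substitution_scores, gap_lengths, gap_open=11, gap_extend=1):
    """Sum of substitution scores minus the cost of every gap in the alignment."""
    penalty = sum(gap_cost(n, gap_open, gap_extend) for n in gap_lengths)
    return sum(substitution_scores) - penalty


# Five aligned residue pairs (illustrative scores) and one gap of length 3
print(gap_cost(3))                          # 11 + 1 * 3
print(raw_score([5, 4, 6, 4, 7], [3]))      # 26 - 14
```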

The E-value, or expectation value, is the number of different alignments with scores 
equal to or better than S that are expected to occur by chance in a database search of this
particular database. The lower the E-value, the more significant the score. It is important 
to note that the E-value depends on database size. For example, performing the same data- 
base search of the GenPept database over time, even when an identical set of proteins is 
returned, will result in progressively less significant E-values as the database grows in size. 

For comprehensive databases (such as the NCBI GenPept database, with 1,000,000
sequences), a match with an E-value <0.000001 can be trusted as a true homolog.
However, a search through all proteins of known structure (e.g., Protein Data Bank, see
Table 11.2) with E <0.001 is akin to searching the NCBI NRDB with E approximately <0.1
because the NRDB is nearly 100 times larger. As such, the two results are not comparable
unless one corrects for database size. Often, additional remote homologs can be found 
with higher (less significant) E-values. Manual inspection of the quality of the alignment 
and of the annotation often enables the user to pick these true homologs from the noise. 
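
The correction for database size mentioned above can be sketched numerically. The example assumes that, for a fixed alignment score, the E-value grows in direct proportion to the size of the database searched; the function name is hypothetical, and the sizes are expressed only as a ratio because the text gives the NRDB as roughly 100 times larger than the structure database.

```python
def rescale_e_value(e_value, searched_size, reference_size):
    """E-value the same alignment score would receive in a database of reference_size.

    Assumes the E-value is directly proportional to database size.
    """
    if searched_size <= 0 or reference_size <= 0:
        raise ValueError("database sizes must be positive")
    return e_value * reference_size / searched_size


# A hit with E = 0.001 in a database 100 times smaller than the reference
print(rescale_e_value(0.001, searched_size=1.0, reference_size=100.0))
```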

Some BLAST implementations return a P-value instead of an E-value. The P-value 
expresses the probability of an alignment occurring with a given raw score or better. This
probability score represents the intrinsic quality of the alignment and does not take into 
account the size of the database searched. The most highly significant P-values will be 
those close to 0. 


The description (also called definition) lines are listed below under the heading
“Sequences producing significant alignments.” Here, the term “significant” simply refers to all 
those hits whose E-value was less than some threshold and should not be interpreted as “bio- 
logical significance.” 


                                                                  Score     E
Sequences producing significant alignments:                      (bits)  Value


gi|17569445|ref|NP_509342.1| (NM_076941) C. elegans sem-5 [...     127   1e-30
gi|17508235|ref|NP_493502.1| (NM_061101) Src homology domai...      49   6e-07

The final section provides the alignments themselves and full descriptions of the
sequences as shown below. Pay attention to protein names found in the descriptions. Also
note items of the form "sp|P29355|SEM5_CAEEL." These are identifiers referencing the
protein in other databases. The code sp indicates that the entry is in the SwissProt database.
The codes P29355 and SEM5_CAEEL are identifiers used to reference that entry within the
SwissProt database.
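
When many hits must be followed up, it can be convenient to pull these identifiers apart programmatically. The sketch below splits one description line of the pipe-delimited form shown in this output into its fields. The function name and the field labels are hypothetical, and lines with a different layout (for example, PIR entries, which leave the accession field empty) would need additional handling.

```python
def parse_defline(defline):
    """Split 'gi|number|db|accession|name description' into labeled fields."""
    id_part, _, description = defline.strip().partition(" ")
    fields = id_part.split("|")
    if len(fields) < 4 or fields[0] != "gi":
        raise ValueError(f"unexpected identifier layout: {id_part!r}")
    return {
        "gi": fields[1],
        "database": fields[2],                                # e.g., sp = SwissProt
        "accession": fields[3],
        "entry_name": fields[4] if len(fields) > 4 else "",
        "description": description,
    }


record = parse_defline("gi|134425|sp|P29355|SEM5_CAEEL Sex muscle abnormal protein 5")
print(record["database"], record["accession"], record["entry_name"])
```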




>gi|17569445|ref|NP_509342.1| (NM_076941) C. elegans sem-5 [Caenorhabditis elegans]
 gi|134425|sp|P29355|SEM5_CAEEL Sex muscle abnormal protein 5
 gi|283556|pir||S25730 SH2-SH3 protein sem-5 - Caenorhabditis elegans
 gi|247605|gb|AAB21850.1| (S88446) cell-signalling [Caenorhabditis elegans]
 gi|861389|gb|AAA68405.1| (U29082) Hypothetical protein C14F5.5 [Caenorhabditis elegans]
 gi|228675|prf||1808298A sem-5 gene [Caenorhabditis elegans]
          Length = 228

 Score = 127 bits (319), Expect = 1e-30
 Identities = 60/60 (100%), Positives = 60/60 (100%)

Query: 1   YLGKITRNDAEVLLKKPTVRDGHFLVRQCESSPGEFSISVRFQDSVQHFKVLRDQNGKYY 60
           YLGKITRNDAEVLLKKPTVRDGHFLVRQCESSPGEFSISVRFQDSVQHFKVLRDQNGKYY
Sbjct: 61  YLGKITRNDAEVLLKKPTVRDGHFLVRQCESSPGEFSISVRFQDSVQHFKVLRDQNGKYY 120


>gi|17508235|ref|NP_493502.1| (NM_061101) Src homology domain 2, Src homology
 domain 3, protein tyrosine kinase (src subfamily) [Caenorhabditis elegans]
 gi|7160701|emb|CAB04427.2| (Z81543) contains similarity to Pfam domain:
 PF00017 (Src homology domain 2), Score=115.5, E-value=2.6e-39, N=1; PF00018
 (SH3 domain), Score=81.5, E-value=5.5e-21, N=1; PF00069
 (Eukaryotic protein kinase domain), Score=276.2,
 E-value=1.4e-79, N=1-cDNA >
          Length = 507

 Score = 48.5 bits (114), Expect = 6e-07
 Identities = 25/60 (41%), Positives = 35/60 (57%)

Query: 1   YLGKITRNDAEVLLKKPTVRDGHFLVRQCESSPGEFSISVRFQDSVQHFKVLRDQNGKYY 60
           Y GK+ R DAE  L       G FLVR  ES   + S+SVR  DSV+H+++ +  +G Y+
Sbjct: 125 YFGKMRRIDAEKCLLHTLNEHGAFLVRDSESRQHDLSLSVRENDSVKHYRIRQLDHGGYF 184


A high-quality alignment generally contains a small number of gaps and a large number
of identical or similar residues. Identical residues are reported along the center line of the
alignment. Similar residues are denoted by a + in the center line of the alignment. The
first alignment shows that this protein segment matches identically with the Sem-5
protein from C. elegans, which is in fact the protein from which the segment was originally
drawn. We can now use the various reference codes (17569445, NP_509342.1, 134425,
P29355, SEM5_CAEEL...) to find the full-length protein, and also to learn something
about the protein as discussed below. Be cautioned at this point not to draw too many
conclusions from the GenPept annotation of the protein. Although the annotation may have
been drawn from experiment, it was likely drawn instead from homology searches, and hence
may not be strictly reliable. In addition, this search should be repeated with the full-length
protein in order to gather a diverse set of homologs to better understand the structure
and function of the protein of interest.
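
The identity figures reported by BLAST can be checked by hand from any pair of aligned rows. The following is a minimal sketch, not part of BLAST itself; the function name is our own, and the two strings are the Query and Sbjct rows of the second alignment above, transcribed with apparent scanning errors corrected by hand (an assumption).

```python
def alignment_stats(query: str, subject: str, gap: str = "-") -> dict:
    """Count identities and gapped columns between two aligned rows."""
    if len(query) != len(subject):
        raise ValueError("aligned rows must have the same length")
    columns = list(zip(query, subject))
    identities = sum(1 for q, s in columns if q == s and q != gap)
    gaps = sum(1 for q, s in columns if q == gap or s == gap)
    return {
        "length": len(columns),
        "identities": identities,
        "gaps": gaps,
        # Integer division mirrors the truncated percentage shown in the report.
        "percent_identity": 100 * identities // len(columns),
    }


# Hand-transcribed rows of the second alignment (assumed free of scanning errors).
QUERY = "YLGKITRNDAEVLLKKPTVRDGHFLVRQCESSPGEFSISVRFQDSVQHFKVLRDQNGKYY"
SBJCT = "YFGKMRRIDAEKCLLHTLNEHGAFLVRDSESRQHDLSLSVRENDSVKHYRIRQLDHGGYE"

stats = alignment_stats(QUERY, SBJCT)
print(stats)
```

Counting "positives" as well would require a substitution matrix such as BLOSUM62, which is omitted here to keep the sketch short.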
What to do with mediocre scores?


BLAST is sometimes unable to locate an identical or even a significant match to a protein
segment. This is especially true for organisms that have not been completely sequenced. First try
a different database. If after searching all of GenPept, TREMBL, and PIR a quality match has
not been found, it may be of value to use TBLASTN to search GenBank, or EST and
unfinished genome sequence databases. If the exact protein cannot be found, working with a
close homolog might still be informative. However, if a homologous gene from a different
organism (e.g., kangaroo, when the query protein is from human) is chosen, it is best to
proceed with caution, as even close homologs often vary in function and may contain quite
distinct domains (e.g., see Section 3 of this chapter). It is also a good idea to periodically repeat
searches of the organism-specific expressed sequence tag (EST) databases and
organism-specific databases available at the sequencing centers.


PSI-BLAST and remote homology searches 


A multiple alignment of a sequence family provides much information about which residues 
are highly conserved and which regions of the sequence can accommodate insertions and 
deletions. Various methods exist that take into account this position-specific information. 
The simplest methods use consensus sequences or consensus sequence patterns. More 
sophisticated methods use profiles of the alignment, which consist of scores for allowable 
amino acid residues, insertions, and deletions specifically calculated for each position in the 
alignment. Searching with a profile often identifies remote homologs not found with a single 
sequence and scoring matrix. PSI-BLAST uses results of an initial BLAST search to generate 
a profile with which to perform additional searches of the same database. At each step,
PSI-BLAST "improves" the profile, taking into account the new set of sequences discovered. When
using PSI-BLAST, it is therefore important to verify that the sequences included at each stage
are actually members of a homologous family. A few falsely identified proteins can badly skew 
the statistics, dooming further iterations. 

PSI-BLAST performs best on large databases, such as SwissProt, PIR, or GenPept. 
Running the first stage of PSI-BLAST is identical to running Gapped-BLAST. The output of
PSI-BLAST, however, presents the user with the option of selecting extra sequences for
inclusion in the profile (sequences believed to be part of the family that were initially below
threshold) and running extra iterations. Each cycle identifies more sequences, unless
PSI-BLAST converges on a set of proteins and cannot find any more potentially homologous
sequences within the database. If, at a given iteration, the newly identified sequences diverge
greatly from the original query protein, it is likely that a rogue sequence, not truly homolo- 
gous to the query, has been accidentally included and is biasing the search. 
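
To make the idea of a position-specific profile concrete, the sketch below builds log-odds scores for each column of a small alignment and uses them to score candidate segments. It is an illustration only and not the PSI-BLAST algorithm: it assumes a gap-free alignment, a uniform background frequency, and a simple pseudocount, and the four aligned segments are invented toy data.

```python
import math
from collections import Counter

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def build_profile(alignment, pseudocount=1.0):
    """Return one {residue: log-odds score} dict per alignment column.

    Assumes a gap-free alignment and a uniform background (simplifications).
    """
    background = 1.0 / len(AMINO_ACIDS)
    profile = []
    for column in zip(*alignment):
        counts = Counter(column)
        total = len(column) + pseudocount * len(AMINO_ACIDS)
        profile.append({
            aa: math.log2(((counts[aa] + pseudocount) / total) / background)
            for aa in AMINO_ACIDS
        })
    return profile


def score_segment(profile, segment):
    """Sum the position-specific scores of a segment as long as the profile."""
    return sum(column[aa] for column, aa in zip(profile, segment))


# Toy family alignment (invented for illustration, not real data).
family = ["FLVRES", "FLVRDS", "FLIRES", "YLVRES"]
profile = build_profile(family)

conserved = score_segment(profile, "FLVRES")   # resembles the family
unrelated = score_segment(profile, "GPGKWA")   # shares no residue with any column
print(round(conserved, 2), round(unrelated, 2))
```

Adding a falsely included sequence to `family` shifts every column's counts, which is the sense in which a rogue sequence can bias later iterations.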


The Importance of Orthologs 


For many functional analyses, the user often wants to identify orthologous proteins, rather 
than paralogous or simply homologous proteins. Typically, genes with a common evolutionary 
ancestor are denoted homologs. Orthologs are homologous genes generated by a speciation 
event, whereas paralogs are homologous genes separated by a duplication event (Fitch 1970). 
The similar sequences identified as statistically significant by algorithms such as BLAST and
FastA are typically homologous. Consequently, the term homologous is often used to 
describe the sequences identified from a BLAST search, although the presence of a shared 
evolutionary ancestor is not strictly proven. Use of the term “homology” varies by discipline. 
In certain biological communities (e.g., biochemistry), the term is used loosely. However, in
others (e.g., evolutionary biology), it is used more strictly and cautiously. 

Regardless of semantics, for complete transfer of functional annotation from one 
sequence to another, one typically requires that the sequences be orthologs. However, it is 
clear that the hits from BLAST sequence searches are often not orthologous. For example, 
searching a yeast sequence database with the bacterial MutS DNA repair protein will return 
six classes of homologous DNA repair proteins, all part of the MSH protein family (for Mut 
Homologs). However, the functions of these proteins range from DNA mismatch repair, as 
for MSH2 and MSH6 (Johnson et al. 1996), to reciprocal recombination, as for MSH5 
(Hollingsworth et al. 1995). To specifically assign the function of a new DNA repair protein 
would require knowing to which of these subfamilies the protein belongs. 

Assigning a protein to one family out of several paralogous families is best done by cal- 
culating a phylogenetic tree for all of the proteins, and then examining which proteins are 
closest to the query protein in the tree. For an illustration of assigning functions from
phylogenetic trees (using the MSH proteins as an example), see Eisen (1998). Programs for
calculating phylogenetic trees are widely available, and several such programs are listed in the
panel below. For good reviews on phylogenetic analysis, see Thornton and DeSalle (2000)
and Mount (2001).


ClustalX (Thompson et al. 1997)
download at ftp://ftp-igbmc.u-strasbg.fr/pub/ClustalX/
available on the Internet at http://www.ebi.ac.uk/clustalw/


TreeView (Rod Page)
download at http://taxonomy.zoology.gla.ac.uk/rod/treeview.html


Phylip (Joe Felsenstein)
http://evolution.genetics.washington.edu/phylip.html
available on the Internet (courtesy of Allison Lim and Louxin Zhang) at
http://sdmc.krdl.org.sg:8080/~lxzhang/phylip/


PAUP
http://paup.csit.fsu.edu/


In addition to phylogenetic trees, several rapid methods exist for finding possible
orthologous relationships between proteins. One such method is to simply find the protein in the
COG database (short for "Clusters of Orthologous Groups" [Tatusov et al. 2001];
http://www.ncbi.nlm.nih.gov/COG/). The curators of COG have organized the known
proteins from a number of different genomes into orthologous groups. Within each COG
"category," proteins tend to have very closely related functions. In fact, they are not strictly
orthologs (e.g., the E. coli paralogs gyrA and parC are in the same COG category), but they
do tend to have very similar sequences and very similar functions. Therefore, if a protein of
interest is in the same COG category as a protein of known function, it is reasonable to expect
the protein of interest to have a closely related function.

Protocol 2 details a rapid, easily automated method for identifying orthologs based on
sequence similarity searches.


PROTOCOL 2 


Rapid Identification of Orthologs by the Method of 
Bidirectional Best Hits 


GIVEN A GENE (GENE A FROM ORGANISM 1), a rapid sequence-comparison-based method can
be used to identify an orthologous gene (gene B from organism 2), when both genomes have
been completely sequenced. This is an approximate method, not absolutely guaranteed to
find orthologs, but often effective. The method, which is called the method of "symmetric
best hits" or "bidirectional best hits" (Overbeek et al. 1999), involves searching genome 1 with
gene B. The best-matching sequence in genome 1 (corresponding to gene A, if A and B are
orthologs) would then be used as the query in the reverse sequence search against genome 2.
If the best-matching sequence in this search is gene B, then genes A and B are bidirectional
best hits and are operationally defined as orthologs. This protocol is best automated if many
sequences are to be examined, but the following manual procedure will serve well for only a few cases.




INPUTS 


As an illustration of this protocol's utility, we will find the E. coli ortholog of a
D. radiodurans gene.

Organism 1: E. coli

Organism 2: Deinococcus radiodurans


Gene B: aspartokinase I gene (GenBank identifier 6459119), from D. radiodurans


METHOD 


1. Find the genome of organism 1 (in this case, E. coli) in the Entrez genome database. 


Many of the "prominent" organisms are available at
http://www.ncbi.nlm.nih.gov/PMGifs/Genomes/org.html. In this example, click on the name
Escherichia coli K12, which brings up the Web page dedicated to the E. coli K12 genome
(http://www.ncbi.nlm.nih.gov/cgi-bin/Entrez/framik?db=G&gi=115).


2. Click to BLAST against protein sequences. Click to new protein sequence. Paste in the 
amino acid sequence of protein B (the D. radiodurans protein 6459119) and press go. 


3. The best match from this BLAST search, protein A (in this case, the protein with the
GenBank identifier 1786183), has the potential of being an ortholog of protein B. To evaluate
protein A by performing the reverse sequence search, begin by clicking on the name
of the protein A sequence, which will hyperlink to its sequence entry. Copy the protein
sequence from the bottom of the entry.
4. Find the genome of organism 2 (in this case, D. radiodurans) in the Entrez genome database
at http://www.ncbi.nlm.nih.gov/PMGifs/Genomes/org.html.


5. Click to BLAST against protein sequences. Click to new protein sequence. Paste in the
copied protein sequence of protein A (e.g., 1786183) and press go.


6. If the best-matching sequence from this search is protein B (in this case, 6459119), then
the two proteins (e.g., 1786183 and 6459119) are bidirectional best hits. In the event that
the function of the D. radiodurans protein was unknown, a good guess as to its function
would be that it is similar to that of its annotated ortholog, the E. coli aspartokinase I.
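
When many sequences are to be examined, the steps above can be automated once the BLAST results for both directions have been collected. The sketch below assumes the hits have already been parsed into dictionaries; the function names, the data layout, the extra identifiers `ec_other` and `dr_paralog`, and all bit scores are hypothetical and serve only to illustrate the bookkeeping.

```python
def best_hit(hits):
    """Return the subject identifier with the highest bit score, or None."""
    if not hits:
        return None
    return max(hits, key=lambda hit: hit[1])[0]


def bidirectional_best_hits(hits_1_vs_2, hits_2_vs_1):
    """Pair genes that are each other's best hit.

    hits_1_vs_2 maps each genome 1 query to [(genome 2 subject, bit score), ...];
    hits_2_vs_1 is the same for genome 2 queries searched against genome 1.
    """
    pairs = []
    for gene_b, hits in hits_2_vs_1.items():
        gene_a = best_hit(hits)                      # forward search: B against genome 1
        if gene_a is None:
            continue
        if best_hit(hits_1_vs_2.get(gene_a, [])) == gene_b:   # reverse search
            pairs.append((gene_a, gene_b))
    return pairs


# Identifiers 6459119 and 1786183 come from the protocol; scores are invented.
dr_vs_ec = {
    "6459119": [("1786183", 310.0), ("ec_other", 95.0)],
    "dr_paralog": [("1786183", 120.0)],
}
ec_vs_dr = {
    "1786183": [("6459119", 305.0), ("dr_paralog", 118.0)],
}

orthologs = bidirectional_best_hits(ec_vs_dr, dr_vs_ec)
print(orthologs)
```

Note that `dr_paralog` also finds 1786183 as its best hit but is rejected, because the reverse search from 1786183 returns 6459119 instead.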
SECTION 2: Predicting Protein Structure and Function


IN THE PREVIOUS SECTION, A GENE OR A SET OF GENES was identified that corresponds to a
peptide of interest, allowing many questions to be answered about the function of the gene
product. A set of programs and the biological questions they address are shown in Table 11.3. The
types of biochemical information that these tools allow investigators to glean about a protein
from its sequence are described schematically in Figure 11.2. To facilitate the use of these
programs, two "meta" servers have been set up: Gerard Kleywegt's ProSal
(http://xray.bmc.uu.se/sbnet) and Burkhard Rost's META-PP ("META Predict Protein Server")
(http://maple.bioc.columbia.edu/pp/submit_meta.html). Programs accessed by META-PP and
ProSal are indicated by checkmarks in the last two columns under the headings MS and PS,
respectively. Other relevant programs of interest are listed in Table 11.3 as well.


Protein Domain Structure Prediction 


Before rigorously examining the function of a protein of interest, it is important to assess its 
domain structure. Although there is no single accepted definition, domains are generally 
acknowledged to be stable, compact, semi-independent, autonomously folding regions within
a protein. Although some proteins consist of a single domain, others consist of several
domains. Many protein domains are formed from a single continuous segment of a protein
chain, but they can also be composed of discontinuous segments. The size of individual
domains varies widely. For example, both E-selectin and lipoxygenase-1 are two-domain
proteins: E-selectin is 36 residues long, whereas lipoxygenase-1 is 692 residues long. However, the
distribution of domain sizes is narrow; 90% of domains are composed of <200 residues.
FIGURE 11.2. Protein sequence motifs and domain structure.
TABLE 11.3. Web servers for identifying a variety of protein features


Program  URL  MS  PS

Identifying the secondary structure of a protein

Jpred  http://jura.ebi.ac.uk:8888/  ✓ ✓
PSI-pred  http://insulin.brunel.ac.uk/psiform.html  ✓
Phd  http://cubic.bioc.columbia.edu/pp  ✓ ✓
Sspro  http://promoter.ics.uci.edu/BRNN-PRED/  ✓
Prof  http://www.aber.ac.uk/~phiwww/prof/index.html  ✓

Identifying the tertiary structure of a protein

SDSC1  http://cl.sdsc.edu/hm.html  ✓
CPHmodels  http://www.cbs.dtu.dk/services/CPHmodels/  ✓ ✓
SWISS-MODEL  http://www.expasy.ch/swissmod/SWISS-MODEL.html  ✓ ✓
ModBase  http://pipe.rockefeller.edu/modbase-cgi/index.cgi
Loopp  http://ser-loopp.tc.cornell.edu/loopp.html  ✓
Superfamily  http://stash.mrc-lmb.cam.ac.uk/SUPERFAMILY/  ✓
Sausage  http://rsc.anu.edu.au/~drsnag/TheSausageMachine.html  ✓
Meta Server  http://bioinfo.pl/Meta/  ✓
UCLA-DOE FRSVR  http://fold.doe-mbi.ucla.edu/

Identifying transmembrane segments/topology

DAS  http://www.sbc.su.se/~miklos/DAS/  ✓
TMHMM  http://www.cbs.dtu.dk/services/TMHMM-1.0/  ✓ ✓
TopPred2  http://www.sbc.su.se/~erikw/toppred2/  ✓
MOMENT  http://www.doe-mbi.ucla.edu/Services/moment/  ✓

Identifying protein domains

Pfam  http://pfam.wustl.edu/hmmsearch.shtml  ✓ ✓
Interpro  http://www.ebi.ac.uk/interpro/scan.html  ✓
Prodom  http://protein.toulouse.inra.fr/prodom/blast_form.html  ✓
BLOCKS  http://blocks.fhcrc.org/blocks/blocks_search.html
SMART  http://smart.embl-heidelberg.de/  ✓
DomFISH  http://www.bmm.icnet.uk/~3djigsaw/dom_fish/

Identifying documented sequence motifs and patterns

Prosite  http://www.expasy.ch/tools/scnpsit1.html  ✓
Emotif  http://elucidate.stanford.edu/emotif/emotif-search.html  ✓
GeneQuiz  http://jura.ebi.ac.uk:8765/gqsrv/submit  ✓
PRINTS  http://bioinf.man.ac.uk/cgi-bin/dbbrowser/fingerPRINTScan/muppet/FPScan.cgi

Identifying subcellular localization

SignalP  http://www.cbs.dtu.dk/services/SignalP/
PsortII  http://psort.nibb.ac.jp/form2.html
ChloroP  http://www.cbs.dtu.dk/services/ChloroP/  ✓

Identifying posttranslational modification or cleavage

Prosite  http://www.expasy.ch/tools/scnpsit1.html  ✓
NetPhos  http://www.cbs.dtu.dk/services/NetPhos/  ✓
NetPico  http://www.cbs.dtu.dk/services/NetPicoRNA/  ✓ ✓


MS is META-PP and PS is ProSal.
Knowing domain boundaries is not simply of academic concern; it is also of great practical
importance to the functional and structural characterization of a protein. For example,
nuclear magnetic resonance (NMR) studies are sometimes not feasible on a full-length
protein, but they are tractable on each of the single domains. Furthermore, conformational
flexibility from linker regions between domains often impedes protein crystallization. A
priori knowledge of domain boundaries allows constructs more suitable for experimental
structure elucidation to be built. Experimental methods such as phage display and yeast
two-hybrid often also benefit from knowledge of domain structure.

The importance of domain structure is not limited to experiments; computational 
annotation and prediction benefit from knowing domain structure as well. For example, 
subsequences corresponding to individual domains are usually better query sequences than 
whole-protein sequences in similarity searches because individual domains likely correspond
to recurring functional and evolutionary units. Consequently, many fold recognition
and comparative modeling methods perform best when applied to the sequences of single
domains. Each domain in a multidomain protein often catalyzes a distinct biochemical reac- 
tion. Consequently, it is important when inferring function to assign the function to the cor- 
rect domain. Identifying domains from protein sequences is an active field of study. The 
majority of available methods rely on either comparative sequence searches, tertiary struc- 
ture/contact prediction, general properties of domains (e.g., sequence length), or a combi- 
nation of these methods to infer domain boundaries. Methods relying on sequence search- 
es have been successful at identifying contiguous domains when sequence similarity is read- 
ily apparent. However, at low sequence similarity, evolutionary relationships are difficult to 
discern. Furthermore, if the domain connectivity is conserved within a protein family, 
domain positions cannot be inferred. Methods relying strictly on general properties of 
domains are limited in their training by the incompleteness of protein structure databases,
which, because of experimental complexity, do not contain many proteins with more than three
domains. Methods based on structure/contact prediction are limited by the ability to predict
structures with relative accuracy. As no single method is dominant, several servers, each 
implementing a different approach, are described. In addition, a number of methods pro- 
duce "sequence domains" that only approximate structural domains, but are still quite use- 
ful when performing sequence searches and multiple sequence alignments of protein fami- 
lies. 

As mentioned above, the two tools ProSal and META-PP enable users to easily submit
sequences for domain analysis via ProDom (Corpet et al. 2000) and Pfam (Bateman et al.
2002). A search page at SBASE (Vlahovicek et al. 2002;
http://hydra.icgeb.trieste.it/~kristian/SBASE/) enables users to search for domain matches
within their sequence using BLAST.
An alternate library of domains/motifs is available from BLOCKS (Henikoff et al. 1999) at
http://www.blocks.fhcrc.org/blocks_search.html. Curated domain databases are also a widely
used tool for predicting protein function. A later section discusses how these databases
and other sequence signals can be used to assess protein function. Sternberg's DomFish
server (http://www.bmm.icnet.uk/~3djigsaw/dom_fish/) will conveniently attempt to locate
Pfam domains within a query sequence as well.

Alternate methods not strictly based on homology with characterized domain libraries
can be found in SnapDragon (George and Heringa 2002;
http://mathbio.nimr.mrc.ac.uk/~rgeorge/snapdragon/), which simply requires the input of a
sequence and e-mail address, or
in DGS (Wheelan et al. 2000; http://www.ncbi.nlm.nih.gov/Structure/dgs/DGSWeb.cgi),
which only takes as input the length of the sequence. DGS is generally best used only for
sequences of less than 600 amino acids.
Protein Transmembrane Domain Prediction


Approximately 20-30% of the proteins within a typical microbial genome contain trans- 
membrane domains, and many proteomics experiments (see Chapters 1 and 3) may explic- 
itly focus on membrane proteins. Membrane proteins are often difficult to overexpress and 
have thus far not been readily amenable to structural studies. Consequently, only a handful 
of structures of membrane proteins are available, in contrast to the tens of thousands of 
structures of soluble proteins. Therefore, computational methods that can identify membrane
proteins and shed light on their structure and topology are quite useful. Much transmembrane
helix prediction has focused on predicting the topology of the helices. Topology
prediction methods begin with the observations that transmembrane helices are often 20-30
residues long with a high overall hydrophobicity and that short nontranslocated loops con- 
tain many positively charged residues, whereas short translocated loops contain few such 
residues. Knowledge of the hydrophobicity of transmembrane segments enables identifica- 
tion of transmembrane segments from hydrophobicity plots, and knowledge of loop charge 
enables prediction of the orientation of a protein in the membrane. 
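
The two observations above translate directly into a simple calculation: a sliding-window hydrophobicity average to locate candidate helices, followed by a count of positively charged residues in the flanking loops. The sketch below is an illustration rather than any of the published predictors. The choice of the Kyte-Doolittle hydropathy scale, the 19-residue window, and the 1.6 threshold are our assumptions, and the test sequence is an invented toy example.

```python
# Kyte-Doolittle hydropathy values (assumed scale; other scales are in use).
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5, "Q": -3.5, "E": -3.5,
    "G": -0.4, "H": -3.2, "I": 4.5, "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8,
    "P": -1.6, "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def candidate_tm_segments(seq, window=19, threshold=1.6):
    """Return (start, end, mean) for the best window in each run above threshold.

    Coordinates are 0-based and half-open. Consecutive windows above the
    threshold form a run; only the highest-scoring window of a run is kept.
    """
    segments, best = [], None
    for start in range(len(seq) - window + 1):
        mean = sum(KYTE_DOOLITTLE[aa] for aa in seq[start:start + window]) / window
        if mean >= threshold:
            if best is None or mean > best[2]:
                best = (start, start + window, mean)
        elif best is not None:
            segments.append(best)
            best = None
    if best is not None:
        segments.append(best)
    return segments


def positive_charge(loop):
    """Count Lys and Arg residues; loops rich in them tend not to be translocated."""
    return loop.count("K") + loop.count("R")


# Invented toy protein: charged loop, 19-residue hydrophobic stretch, acidic loop.
N_LOOP, HELIX, C_LOOP = "MKRKRSGD", "LIVALLVGAILLFAVLLIA", "SDENQTDE"
toy = N_LOOP + HELIX + C_LOOP

segments = candidate_tm_segments(toy)
start, end, _ = segments[0]
n_side, c_side = positive_charge(toy[:start]), positive_charge(toy[end:])
print(segments, n_side, c_side)
```

In this toy case the amino-terminal loop carries all of the positive charge, so it would be assigned to the nontranslocated side of the membrane.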

Topology prediction methods vary in complexity depending on how the hydrophobicity 
scale has been derived, what algorithm is used for prediction, and whether or not informa- 
tion from homologous sequences is taken into account. The best current methods claim that 
>90% of all transmembrane segments can be correctly identified and that the full topology 
is correctly predicted for >80% of all proteins. Predictions are slightly better for prokaryotic 
proteins than for eukaryotic proteins. Topology prediction of β-barrel membrane proteins
(e.g., porins) is more difficult since the membrane-spanning β-strands are short and thus
hard to detect in the sequence. Reasonable predictions can be made when the protein can be 
aligned with the sequence of a protein of known structure. 

A variety of transmembrane helix predictors are available, including DAS (Cserzo et al.
1997), TMHMM (Krogh et al. 2001), TopPRED (Claros and von Heijne 1994), and PHDhtm
and PHDtopology (Rost et al. 1996), all of which are accessed via META-PP. These methods
use clever statistics or neural networks to fish out potential transmembrane
helices. An alternative approach based on a simple windowed hydrophobic moment is available
at http://www.doe-mbi.ucla.edu/Services/moment/. TMHMM was recently evaluated to be
the most accurate (Moller et al. 2001). In general, we suggest a consensus approach.
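
One simple way to form such a consensus is a per-residue majority vote over the predictions returned by the individual servers. The sketch below assumes each prediction has first been converted by hand to a string of equal length in which "M" marks a membrane residue and "-" any other residue; this encoding, the function name, and the three prediction strings are hypothetical and are not actual server output.

```python
def consensus_mask(predictions, min_votes=None):
    """Per-residue vote over equal-length prediction strings ('M' or '-').

    By default a residue is called membrane when a strict majority agree.
    """
    lengths = {len(p) for p in predictions}
    if len(lengths) != 1:
        raise ValueError("all predictions must cover the same sequence length")
    if min_votes is None:
        min_votes = len(predictions) // 2 + 1
    length = lengths.pop()
    return "".join(
        "M" if sum(p[i] == "M" for p in predictions) >= min_votes else "-"
        for i in range(length)
    )


# Invented per-residue calls for a 12-residue stretch (not real server output).
calls = {
    "DAS":     "--MMMMMM----",
    "TMHMM":   "---MMMMM----",
    "TopPred": "---MMMMMM-M-",
}

consensus = consensus_mask(list(calls.values()))
print(consensus)
```

Residues called by only one method, such as the isolated call near the end of the third string, drop out of the consensus.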


Protein Secondary Structure Prediction 


Secondary structure prediction lies at the heart of many tertiary structure prediction meth- 
ods. The use of evolutionary information and patterns of amino acid sequence conservation 
improved prediction accuracy substantially in the 1990s. Recently, the evolutionary informa- 
tion gathered by improved search tools and larger databases has boosted prediction accuracy 
to its current 75-85% accuracy for three-state predictions (i.e., predicting α-helices,
β-strands, and random coils).
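
Three-state accuracy is simply the percentage of residues whose predicted state matches the observed state. A minimal sketch follows; the H/E/C letter code for helix, strand, and coil is a common convention that we assume here, and both strings are invented.

```python
def three_state_accuracy(predicted, observed):
    """Percentage of residues assigned the correct state (H, E, or C)."""
    if len(predicted) != len(observed) or not observed:
        raise ValueError("strings must be non-empty and of equal length")
    correct = sum(p == o for p, o in zip(predicted, observed))
    return 100.0 * correct / len(observed)


# Invented example: two residues at segment ends are mispredicted as coil.
observed  = "CCHHHHHHCCEEEEECC"
predicted = "CCHHHHHCCCEEEECCC"

accuracy = three_state_accuracy(predicted, observed)
print(f"{accuracy:.1f}%")
```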

A number of methods that are available employ neural networks, hidden Markov mod- 
els, or consensus methods. The methods JPRED (Cuff et al. 1998), PHD (Rost and Sander 
1994), PROF (Ouali and King 2000), PSIpred (McGuffin et al. 2000), and SSpro (Baldi et al.
1999) are each implemented within META-PP. PSIpred and JPRED are among the most
accurate predictors of protein secondary structure
(http://cubic.bioc.columbia.edu/eva/sec/common.html).
Protein Tertiary Structure Prediction


The connection between protein structure and function has been well-documented, and, 
consequently, a number of methods are available to predict a protein's function from its 
structure. In addition, even preliminary knowledge of a protein's structure can suggest
interaction hot spots, functional regions, and potential targets of mutagenesis.

Structure prediction methods are often quite complex, and details of their methodology 
are not provided here (for detailed information, see Mount 2001). Briefly, structure predic- 
tion methods tend to belong to one of three classes: homology modeling, fold recognition, 
and de novo/ab initio. These classes are roughly defined by the degree of similarity that the 
method requires between the query sequence and the sequence of a matching structure from 
the Protein Data Bank (PDB) (http://www.rcsb.org). De novo methods generate structural 
models of proteins that cannot be readily modeled from existing structures, whereas fold 
recognition and homology modeling methods generate predictions of structural models 
using existing structures. Whenever possible (generally when a structure within the PDB
shares sequence identity >30% with the query protein), homology modeling methods are 
preferred to fold recognition methods. Likewise, fold recognition methods are preferred to de 
novo methods. 
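
The order of preference described above can be written as a short decision rule. In this sketch, only the 30% identity cutoff comes from the text; the function name and the yes/no flag for whether a fold recognition server returned a usable template are our own simplifications.

```python
def choose_modeling_class(best_pdb_identity_percent, fold_template_found):
    """Pick the preferred class of structure prediction method.

    best_pdb_identity_percent: sequence identity to the closest PDB structure,
    or None if a sequence search found no match at all.
    """
    if best_pdb_identity_percent is not None and best_pdb_identity_percent > 30.0:
        return "homology modeling"
    if fold_template_found:
        return "fold recognition"
    return "de novo"


print(choose_modeling_class(45.0, True))
print(choose_modeling_class(18.0, True))
print(choose_modeling_class(None, False))
```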

More specifically, fold recognition and homology modeling methods find a homologous 
template structure within the database of experimentally solved structures, determine the 
equivalent positions between the probe sequence and template structure, and then place the 
probe sequence onto the template's backbone to generate a model. Consequently, homology
modeling and fold recognition methods typically return


* a coordinate file containing the actual three-dimensional model,

* an accession code identifying the template structure from which the model was generated,

* an alignment between the probe sequence and the template showing the equivalent
positions, and

* a score describing how well the probe sequence matched the template structure.


If a simple BLAST search of a query protein against the PDB database finds a reliable
homolog, then the user can submit a sequence to either ModBase
(http://pipe.rockefeller.edu/modbase-cgi/index.cgi) (Pieper et al. 2002) or Swiss-Model
(http://www.expasy.ch/swissmod/SWISS-MODEL.html) (Peitsch 1996). Both of these services
keep a catalog of generated structural models.

If no obvious homolog emerges, the ProSal server can be used to submit the amino acid 
sequence to the Structure Prediction Meta Server, which is similar in nature to META-PP; it 
takes a query sequence and submits it to a variety of fold recognition servers. Unlike
META-PP, the Structure Prediction Meta Server collates the top ten structural template predictions
from each server into a single page, translating the server results into a uniform format. For 
each server, the first line of condensed, reformatted output shows the server's name and pro- 
vides links to the original raw output, the CASP format output, and the PDB model output. 
Next, the template structure, sequence-structure compatibility score, and alignment between 
the template structure and probe are given for the ten best predictions ranked by sequence- 
structure compatibility score. Two additional columns, FSSP and SCOP, contain identifiers 
describing the fold-type of the template structure. Fold-type and information that can be 
derived from fold-type are discussed below. It is important to note that each server has a dif- 
ferent scoring system. For example, scores from GenThreader range from 0 to 1, whereas scores 
from BIOINBGU range from -∞ to ∞. The raw output from each server describes how to
interpret that server's scores. Predictions with "better" compatibility scores are more reliable. 
Several attempts have been made to benchmark structure prediction methods, including
CASP (http://predictioncenter.llnl.gov/), CAFASP (http://bioinfo.pl/cafasp/), and LiveBench
(http://bioinfo.pl/LiveBench/). Readers are encouraged to see Fischer et al. (1999), Bujnicki 
et al. (2001), and Moult et al. (2001). 


Using Structural Classification Databases to Learn about Protein Function 


Proteins of similar structure often share functional attributes such as active sites, interaction 
patches, metal-binding sites, and interacting partners. Consequently, it is valuable to know not 
only which protein is most similar in structure to the protein of interest, but also the family of 
structures similar to a protein. Furthermore, if several fold recognition methods predict
proteins that are classified as being structurally similar, the user gains confidence in the prediction.
For more details on methods used to classify structures into families, see Mount (2001). The
three databases most commonly used to determine structural relationships among proteins are
SCOP (http://scop.berkeley.edu), CATH (http://www.biochem.ucl.ac.uk/bsm/cath/_new/index.html),
and FSSP (http://www2.ebi.ac.uk/dali/fssp/fssp.html), which classify proteins by
methods that are purely manual, a combination of manual and automated, and purely
automated, respectively.
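
Because the scores of different servers are not on a common scale, agreement between servers is more easily judged by counting how many of them place their top template in the same fold class. The sketch below tallies such votes; the server labels and fold identifiers are placeholders we invented, and in practice the identifiers would be the SCOP or FSSP codes copied from each server's top-ranked prediction.

```python
from collections import Counter


def fold_consensus(top_fold_by_server):
    """Return the most frequently predicted fold and the fraction of servers agreeing."""
    if not top_fold_by_server:
        raise ValueError("no predictions supplied")
    counts = Counter(top_fold_by_server.values())
    fold, votes = counts.most_common(1)[0]
    return fold, votes / len(top_fold_by_server)


# Placeholder server labels and fold identifiers (hypothetical values).
top_hits = {
    "server_a": "a.4",
    "server_b": "a.4",
    "server_c": "b.34",
    "server_d": "a.4",
}

fold, agreement = fold_consensus(top_hits)
print(fold, agreement)
```

A high level of agreement does not prove the fold assignment, but a prediction supported by only one server deserves closer inspection of that server's raw score.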

Accessing the data in these databases is simple: Accession codes for SCOP and FSSP are
given within the Structure Prediction Meta Server output and can be used to directly access a
structural family. The PDB code of the template structure can also be used to find structural
neighbors and hints about a protein's function. For example, entering the accession code 1mbe
into the CATH database (http://www.biochem.ucl.ac.uk/bsm/cath/_new/index.html) returns
the information that there is one domain matching that PDB code and that its CATH
classification identifier is 1.10.10.60. Clicking on the hyperlink associated with the CATH code returns
information about the structure: In this case, 1mbe is a member of a protein family of mainly
α-helical proteins, in which the α-helices are arranged as an orthogonal bundle and are
homeodomain-like. Consequently, proteins in this family bind DNA. Further analysis would reveal
the parts of the protein that directly contact the DNA and are thus functionally important.
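
A CATH identifier such as 1.10.10.60 encodes four nested levels of the classification: class, architecture, topology, and homologous superfamily. The sketch below splits an identifier into these levels; the function name and the returned dictionary layout are our own, and the small table of class names covers only the four main classes.

```python
CATH_LEVELS = ("class", "architecture", "topology", "homologous_superfamily")
CATH_CLASSES = {
    1: "mainly alpha",
    2: "mainly beta",
    3: "alpha beta",
    4: "few secondary structures",
}


def parse_cath_code(code):
    """Split a four-part CATH identifier into its cumulative hierarchy levels."""
    parts = code.strip().split(".")
    if len(parts) != len(CATH_LEVELS) or not all(p.isdigit() for p in parts):
        raise ValueError(f"not a four-part CATH identifier: {code!r}")
    levels = {
        level: ".".join(parts[: i + 1]) for i, level in enumerate(CATH_LEVELS)
    }
    levels["class_name"] = CATH_CLASSES.get(int(parts[0]), "unknown")
    return levels


info = parse_cath_code("1.10.10.60")
for key, value in info.items():
    print(f"{key}: {value}")
```

Two domains that share the first three numbers have the same topology (fold) even if they belong to different homologous superfamilies, which is often the level at which functional hints are sought.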


Sequence Signals That Reveal Function 


In addition to inferring the function of a target protein from its sequence and structural 
homologs, many signals exist within the protein itself that are indicative of its function. These 
signals may help in the interpretation of proteomics experiments, as well as in the validation 
of results from these experiments. 


Linking Entrez with Medline 


In Protocol 1, the example query protein is matched to several accession numbers, including 
17569445 and SEM5_CAEEL. These two accession numbers are, respectively, the identifiers 
for the NCBI and SwissProt databases. These identifiers are useful to know because precomputed
information is often accessed using these identifiers (e.g., access to protein models in
ModBase). References to articles potentially describing the sequencing or function of this
protein can be accessed via these identification names/numbers at the NCBI
(http://www.ncbi.nlm.nih.gov) site or the SwissProt (www.expasy.ch) database. SwissProt is often
particularly useful for quickly assimilating information about a protein, as the curated nature of the
database implies that the vast amounts of raw data associated with a protein have been con- 
veniently summarized. 
In general, a significant problem associated with many proteomics experiments is the
conversion of the raw proteomics data into a list of proteins named and annotated in a stan- 
dard format. Associating each protein with a consistently chosen accession number can ease 
this process and the resulting interpretation of the experiment. 


Annotated domain databases 


Some sequence signals are found within extended domains, such as those identified earlier 
within the Prodom, Pfam, and BLOCKS databases. Domain databases are a better way to get 
at the function of a protein than the annotation given in full-length protein sequence data- 
bases. Often, domain databases are hand-curated and empirically seem to suffer less from 
problems of misannotation sometimes associated with noncurated databases. Furthermore, 
the sequence patterns are frequently quite specific, and distinct subtypes of homologous 
domains may be distinguishable. 

As an example of protein domain identification, consider entering the Sem-5 protein
sequence (used as an example in Protocol 1) in the sequence box at
http://pfam.wustl.edu/hmmsearch.shtml. This search reveals the domain composition of the
protein: two SH3 domains and one SH2 domain. The functions of each domain can often be found
by clicking on the hyperlinks provided. Other similar information is available at
http://smart.embl-heidelberg.de/ and at http://www.blocks.fhcrc.org/blocks_search.html.


Sequence pattern resources 


In addition to extended sequence patterns, regular expressions have been derived to describe 
common motifs indicative of protein functional sites. These simple motifs can provide con- 
siderable support for proteomics experiments, by supplying computational evidence for pro- 
tease sites (such as might be useful in Chapters 7 and 8), glycosylation sites, and phosphory- 
lation sites (see Chapter 9). 

Motifs are commonly given in a regular expression notation. For example, the motif
[LIVM]-[ST]-A-[STAG]-H-C can be used to identify serine/threonine proteases. In this
notation, x denotes a position where any amino acid is accepted. Often, more than one
specific amino acid type may be acceptable for a given position in a motif. These are indicated
by listing the acceptable amino acids for a given position between square brackets [ ]. For
example, [ALT] stands for Ala or Leu or Thr. Ambiguities may also be indicated by listing
within curly brackets { } the amino acids that are unacceptable at a given position. In this
format, {AM} stands for any amino acid except Ala and Met. Each element in a pattern is
separated from its neighbor by a hyphen (-). A repeated pattern element is indicated by
following an element with a numerical value or a numerical range between parentheses, such
as x(3) corresponding to x-x-x and x(2,4) corresponding to x-x or x-x-x or x-x-x-x.

Putting these rules together, the pattern [AC]-x-V-x(4)-{ED} corresponds to [Ala
or Cys]-any residue-Val-any residue-any residue-any residue-any
residue-[any residue except Glu or Asp]. PROSITE (Falquet et al. 2002), PRINTS
(Attwood et al. 2002), and Emotifs (Huang and Brutlag 2001) are all examples of short
sequence pattern databases. Each of these services can be accessed via either META-PP or
ProSal. In addition, META-PP submits sequences to ChloroP (Emanuelsson et al. 1999),
NetOglyc (Gupta et al. 1999), NetPhos (Blom et al. 1999), and NetPico (Blom et al. 1996),
which search for sequence determinants of chloroplast transit, O-glycosylation,
phosphorylation, and picornaviral protease cleavage sites, respectively.
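
The motif notation described above maps almost directly onto the regular expressions found in most programming languages. The following is a minimal sketch, not part of any of the services named here, of how such a pattern might be translated and applied to a sequence in Python. The function name prosite_to_regex and the test sequence are illustrative assumptions, and only the elements described in the text (single residues, x, [ ], { }, and repeat counts) are handled.

```python
import re

# One pattern element: x, a residue, [allowed], or {forbidden},
# optionally followed by a repeat count such as (3) or (2,4).
_ELEMENT = re.compile(r"(x|[A-Z]|\[[A-Z]+\]|\{[A-Z]+\})(?:\((\d+)(?:,(\d+))?\))?")


def prosite_to_regex(pattern: str) -> str:
    """Translate a motif such as [AC]-x-V-x(4)-{ED} into a Python regex."""
    parts = []
    for element in pattern.strip().split("-"):
        match = _ELEMENT.fullmatch(element.strip())
        if match is None:
            raise ValueError(f"unsupported pattern element: {element!r}")
        core, low, high = match.groups()
        if core == "x":                      # any amino acid
            regex = "."
        elif core.startswith("{"):           # any amino acid except those listed
            regex = "[^" + core[1:-1] + "]"
        else:                                # single residue or [allowed set]
            regex = core
        if low is not None:                  # repeat count or range
            regex += "{" + low + ("," + high if high else "") + "}"
        parts.append(regex)
    return "".join(parts)


# Hypothetical sequence used only to exercise the pattern.
regex = prosite_to_regex("[AC]-x-V-x(4)-{ED}")
hit = re.search(regex, "GGAKVQQQQLP")
print(regex, hit.group() if hit else None)
```

A sequence in which the final position is Glu or Asp would not be reported, because the {ED} element becomes the negated character class [^ED].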
SECTION 3: Matching Proteins to Pathways and Identifying the Cellular
Roles of Proteins 


BEYOND FINDING THE BIOCHEMICAL FUNCTION of a protein, it is also often possible to
discover clues about a protein's cellular role. The clues are derived in part from the protein
sequence, in part from the relationship of this sequence to those of other proteins in the same 
protein family, and in part from additional data, such as results of gene expression experi- 
ments, often available from high-throughput functional genomics experiments. 

In proteomics experiments, these types of computational analyses of cellular function 
may help in interpreting results (e.g., when uncharacterized proteins are observed in an 
experiment, these techniques may suggest functions for the proteins) as well as validating 
results (e.g., when protein interactions are measured experimentally, these techniques may 
provide additional computational validation of the observed interactions). Section 3 pro- 
vides protocols for finding the cellular roles of proteins by computationally identifying a pro- 
tein’s specific pathway (or pathways) and its interaction partners. For experimental determi- 
nation of cellular roles, see Chapter 3 for information about subcellular localization and 
Chapter 10 for information about mapping protein interaction partners. 


Discovering Functionally Linked Proteins 
Three general computational approaches can be used to discover or validate protein interac- 
tions, complexes, and pathways: 
* Discovery of functionally linked proteins by observation of gene fusion events (Protocol 3). 


* Discovery of pathways and functionally linked proteins via the identification of operons 
(Protocol 4).


* Discovery of functionally linked proteins based on coinheritance of proteins across many 
organisms (Protocol 5). 
The following are two additional approaches using data other than sequence data: 


* Identification of interaction partners from databases of experimentally determined inter- 
actions. 


* Discovery of functionally linked proteins from protein or gene expression patterns 
(Protocol 6). 


In the first approach, illustrated in Figure 11.3, two genes in one organism can be inferred 
to be functionally linked due to the discovery of a third gene, which is the fusion of the two 


[Figure 11.3 schematic. Labels: P. falciparum topoisomerase II; E. coli gyrase B; E. coli gyrase A]


FIGURE 11.3. A functional link can often be inferred between two separate proteins after finding a third 
gene that is a fusion of the first two (Enright et al. 1999; Marcotte et al. 1999). Here, gyrase A and gyrase B
of E. coli are each homologous to a distinct region of topoisomerase II from Plasmodium falciparum, indi- 
cated by shaded boxes, suggesting (correctly) a possible functional relationship between gyrA and gyrB. 
FIGURE 11.4. Computational reconstruction of bacterial operons by two approaches. (a) The distances
between genes in the same operon (light blue) tend to be shorter than distances between adjacent genes in
different operons (dark blue). In genomes with gene densities similar to those of E. coli K12, genes
separated by less than 40 nucleotides are more likely to be in the same operon (Salgado et al. 2000); genes
separated by longer intergenic distances are more likely to be in different operons. The boundaries of an
operon can be estimated by finding the first large intergenic distance or gene encoded in the opposite
direction. (b) An operon is reconstructed by searching for homologs of neighboring genes conserved as
neighbors in other genomes (similar colors) (Bork et al. 1998; Dandekar et al. 1998; Overbeek et al. 1999).
FIGURE 11.5. Proteins in the same pathway are often coinherited across organisms. This trend can be
exploited by calculating the phylogenetic profile of a protein, indicated as a row in the figure, recording in
which genomes (abbreviated on column headings) a protein has a detectable homolog (colored boxes, where
darker shading indicates stronger sequence similarity) or lacks detectable homologs (white boxes). Here,
proteins with similar phylogenetic profiles can be seen to have similar function. (Adapted, with permission,
from Marcotte 2000.)
separate genes (Enright et al. 1999; Marcotte et al. 1999). Empirically, it appears that gene
fusions typically only occur between genes of related function; consequently, this method can 
be used to rapidly suggest functional partners for a gene. 

The second approach relies upon the tendency for bacterial genes to be organized into 
operons. Although many operons are known for some organisms (e.g., for known operons of 
E. coli, see the RegulonDB database at http://www.cifn.unam.mx/Computational_Genomics/
regulondb/), many more are uncharacterized. Two computational protocols, illustrated in 
Figure 11.4, are given for reconstructing operons from a genome sequence. Both of these 
methods ignore the identification of promoters and regulatory sequences, and instead exploit 
other properties of operons. One protocol exploits short intergenic distances characteristic of 
operons (Salgado et al. 2000), and the second protocol exploits the tendency for genes in 
operons to be conserved as neighbors in other genomes (Tamames et al. 1997; Dandekar et 
al. 1998; Overbeek et al. 1999).

The third approach for identifying pathway and interaction partners exploits the ten- 
dency for proteins in the same pathway to be coinherited across organisms (Pellegrini et al. 
1999; Huynen et al. 2000), as illustrated in Figure 11.5. A protein sequence is compared with 
sequences of all known proteins from available fully sequenced genomes. From this analysis, 
a phylogenetic profile is calculated, describing which genomes contain homologs of the query 
gene. This phylogenetic profile can then be compared to those of all other genes in a genome 
to suggest functional partners of the query gene. 

Finally, data other than genomic data carry information about gene function and path- 
ways. Protocols and Internet URLs are presented for the computational analysis of existing 
protein interaction data, gene disruption phenotypes, subcellular localization, and gene and 
protein expression data. Additional information on these topics is included in Chapters 3, 8, 
and 10. 
PROTOCOL 3


Searching for Functionally Linked Proteins Using Gene
Fusions: A Domain-based Search


METHOD 


THIS METHOD PERMITS THE IDENTIFICATION of proteins that are functionally linked to a query
protein. Although this process is best automated, the search can be performed manually if 
desired. The search can be performed by comparing either protein domain structures or pro- 
tein sequences. This protocol presents the domain-based approach, because it is easiest to 
implement with existing Web-based tools. 

This version of the method takes advantage of a protein domain database, such as those 
discussed in Section 2 of this chapter. The ProDom database is used as an example here. It 
illustrates the domain structure of a given protein and allows the display of all other proteins 
sharing a given domain (Corpet et al. 2000). ProDom can be searched directly at
http://prodes.toulouse.inra.fr/prodom/doc/prodom.html or accessed after first identifying a
sequence in a participating protein sequence database, such as SwissProt
(http://ca.expasy.org/sprot/). SwissProt provides direct hyperlinks between each protein and
the corresponding domain entries in ProDom.

Certain domains (called promiscuous domains) are known to be of limited value in this
process (Marcotte et al. 1999), as they link large numbers of different proteins together. The 
SH3 domains, ATP-binding cassettes, ATPase domain, and tyrosine kinase domains are 
examples of promiscuous domains. Following these promiscuous domain links in Step 4 of 
the procedure will generally produce spurious linkages, indicated by finding an exceptional- 
ly large number of links to distinct proteins inferred from a single fusion protein. 


INPUTS 


For this example, functional links will be identified for the E. coli gyrase A protein.


1. Find the entry for the query protein (called, generically, protein A) in the domain
database.


For example, the entry for E. coli gyrase A is found at
http://protein.toulouse.inra.fr/cgi-bin/ReqProdomII.pl?acc_seq0=P09097.


2. Find all of the proteins sharing a homologous domain with the query protein. In
ProDom, click on the icon to the left of the gene name.


3. Search for proteins with additional domains beyond those of the query protein. 
Each of these proteins will be evaluated for its potential as a fusion protein (AB) linking the
query protein (A) to another protein (B).


4. For each protein with an additional domain, find all of the proteins sharing homologous 
domains. Again, in ProDom, click on the icon to the left of the gene name. 


For example, in ProDom Release 2001.3, 414 genes share domains with the Plasmodium
falciparum topoisomerase II. Search this list for genes from the same organism as the query protein.
Here, of the 414 genes, only 4 are from E. coli. Eliminate all genes from this list that are
homologous to the query protein (i.e., that share a domain with the query protein). The remaining
genes, in this case, the homologous genes ParE and gyrase B, are predicted to be functionally
linked to the query protein. In this case, one of these predictions is correct (gyrase B), and one
is incorrect (ParE), illustrating that the method does not distinguish among paralogs.


In general, proteins linked by this method have a better than random chance of being in the
same cellular pathway or of being interaction partners (Enright et al. 1999; Marcotte et al.
1999).
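

Because this search is best automated, the steps above can be sketched in a few lines of Python. This is an illustration under stated assumptions rather than the method as implemented by any particular server. The record layout (protein name mapped to an organism and a set of domain identifiers), the function name fusion_links, and the domain labels GYRA_LIKE and GYRB_LIKE are all hypothetical. In practice the domain assignments would be taken from a domain database such as ProDom.

```python
from typing import Dict, FrozenSet, Set, Tuple

# Hypothetical record layout: protein name -> (organism, domain identifiers).
Records = Dict[str, Tuple[str, Set[str]]]


def fusion_links(query: str, proteins: Records,
                 promiscuous: FrozenSet[str] = frozenset()) -> Dict[str, Set[str]]:
    """Return {predicted partner: fusion proteins supporting the link}."""
    organism, query_domains = proteins[query]
    links: Dict[str, Set[str]] = {}
    for fusion, (_, domains) in proteins.items():
        # Steps 1-2: keep proteins sharing a domain with the query.
        if fusion == query or not (domains & query_domains):
            continue
        # Step 3: keep only those with additional, non-promiscuous domains.
        extra = domains - query_domains - promiscuous
        if not extra:
            continue
        # Step 4: follow the extra domains to proteins of the query's organism,
        # discarding homologs of the query itself.
        for partner, (partner_org, partner_domains) in proteins.items():
            if partner == query or partner_org != organism:
                continue
            if partner_domains & query_domains:
                continue
            if partner_domains & extra:
                links.setdefault(partner, set()).add(fusion)
    return links


# Toy records with made-up domain labels.
toy = {
    "gyrA": ("E. coli", {"GYRA_LIKE"}),
    "parC": ("E. coli", {"GYRA_LIKE"}),
    "gyrB": ("E. coli", {"GYRB_LIKE"}),
    "parE": ("E. coli", {"GYRB_LIKE"}),
    "topoII": ("P. falciparum", {"GYRA_LIKE", "GYRB_LIKE"}),
}
print(sorted(fusion_links("gyrA", toy)))
```

Passing a set of promiscuous domain identifiers suppresses links that depend only on those domains, which is one simple way to avoid the spurious linkages described earlier in this protocol.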


PROTOCOL 4


Rapid Identification of Operons 


METHODS 


THIS PROTOCOL PERMITS THE DISCOVERY OF THE OPERON to which a query gene belongs. Two
methods are provided (see Figure 11.4). Method A uses short intergenic distances to identi- 
fy a gene's operon. For more details about this method and the choice of thresholds, see 
Figure 4 of Salgado et al. (2000), from which the thresholds are derived. Method B uses the 
WIT2 Web Site to identify groups of genes that are found clustered together in a variety of 
organisms. This conservation of gene neighbors is characteristic of operons. An alternative 
approach for identifying conserved operon structure is to use the STRING Web server (Snel 
et al. 2000) at http://www.bork.embl-heidelberg.de/STRING/. 


Method A: Identifying Operons by Short Intergenic Distances 




1. Go to the Entrez bacterial genome Internet site
http://www.ncbi.nlm.nih.gov/PMGifs/Genomes/eub_g.html and find the genome of interest.
Follow the hyperlink to the genome entry.


For example, E. coli K12 is hyperlinked via the file name NC_000913 to
http://www.ncbi.nlm.nih.gov/cgi-bin/Entrez/framik?db=Genome&gi=115.


2. Calculate the gene density for this organism as the number of total bases in the genome 
divided by the number of protein-coding genes. The values for the total bases and the 
protein-coding genes are located near the top of the Web page. 

The thresholds given below are calculated for E. coli K12, which has a gene density of ~1084
nucleotides/gene. Note that genomes with very different gene densities may require
recalibration of the thresholds.


3. Select the protein-coding gene feature table. 
For E. coli K12, this is found at
http://www.ncbi.nlm.nih.gov/cgi-bin/Entrez/altik?gi=115&db=Genome.


4. Find the gene of interest in the table. On the basis of the gene's location in the chromo- 
some, given at the left of the table, calculate the number of nucleotides separating the 
gene from the genes on either side of it. 

A neighboring gene is likely to be in the same operon if it is encoded on the same strand of the 
DNA, and it is nearer than ~40 base pairs. 


5. Repeat the analysis for each gene included in the operon until the apparent boundaries 
of the operon are discovered. 
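

Steps 2 through 5 of Method A lend themselves to a short script once the gene coordinates have been copied from the feature table. The sketch below is one possible implementation. The Gene record, the function names, and the toy coordinates are assumptions made for illustration, and the 40-nucleotide default is the E. coli K12 threshold quoted above, which may need recalibration for other genomes.

```python
from typing import List, NamedTuple


class Gene(NamedTuple):
    name: str
    start: int   # first nucleotide of the gene
    end: int     # last nucleotide of the gene
    strand: str  # "+" or "-"


def gene_density(total_bases: int, protein_coding_genes: int) -> float:
    """Step 2: nucleotides per protein-coding gene."""
    return total_bases / protein_coding_genes


def predict_operon(genes: List[Gene], query: str, max_gap: int = 40) -> List[str]:
    """Steps 4-5: extend from the query gene while neighbors are on the same
    strand and separated by fewer than max_gap nucleotides."""
    ordered = sorted(genes, key=lambda g: g.start)
    index = next(i for i, g in enumerate(ordered) if g.name == query)

    def same_operon(left: Gene, right: Gene) -> bool:
        gap = right.start - left.end - 1
        return left.strand == right.strand and gap < max_gap

    first = index
    while first > 0 and same_operon(ordered[first - 1], ordered[first]):
        first -= 1
    last = index
    while last < len(ordered) - 1 and same_operon(ordered[last], ordered[last + 1]):
        last += 1
    return [g.name for g in ordered[first:last + 1]]


# Made-up coordinates, used only to exercise the function.
toy_genes = [
    Gene("a", 100, 400, "+"),
    Gene("b", 420, 900, "+"),
    Gene("c", 910, 1500, "+"),
    Gene("d", 1700, 2200, "+"),
    Gene("e", 2210, 2500, "-"),
]
print(predict_operon(toy_genes, "b"))
```

In the toy data, gene d is excluded from the a-b-c group by a long intergenic distance, and gene e is excluded from d by its opposite orientation, which mirrors the two boundary signals used in the manual procedure.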
Method B: Identifying Operons by Conservation of Gene Neighbors


1. Go to the WIT database Internet site http://wit.mcs.anl.gov/WIT2/.
2. From the menu at the top of the Web page, select General Search. 


3. Find the gene of interest via the search menu. For example, to search for E. coli ubiquinol 
oxidase: 


* Select: All ORFs in the Select Data Categories to Search frame
Bacteria—Escherichia coli in the Select Specific Organism to Search frame 
* Input: ubiquinol oxidase or cyoA, searching for an exact match to the term. 
* Click on the Search button.
4. Follow the links to the gene of interest; in this case to ORF 4472. 


5. On the frame at the left side of the Web page, select Preserved Operons to get a series of 
tables showing which neighbors of the gene of interest are conserved as neighbors in 
another genome. The relationships discovered are summarized at the bottom of the page. 
In the example of cyoA, the four neighboring genes, cyoB to cyoE, are predicted correctly
to be in the same operon as cyoA. 


6. If desired, the precise locations and orientations of the genes can be examined from the 
Contig Region link from the main Web page for the query gene (i.e., from the same frame 
where Preserved Operons is located). 


PROTOCOL 5 


Finding Functionally Linked Proteins by Comparing 
Phylogenetic Profiles 


THE PURPOSE OF THIS PROTOCOL is to identify proteins functionally linked to a query protein.
Two methods are provided. Method A uses the Protein Link Explorer (PLEX;
http://apropos.icmb.utexas.edu/function-prediction.html). PLEX will perform a BLASTp search of
the query protein against a database of fully sequenced genomes. PLEX reports the proteins 
with statistically similar sequences, the organisms in which these proteins are found, and the 
resulting phylogenetic profile of the query protein. From this profile, the investigator can 
rapidly determine the breadth of organisms containing the query protein, and by extension, 
the breadth of organisms containing the system in which the query protein operates. 


Method B is an alternative approach for identifying proteins with similar inheritance
patterns using the COGs database (http://www.ncbi.nlm.nih.gov/COG/).


METHODS


Method A: PLEX


1. Paste a protein sequence into PLEX.


2. Choose the genome from which the query protein is derived. If the genome is not
present, choose one containing a close homolog of the query protein. Choose a threshold for
the links to be reported. PLEX will then compare the phylogenetic profile of the query
protein with the phylogenetic profile of every protein encoded by the chosen genome.


PLEX returns those proteins whose phylogenetic profiles match above the threshold
selected. These proteins have the most similar inheritance patterns to the query protein and are
therefore candidates to operate in the same cellular pathway as the query protein. A
quantitative score measuring the degree of coinheritance ranks each hit. The confidence with
which each protein in the list is linked to the query protein is also reported, based upon
calibration of scores against known pathways.


Method B: COG


1. Identify the protein of interest in COG. Note the phylogenetic distribution of the
corresponding COG category.


2. Search for other COG categories sharing this distribution using the phylogenetic patterns
search feature of the COGs database, or by clicking on the phylogenetic pattern itself,
which is hyperlinked to a list of COG categories with similar inheritance patterns.
Proteins in these COG categories will be strongly coinherited with the query protein, and
are therefore predicted to function in the same pathway.
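

The comparison of phylogenetic profiles underlying both methods can be illustrated with a small Python sketch. Each profile is represented as a tuple of 1s and 0s recording the presence or absence of a homolog in each genome. The scoring used here, a simple count of mismatched genomes (Hamming distance), is an assumption chosen for clarity and is not necessarily the score that PLEX or the COGs database computes. The protein names and profiles are invented.

```python
from typing import Dict, List, Sequence, Tuple


def profile_distance(a: Sequence[int], b: Sequence[int]) -> int:
    """Number of genomes in which one protein has a homolog and the other lacks it."""
    if len(a) != len(b):
        raise ValueError("profiles must cover the same genomes")
    return sum(x != y for x, y in zip(a, b))


def rank_by_profile(query: str,
                    profiles: Dict[str, Sequence[int]]) -> List[Tuple[int, str]]:
    """Rank all other proteins by profile distance to the query (closest first)."""
    reference = profiles[query]
    scored = [(profile_distance(reference, profile), name)
              for name, profile in profiles.items() if name != query]
    return sorted(scored)


# Invented profiles over six genomes (1 = homolog detected, 0 = not detected).
toy_profiles = {
    "P1": (1, 0, 1, 1, 0, 1),
    "P2": (1, 0, 1, 1, 0, 1),
    "P3": (1, 0, 1, 0, 0, 1),
    "P4": (0, 1, 0, 0, 1, 0),
    "P5": (1, 1, 1, 1, 1, 1),
}
for distance, name in rank_by_profile("P1", toy_profiles):
    print(name, distance)
```

Proteins at or near distance zero share the inheritance pattern of the query and would be the candidates for a shared pathway. Note that a protein present in every genome, such as P5 here, carries little information, because its profile is close to many others.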
Protein Interaction Databases


Protein interaction data are increasingly available for human proteins as well as for proteins 
of several model organisms, especially yeast, Caenorhabditis elegans, and the bacterium
Helicobacter pylori. These interaction data can consist of protein-protein, protein-nucleic 
acid, and protein-small molecule interactions, which are collected in a number of databases 
that allow a protein of interest to be visualized in the midst of a network of interaction part- 
ners. 

Protein-protein interaction data are mainly derived from genome-wide, high-through- 
put yeast two-hybrid experiments in which interactions are measured between all gene pairs 
in a genome. More than 4000 unique protein interactions were observed between yeast pro- 
teins in three large-scale experiments (Ito et al. 2000, 2001; Uetz et al. 2000). More recently, 
complexes of yeast proteins have been isolated, and the protein constituents have been iden- 
tified by mass spectrometric approaches (Rain et al. 2001; Gavin et al. 2002; Ho et al. 2002), 
thereby identifying hundreds of additional interactions among yeast proteins. A test similar 
to the high-throughput yeast two-hybrid experiments, performed on bacterial proteins, iden- 
tified more than 1200 interactions between proteins of the human gastric pathogen H. pylori 
(Rain et al. 2001). 

In addition to large-scale experimental approaches, a number of groups have been 
attempting to cull the previously measured protein-protein interactions from the biological 
literature (Humphreys et al. 2000; Proux et al. 2000; Thomas et al. 2000; Blaschke et al. 2001;
Marcotte et al. 2001). This systematic collection of protein interaction data provides neces- 
sary checks on the quality of the large-scale interaction data. Large-scale protein interaction 
data have varied widely in accuracy (von Mering et al. 2002), but many interactions in the
databases have been observed by multiple experimental methods, providing some measure of 
confidence in the correctness of these interactions. 

Protein interaction databases that combine the interactions from these large-scale
screens with the interactions extracted from the literature include the BIND database and the
Database of Interacting Proteins (DIP). As of this writing, DIP (http://dip.doe-mbi.ucla.edu/)
contains ~17,500 interactions between ~6800 proteins, a majority of which are from yeast and
the bacterium H. pylori (Xenarios et al. 2002). The BIND database (http://www.bind.ca/)
includes >6000 interactions, primarily focusing on yeast proteins.
These databases provide practical tools for identifying known interaction partners of a pro- 
tein of interest. 

The interaction partners for yeast proteins, and to a lesser extent bacterial and human 
proteins, can often be found in the interaction databases listed in the panel below. For
proteins from other organisms, it is often possible to identify interologs (Matthews et al. 2001),
which are proteins presumed to interact on the basis of the known interactions of the
proteins' orthologs in another genome, as illustrated in Figure 11.6. To search for interologs,
first identify the yeast (or H. pylori or human) ortholog of the protein, and then
search for interactions for this yeast ortholog. Any interaction partners of the ortholog,
themselves yeast proteins, would then be subjected to the reverse process of identifying their
orthologs in the genome of interest.
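
The interolog search just described amounts to two table lookups, which can be sketched as follows. The function name predict_interologs, the ortholog table, and the interaction list are hypothetical. In practice the ortholog assignments would come from a sequence comparison and the interactions from one of the databases listed in the panel below.

```python
from typing import Dict, Iterable, Set, Tuple


def predict_interologs(protein: str,
                       to_reference: Dict[str, str],
                       reference_interactions: Iterable[Tuple[str, str]]) -> Set[str]:
    """Predict partners of `protein` from interactions of its reference ortholog.

    to_reference maps each protein of the genome of interest to its ortholog in
    the reference organism (e.g., yeast). Proteins without an ortholog are absent.
    """
    reference = to_reference.get(protein)
    if reference is None:
        return set()

    # Reverse lookup: reference protein -> proteins in the genome of interest.
    from_reference: Dict[str, Set[str]] = {}
    for target, ref in to_reference.items():
        from_reference.setdefault(ref, set()).add(target)

    partners: Set[str] = set()
    for a, b in reference_interactions:
        if a == reference:
            other = b
        elif b == reference:
            other = a
        else:
            continue
        # Map the reference partner back to the genome of interest.
        partners |= from_reference.get(other, set())
    return partners


# Invented identifiers, for illustration only.
orthologs = {"tgtA": "YA", "tgtB": "YB", "tgtC": "YC"}
interactions = [("YA", "YB"), ("YC", "YD")]
print(predict_interologs("tgtA", orthologs, interactions))
```

In this toy example, tgtC receives no prediction because the partner of its ortholog has no ortholog in the genome of interest, which is a common outcome when the two organisms are distantly related.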

Protein-DNA interaction data are also accumulating, especially with the advent of large-scale
assays of transcription-factor-binding specificities (Ren et al. 2000; Bulyk et al. 2001;
Iyer et al. 2001). Many protein-DNA interactions and binding specificities are cataloged in
the TRANSFAC database (http://transfac.gbf.de/TRANSFAC/) and the RegulonDB database
(http://www.cifn.unam.mx/Computational_Genomics/regulondb/).
FIGURE 11.6. The experimentally determined protein-protein interactions in one organism can often
reveal protein interactions in a second organism. Here, the orthologs of the yeast interaction partners are
also likely to interact (Matthews et al. 2001).


URLs FOR PROTEIN, DNA, AND SMALL MOLECULE INTERACTION DATABASES


BIND        http://www.bind.ca/
BRITE       http://www.genome.ad.jp/brite/
DIP         http://dip.doe-mbi.ucla.edu/
DPInteract  http://arep.med.harvard.edu/dpinteract/
INTERACT    http://www.bioinf.man.ac.uk/resources/interact.shtml
LIGAND      http://www.genome.ad.jp/dbget/ligand.html
MINT        http://cbm.bio.uniroma2.it/mint/
MIPS        http://mips.gsf.de/proj/yeast/CYGD/db/index.html
REBASE      http://rebase.neb.com/rebase/rebase.html
REGULONDB   http://www.cifn.unam.mx/Computational_Genomics/regulondb/
RELIBASE    http://www.ccdc.cam.ac.uk/prods/relibase
TRANSFAC    http://transfac.gbf.de/TRANSFAC/
TRRD        http://wwwmgs.bionet.nsc.ru/mgs/dbases/trrd4/
PIMRider    http://pim.hybrigenics.com/pimriderlobby/current/PimRiderLobby.htm


Metabolic and Signaling Pathway Databases 


Beyond cataloging protein interactions, a number of groups are creating databases to store
and search known cellular pathways. Such pathway databases include the metabolic pathway
databases KEGG (Kanehisa and Goto 2000) and EcoCyc/MetaCyc (Karp et al. 2000), the
signal transduction database STKE, and the regulatory database TRANSFAC (Wingender et al.
2001). For proteins whose functions have been characterized in metabolism or signal
transduction, these databases allow rapid assessment of the systems in which these proteins
operate.
METABOLIC AND SIGNALING PATHWAY DATABASES


BioCarta        http://www.biocarta.com/
BRITE           http://www.genome.ad.jp/brite/
ECOCYC/METACYC  http://biocyc.org/
EMP             http://emp.mcs.anl.gov/
GeneNet         http://brac.postech.ac.kr/eng/
KEGG            http://www.genome.ad.jp/kegg/
LIGAND          http://www.genome.ad.jp/dbget/ligand.html
MIPS            http://mips.gsf.de/proj/yeast/CYGD/db/index.html
PFBP            http://www.ebi.ac.uk/research/pfbp/
REGULONDB       http://www.cifn.unam.mx/Computational_Genomics/regulondb/
STKE            http://stke.sciencemag.org/
TRRD            http://wwwmgs.bionet.nsc.ru/mgs/dbases/trrd4/
WIT             http://wit.mcs.anl.gov/WIT2/


Protein and Gene Expression Patterns 


Due to the prevalence of publicly available large-scale gene expression data sets, quite a bit
can be learned about a gene and the conditions under which it is active simply by looking up 
the gene’s expression patterns. These data come from a variety of sources, primarily EST 
libraries, Serial Analysis of Gene Expression (SAGE) libraries, and DNA microarray data. 
Currently, more than 11 million sequenced ESTs and SAGE measurements for more than 100 
SAGE libraries are available from the dbEST and SAGEmap databases, respectively. These 
ESTs have been clustered into contigs in the UniGene database, which provides a rapid tool
to identify all of the ESTs derived from a given gene. Hundreds of DNA microarray data sets 
for several different organisms, including yeast and human, are also freely available from the 
Stanford Microarray Database. 


GENE EXPRESSION DATABASES


dbEST* http://www.ncbi.nlm.nih.gov/dbEST/index.html 
SAGEmap http://www.ncbi.nlm.nih.gov/SAGE/ 
UniGene http://www.ncbi.nlm.nih.gov/UniGene/ 


Stanford Microarray Database http://genome-www5.stanford.edu/MicroArray/SMD/


*dbEST can also be searched by BLAST at http://www.ncbi.nlm.nih.gov/BLAST/ and setting the search data- 
base to the appropriate EST set. 


Beyond simply determining expression patterns of genes, these expression data can often 
reveal which genes are coexpressed, providing clues about which genes work together in the 
cell. Searching for coexpressing genes is done most easily with DNA microarray data, which 
provides measurements of gene expression levels for all genes on a microarray, even low- 
abundance genes that might be missed from EST or SAGE library-based approaches. Many 
individual DNA microarray data sets are available in the public domain, and searching for
coexpressing genes requires that these different DNA microarray experiments be analyzed
simultaneously. Analyzing for mRNA coexpression is relatively easy to do for all of the genes in a
genome as follows:


1. Collect multiple microarray experiments performed on the same organism under differing
conditions. Protocol 6 describes one manner in which such data can be gathered.


2. In a spreadsheet, create a table of data with genes along the vertical axis, and each gene's
expression levels forming a row in the table (e.g., for the gene expression level measured
on spotted microarrays, the investigator might choose the log of the mean value of the
normalized ratios of red to green pixels). Write out the data as a tab-delimited text file.


3. Install the freely available Cluster and Tree View software (http://rana.lbl.gov/ 
EisenSoftware.htm). 


4. Load the expression data into Cluster, and cluster the data using one of a number of
different clustering algorithms, such as hierarchical clustering. View the results in Tree View.
Clusters of coexpressing genes can often be identified containing the genes of interest.
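

For a single gene of interest, a quick alternative to full clustering is to rank all other genes by the correlation of their expression profiles with that of the query. The sketch below reads a tab-delimited table of the kind produced in Step 2 and reports genes whose Pearson correlation with the query exceeds a cutoff. It is a simple stand-in and not a reimplementation of the Cluster program. The function names, the 0.8 cutoff, and the toy values are assumptions.

```python
import csv
import io
import math
from typing import Dict, List, Sequence, TextIO, Tuple


def read_expression(handle: TextIO) -> Dict[str, List[float]]:
    """Read a tab-delimited table: header row, then one gene per row."""
    reader = csv.reader(handle, delimiter="\t")
    next(reader)  # skip the header naming the experiments
    return {row[0]: [float(v) for v in row[1:]] for row in reader if row}


def pearson(x: Sequence[float], y: Sequence[float]) -> float:
    """Pearson correlation; returns 0.0 if either profile is constant."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sd_x = math.sqrt(sum((a - mean_x) ** 2 for a in x))
    sd_y = math.sqrt(sum((b - mean_y) ** 2 for b in y))
    if sd_x == 0 or sd_y == 0:
        return 0.0
    return cov / (sd_x * sd_y)


def coexpressed(query: str, table: Dict[str, List[float]],
                min_r: float = 0.8) -> List[Tuple[float, str]]:
    """Genes correlated with the query at or above min_r, best first."""
    reference = table[query]
    hits = [(pearson(reference, values), gene)
            for gene, values in table.items() if gene != query]
    return sorted((hit for hit in hits if hit[0] >= min_r), reverse=True)


# Toy log-ratio table with invented values.
toy_table = io.StringIO(
    "gene\texp1\texp2\texp3\texp4\n"
    "g1\t1\t2\t3\t4\n"
    "g2\t2\t4\t6\t8\n"
    "g3\t4\t3\t2\t1\n"
    "g4\t1\t1\t1\t1\n"
)
table = read_expression(toy_table)
print(coexpressed("g1", table))
```

With only a handful of experiments, high correlations arise easily by chance, so a ranking of this kind becomes informative only when many experiments under differing conditions are combined.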


PROTOCOL 6 


Discovery of Functionally Linked Genes from DNA 
Microarray Data 


METHOD 


THIS PROTOCOL ALLOWS THE INVESTIGATOR TO QUERY the thousands of public DNA microarray
experiments in order to find coexpressed genes. To identify the coexpression partners of
a small set of genes using publicly available data, the expression data can be downloaded and
then analyzed as described above. However, perhaps the easiest approach is to directly query
the Stanford Microarray Database, as described in this protocol. The premise of this method
is that genes that are coexpressed over a sufficiently large number of experiments are likely to
be functionally linked. The precise number of experiments required may vary and is a
function of the complexity of the gene's expression patterns, but for yeast genes (and
increasingly for human genes), sufficient DNA microarray experiments exist in the public domain to
make quite strong inferences.


1. Go to the Stanford Microarray Database public advanced search page: http://genome- 
www5.stanford.edu/cgi-bin/SMD/cluster/QuerySetup.pl. 


2. Select the organism of interest (e.g., "Saccharomyces cerevisiae"). Under Experimenter, 
select All. Press the Data retrieval and analysis button. 


3. Shift-click to highlight all available experiments, and then press the Data retrieval and 
analysis button. 


4. Activate the radio button labeled All genes. Press the Proceed to data filtering button.


5. Press the Retrieve data button. 


The expression levels for all genes will be retrieved from the microarray experiments selected 
earlier. At the time of this writing, more than 440 experiments are available for yeast, so this 
retrieval may be quite a slow process. Although searching all genes is slow, it is required in order 
to find the most coexpressing genes in a genome. 


6. Follow the links to cluster the data. 


The final output will be a hierarchical clustering of all of the genes in the genome by their
expression patterns across all available microarray experiments. The raw data files can be
downloaded and viewed by the programs cited above, Cluster and Tree View. Given
sufficiently complex gene expression data, such as those derived from hundreds of microarray
experiments, genes that are clustered with the gene of interest are very likely to operate in the
same cellular pathway.
Spatial Expression Patterns


Spatial expression patterns of genes can be quite difficult to collect on a large scale. However, 
these patterns have been measured for several thousand genes in Xenopus embryos 
(Gawantka et al. 1998; Pollet et al. 2000). Screening with randomly chosen cDNAs, the 
expression patterns of hundreds of unique mRNAs have been detected by whole-mount in 
situ hybridization. The expression patterns were photographed and stored in a database 
(AxelDB). Comparison of the spatial expression patterns of these genes reveals that genes
sharing similar spatial expression patterns often have related functional roles. Thus, if a gene 
of interest occurs in the set of tested genes, searching AxelDB for similar spatial expression 
patterns may potentially identify functionally linked genes. AxelDB can be searched at 
http://www.dkfz-heidelberg.de/abt0135/axeldb.htm.


Protein Expression Measurements 


Gaining protein expression data is, of course, one of the goals of proteomics. In addition to
temporal and spatial gene expression patterns, protein expression data offer the promise of 
many clues to the functions of proteins. Protein expression data are accumulating more slow- 
ly than the interaction and phenotype data. Currently, only a few sets of protein expression 
data are available outside of the private sector. However, a great deal of research effort is being 
focused in this direction, and it is likely that these data will rapidly become available. Recent 
developments in mass spectrometric techniques have allowed the first large-scale measure- 
ments of protein expression patterns, using two main approaches described below. 


* In the first approach, proteins from two sets of cells are proteolyzed to peptides, and the set 
of cysteine-containing peptides is purified from each sample with sulfhydryl-specific 
affinity reagents (e.g., ICAT [Gygi et al. 1999b]; see Chapter 8, Protocol 9). The specific 
forms of the sulfhydryl-specific reagents used for the two samples differ in their isotope 
distribution, such that the peptides from one set of cells will have a predictable mass difference 
from those of the other cell sample. These cysteine-containing peptides from the two samples 
are mixed and then analyzed by mass spectrometry (MS). Because of the mass differences 
associated with the peptides, the relative abundance of a given peptide in the two cell 
samples can be measured as the ratio of the two peak heights in the mass spectra. This approach 
has yielded expression measurements of hundreds of proteins (Gygi et al. 1999b; Conrads 
et al. 2000); the specific details of this approach and the second approach to follow are 
described in Chapters 1 and 8. 


* In the second general approach using MS, peptides generated from cell extracts have been 
identified based on capillary electrophoresis elution times and high-resolution mass 
measurements from a mass spectrometer capable of high mass accuracy, such as Fourier 
transform-ion cyclotron resonance MS (Conrads et al. 2000; Jensen et al. 2000; see Chapter 8). 
As with the first technique, isotope differences can be exploited in order to measure protein 
expression level changes between two samples. In this second approach, isotope labeling 
has been achieved by growing cells of the two samples in media with distinct isotopic 
distributions. As in the first technique, expression levels can be measured for thousands of 
peptides. 


In both MS approaches, peptides must be mapped to their parent proteins in a database 
search. In two large-scale tests of protein expression levels, measured protein expression levels 
were only poorly correlated with mRNA expression levels, suggesting that protein expression 
data will be of significant value in complementing available mRNA expression measurements 
(Gygi et al. 1999a; Ideker et al. 2001). 
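
The relative quantitation used in the isotope-labeling approaches above can be sketched in a few lines of Python. This is a minimal illustration only, not the software used in the cited studies. It assumes a deconvoluted peak list of (mass, height) pairs, one labeled cysteine per peptide, and a light/heavy mass difference of 8 Da. The function name, the 8-Da default, and the tolerance are all hypothetical choices for the example.

```python
def light_heavy_ratios(peaks, mass_shift=8.0, tolerance=0.05):
    """Pair each 'light' peak with a 'heavy' peak mass_shift Da higher.

    peaks: list of (mass, height) tuples (assumed already deconvoluted).
    Returns a list of (light_mass, heavy_height / light_height).
    """
    ordered = sorted(peaks)
    ratios = []
    for light_mass, light_height in ordered:
        if light_height <= 0:
            continue  # cannot form a ratio against an empty peak
        for heavy_mass, heavy_height in ordered:
            # A heavy partner sits one isotope shift above the light peak.
            if abs((heavy_mass - light_mass) - mass_shift) <= tolerance:
                ratios.append((light_mass, heavy_height / light_height))
    return ratios


# Hypothetical peak list: one labeled pair plus an unpaired peak.
example_peaks = [(1000.50, 200.0), (1008.52, 100.0), (1500.75, 50.0)]
example_ratios = light_heavy_ratios(example_peaks)
```

In this made-up example, the peptide at 1000.50 Da is half as abundant in the heavy-labeled sample as in the light-labeled one. Real data would also require handling of multiple cysteines per peptide and of charge states.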

Despite this progress on mass spectrometric proteomics, the majority of public protein 
expression data currently derive from two-dimensional SDS-PAGE gels (e.g., see Chapter 4). 
These data are not currently organized for convenient searching of expression levels and 
patterns across many experiments. However, at least one database, Swiss2DPAGE, reports the 
measured expression patterns for ~800 proteins derived from a small set of organisms. 
Swiss2DPAGE can be accessed at http://ca.expasy.org/ch2d/. 


Mutant Phenotypes 


Just as with interaction and pathway data, phenotypic data are accumulating at a rapid rate 
for thousands of genes. These data can be of use in proteomics in many different ways, but 
perhaps the most important will be by providing additional validation for proteomics exper- 
iments. For example, one expects that proteins functioning in the same pathway should have 
generally similar or related phenotypes when their corresponding genes are disrupted. 
Therefore, this becomes a quantitative test that can be applied to any particular set of pro- 
teins linked or coexpressed in a proteomics experiment. 

In several model organisms, including yeast and C. elegans, genes have been systemati- 
cally disrupted, and the effects of these gene disruptions have been observed. Recent data of 
this sort include the measurement of disruption phenotypes of more than half of the genes 
of yeast (Ross-Macdonald et al. 1999; Winzeler et al. 1999), the transposon mutagenesis of 
most of the genes of Mycoplasma genitalium (Hutchison et al. 1999), and the RNAi inhibition 
of hundreds of genes of C. elegans (Fraser et al. 2000; Gonczy et al. 2000; Maeda et al. 2001). 

Gene disruption data can be accessed in a number of different ways. If the gene of inter- 
est is derived from one of the organisms with an extensive knockout database, it is a simple 
matter of finding the gene in the database. As with protein interaction interologs, it is possi- 
ble to extend gene-knockout phenotypes measured in one organism to the orthologs of the 
genes in a second organism. The standard cautions apply—this is an unproven approach, and 
one is taking something of a leap of faith. However, it is possible to generate reasonable 
hypotheses about the potential phenotypes of a disrupted gene in this manner. 

Gene disruption data for yeast are most easily accessible from the SGD database 
(http://genome-www.stanford.edu/Saccharomyces/) or (through subscription) the YPD 
database (https://www.incyte.com/proteome/databases.jsp). Gene disruption data for M. 
genitalium are available at http://www.sciencemag.org/feature/data/1042937.shl. Finally, gene 
disruption data for C. elegans are available on the Internet in the supplemental information 
of Fraser et al. (2000) and Gonczy et al. (2000), in the Nematode Expression Pattern Database 
(http://nematode.lab.nig.ac.jp/), and (by subscription) in WormPD 
(https://www.incyte.com/proteome/databases.jsp). Phenotypic data are also organized for a 
number of other organisms, such as Drosophila, in FlyBase (http://flybase.bio.indiana.edu/), 
and mice, in MGD (http://www.informatics.jax.org/). 


Predicting the Subcellular Localization of a Protein 


Finally, it is often of great interest to know not only the function of a protein, but also where 
that function is performed. Although the subcellular localization may be directly measurable 
(e.g., for the preparation of specific subcellular fractions, see Chapter 3), subcellular 
localization can also be predicted to some extent. Two programs, PSORT (Nakai 2000) and SignalP 
(Emanuelsson et al. 2000), are commonly used to predict protein cellular localization and 
secretion. SignalP is specifically trained to identify amino-terminal secretory signals. A 
sequence can be submitted to SignalP with META-PP (mentioned above). The SignalP output 
is thoroughly described at http://www.cbs.dtu.dk/services/SignalP/output.html. PSORT 
is a more general program, predicting localization to many different subcellular compartments. 
PSORT and PSORT II are available for use at http://psort.nibb.ac.jp/. The Web site 
maintains an excellent manual clearly describing the output. 


ADDITIONAL RESOURCES 


A number of excellent texts are available for in-depth discussions of the topics in this chap- 
ter. 


* Books 
For general discussion of bioinformatics approaches, appropriate starting texts include: 


Bioinformatics: Sequence and Genome Analysis by David Mount (Cold Spring Harbor 
Laboratory Press) 


Bioinformatics: A Practical Guide to the Analysis of Genes and Proteins, edited by 
Baxevanis and Ouellette (Wiley) 


Introduction to Bioinformatics, edited by Attwood and Parry-Smith (Prentice Hall) 


Proteome Research: New Frontiers in Functional Genomics, edited by Wilkins, Williams, 
Appel, and Hochstrasser (Springer-Verlag) 


For a more detailed look at the algorithms involved, we recommend: 
Biological Sequence Analysis by Durbin and colleagues (Cambridge University Press) 
Computational Molecular Biology by Pavel Pevzner (MIT Press) 


For extensive discussion of protein interactions and experimental protocols, we recom- 
mend: 


Protein-Protein Interactions, edited by Erica Golemis (Cold Spring Harbor Laboratory 
Press) 


For general information about protein structure and function, we recommend: 
Proteins by T.E. Creighton (W.H. Freeman) 

* Web Sites 
Several excellent on-line tutorials are available. We recommend starting with: 


http://www.biochem.ucl.ac.uk/bsm/dbbrowser/jj/prefacefrm.html 
WWW RESOURCES 


http://apropos.icmb.utexas.edu/function-prediction.html Protein Link Explorer (PLEX), University of 
Texas. 

http://bioinfo.pl/cafasp/ CAFASP (Critical Assessment of Fully Automated Structure Prediction). 

http://bioinfo.pl/LiveBench/ The Live Bench Project: Continuous benchmarking of protein structure 
prediction servers. 

http://ca.expasy.org/ch2d/ SWISS-2DPAGE: Two-dimensional polyacrylamide gel electrophoresis 
database. 

http://ca.expasy.org/sprot/ SWISS-PROT protein knowledgebase. 

http://cubic.bioc.columbia.edu/eva/sec/common.html EVA summary. 

http://dip.doe-mbi.ucla.edu/ The Database of Interacting Proteins. 

http://flybase.bio.indiana.edu/ FlyBase: A Database of the Drosophila Genome. 

http://genome-www5.stanford.edu/cgi-bin/SMD/cluster/QuerySetup.pl The Stanford Microarray 
Database. 

http://genome-www.stanford.edu/Saccharomyces/ The Saccharomyces Genome Database (SGD). 

http://maple.bioc.columbia.edu/pp/submit_meta.html The META Server. 

http://mathbio.nimr.mrc.ac.uk/~rgeorge/snapdragon/ SnapDRAGON domain boundary prediction. 

http://nematode.lab.nig.ac.jp/ NEXTDB (The Nematode Expression Pattern DataBase). 

http://pfam.wustl.edu/hmmsearch.shtml Pfam HMM database, Washington University in St. Louis. 

http://pipe.rockefeller.edu/modbase-cgi/index.cgi ModBase models. 

http://predictioncenter.llnl.gov/ Protein Structure Prediction Center, Lawrence Livermore National 
Laboratory. 

http://prodes.toulouse.inra.fr/prodom/doc/prodom.html The Protein Domain Database: ProDom. 

http://psort.nibb.ac.jp/ PSORT (Prediction of Protein Sorting Signals and Localization Sites in Amino Acid 
Sequences). 

http://rana.lbl.gov/EisenSoftware.htm The Eisen Lab Software page. 

http://scop.berkeley.edu SCOP (Structural Classification of Proteins). 

http://smart.embl-heidelberg.de/ SMART sequence database. 

http://transfac.gbf.de/TRANSFAC TRANSFAC (The Transcription Factor Database). 

http://wit.mcs.anl.gov/WIT2/ WIT: Interactive Metabolic Reconstruction. 

http://xray.bmc.uu.se/sbnet/prosal.html ProSAL (Protein Sequence Analysis Launcher). 

http://www.bind.ca/ The Biomolecular Interaction Network Database (BIND). 

http://www.biochem.ucl.ac.uk/bsm/cath_new/index.html CATH Protein Structure Classification. 

http://www.blocks.fhcrc.org/blocks_search.html BLOCKS searcher. 

http://www.bmm.icnet.uk/~3djigsaw/dom_fish/ Domain Fishing 1.0. 

http://www.bork.embl-heidelberg.de/STRING/ STRING (Search Tool for Recurring Instances of 
Neighbouring Genes). 

http://www.cbs.dtu.dk/services/SignalP/output.html CBS SignalP V1.1, World Wide Web Prediction 
Server, Center for Biological Sequence Analysis. 

http://www.cifn.unam.mx/Computational_Genomics/regulondb/ RegulonDB v3.2: A Database on 
Transcriptional Regulation and Genome Organization. 

http://www.dkfz-heidelberg.de/abt0135/axeldb.htm The Axeldb database home page. 

http://www.doe-mbi.ucla.edu/Services/moment/ MOMENT Transmembrane Helix Prediction. 

http://www2.ebi.ac.uk/dali/fssp/fssp.html The FSSP database, EMBL-EBI. 

http://www.expasy.ch/swissmod/SWISS-MODEL.html SWISS-MODEL. 

http://hydra.icgeb.trieste.it/~kristian/SBASE/ BLAST-FTILOM database. 

https://www.incyte.com/proteome/databases.jsp The Proteome BioKnowledge® Library, Incyte 
TABLE A1.1. Metric prefixes 


Prefix          Numerical equivalent 
yotta  (Y)      10^24 
zetta  (Z)      10^21 
exa    (E)      10^18 
peta   (P)      10^15 
tera   (T)      10^12 
giga   (G)      10^9 
mega   (M)      10^6 
kilo   (k)      10^3 
hecto  (h)      10^2 
deka   (da)     10^1 
deci   (d)      10^-1 
centi  (c)      10^-2 
milli  (m)      10^-3 
micro  (µ)      10^-6 
nano   (n)      10^-9 
pico   (p)      10^-12 
femto  (f)      10^-15 
atto   (a)      10^-18 
zepto  (z)      10^-21 
yocto  (y)      10^-24 


TABLE A1.2. Pressure conversion table 


kPa       psi       bar       MPa 
690       100       6.9       0.7 
3450      500       34.5      3.4 
6890      1000      68.9      6.9 
10340     1500      103.4     10.3 
13790     2000      137.9     13.8 
17240     2500      172.4     17.2 
20680     3000      206.8     20.7 
24130     3500      241.3     24.1 
27580     4000      275.8     27.6 
31030     4500      310.3     31.0 
34470     5000      344.7     34.5 
37920     5500      379.2     37.9 
41370     6000      413.7     41.4 
44820     6500      448.2     44.8 
48260     7000      482.6     48.3 
51710     7500      517.1     51.7 
55160     8000      551.6     55.2 
58610     8500      586.1     58.6 
62050     9000      620.5     62.1 
65500     9500      655.0     65.5 
68950     10000     689.5     68.9 
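
Pressures that fall between the rows of the pressure conversion table can be converted directly. The sketch below assumes the standard factor of 6.894757 kPa per psi, which is not stated in the table itself; the table values appear to be rounded to the nearest 10 kPa. The function name is illustrative.

```python
KPA_PER_PSI = 6.894757  # standard conversion factor (assumed; not taken from the table)


def psi_to_pressure_units(psi):
    """Return the pressure in kPa, bar, and MPa for a value given in psi."""
    kpa = psi * KPA_PER_PSI
    return {"kPa": kpa, "bar": kpa / 100.0, "MPa": kpa / 1000.0}


# Example: 1000 psi, for comparison with the 1000-psi row of the table.
row_1000_psi = psi_to_pressure_units(1000)
print(f"{row_1000_psi['kPa']:.0f} kPa, {row_1000_psi['bar']:.1f} bar, {row_1000_psi['MPa']:.1f} MPa")
```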
TABLE A1.3. Moles/weight conversion table 

Molecular mass 
(daltons)       1 µg                                          1 nmole 
100             10 nmoles or 6 x 10^15 molecules              0.1 µg 
1,000           1 nmole or 6 x 10^14 molecules                1 µg 
10,000          100 pmoles or 6 x 10^13 molecules             10 µg 
20,000          50 pmoles or 3 x 10^13 molecules              20 µg 
30,000          33 pmoles or 2 x 10^13 molecules              30 µg 
40,000          25 pmoles or 1.5 x 10^13 molecules            40 µg 
50,000          20 pmoles or 1.2 x 10^13 molecules            50 µg 
60,000          17 pmoles or 1.0 x 10^13 molecules            60 µg 
70,000          14 pmoles or 8.4 x 10^12 molecules            70 µg 
80,000          12 pmoles or 7.2 x 10^12 molecules            80 µg 
90,000          11 pmoles or 6.6 x 10^12 molecules            90 µg 
100,000         10 pmoles or 6 x 10^12 molecules              100 µg 
120,000         8.3 pmoles or 5 x 10^12 molecules             120 µg 
140,000         7.1 pmoles or 4.3 x 10^12 molecules           140 µg 
160,000         6.3 pmoles or 3.8 x 10^12 molecules           160 µg 
180,000         5.6 pmoles or 3.3 x 10^12 molecules           180 µg 
200,000         5 pmoles or 3 x 10^12 molecules               200 µg 
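
For molecular masses not listed in the moles/weight table, the same quantities follow from the definition of the mole. The sketch below is a plain restatement of that arithmetic; the function names are illustrative, and Avogadro's number is taken as 6.022 x 10^23.

```python
AVOGADRO = 6.022e23  # molecules per mole


def picomoles_per_microgram(molecular_mass_da):
    """pmoles contained in 1 µg of a molecule of the given mass (Da = g/mole)."""
    return 1.0e6 / molecular_mass_da


def molecules_per_microgram(molecular_mass_da):
    """Number of molecules contained in 1 µg."""
    return picomoles_per_microgram(molecular_mass_da) * 1.0e-12 * AVOGADRO


def micrograms_per_nanomole(molecular_mass_da):
    """Mass in µg of 1 nmole."""
    return molecular_mass_da / 1000.0


# Example: a hypothetical 66,000-dalton protein, which falls between table rows.
pmol_66k = picomoles_per_microgram(66000)   # about 15 pmoles per µg
ug_66k = micrograms_per_nanomole(66000)     # 66 µg per nmole
```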


TABLE A1.4. Selected buffers and their pK values at 25°C 


Trivial name          Buffer name                                                      pKa      ΔpKa/Δt (a) 

Phosphate (pK1)       -                                                                2.15     0.0044 
Malate (pK1)          -                                                                3.40     - 
Formate               -                                                                3.75     0.0 
Succinate (pK1)       -                                                                4.21     -0.0018 
Citrate (pK2)         -                                                                4.76     -0.0016 
Acetate               -                                                                4.76     0.0002 
Malate (pK2)          -                                                                5.13     - 
Pyridine              -                                                                5.23     -0.014 
Succinate (pK2)       -                                                                5.64     0.0 
MES                   2-(N-Morpholino)ethanesulfonic acid                              6.10     -0.011 
Cacodylate            Dimethylarsinic acid                                             6.27     - 
Dimethylglutarate     3,3-Dimethylglutarate (pK2)                                      6.34     0.0060 
Carbonate (pK1)       -                                                                6.35     -0.0055 
Citrate (pK3)         -                                                                6.40     0.0 
BIS-Tris              [Bis(2-hydroxyethyl)imino]tris(hydroxymethyl)methane             6.46     0.0 
ADA                   N-2-Acetamidoiminodiacetic acid                                  6.59     -0.011 
Pyrophosphate         -                                                                6.60     - 
EDPS (pK1)            N,N'-Bis(3-sulfopropyl)ethylenediamine                           6.65     - 
Bis-Tris propane      1,3-Bis[tris(hydroxymethyl)methylamino]propane                   6.80     - 
PIPES                 Piperazine-N,N'-bis(2-ethanesulfonic acid)                       6.76     -0.0085 
ACES                  N-2-Acetamido-2-aminoethanesulfonic acid                         6.78     -0.020 
MOPSO                 3-(N-Morpholino)-2-hydroxypropanesulfonic acid                   6.95     -0.015 
Imidazole             -                                                                6.95     -0.020 
BES                   N,N-Bis(2-hydroxyethyl)-2-aminoethanesulfonic acid               7.09     -0.016 
MOPS                  3-(N-Morpholino)propanesulfonic acid                             7.20     -0.015 
Phosphate (pK2)       -                                                                7.20     -0.0028 
EMTA                  3,6-Endomethylene-1,2,3,6-tetrahydrophthalic acid                7.23     - 
TES                   2-[Tris(hydroxymethyl)methylamino]ethanesulfonic acid            7.40     -0.020 
HEPES                 N-2-Hydroxyethylpiperazine-N'-2-ethanesulfonic acid              7.48     -0.014 
DIPSO                 3-[N-Bis(hydroxyethyl)amino]-2-hydroxypropanesulfonic acid       7.60     -0.015 
TEA                   Triethanolamine                                                  7.76     -0.020 
POPSO                 Piperazine-N,N'-bis(2-hydroxypropanesulfonic acid)               7.85     -0.013 
EPPS, HEPPS           N-2-Hydroxyethylpiperazine-N'-3-propanesulfonic acid             8.00     - 
Tris                  Tris(hydroxymethyl)aminomethane                                  8.06     -0.028 
Tricine               N-[Tris(hydroxymethyl)methyl]glycine                             8.05     -0.021 
Glycinamide           -                                                                8.06     -0.029 
PIPPS                 1,4-Bis(3-sulfopropyl)piperazine                                 8.10     - 
Glycylglycine         -                                                                8.25     -0.025 
Bicine                N,N-Bis(2-hydroxyethyl)glycine                                   8.26     -0.018 
TAPS                  3-{[Tris(hydroxymethyl)methyl]amino}propanesulfonic acid         8.40     0.018 
Morpholine            -                                                                8.48     - 
PIPBS                 1,4-Bis(4-sulfobutyl)piperazine                                  8.60     - 
AES                   2-Aminoethylsulfonic acid, taurine                               9.06     -0.022 
Borate                -                                                                9.23     -0.008 
Ammonia               -                                                                9.25     -0.031 
Ethanolamine          -                                                                9.50     -0.029 
CHES                  Cyclohexylaminoethanesulfonic acid                               9.55     -0.029 
Glycine (pK2)         -                                                                9.78     -0.025 
EDPS (pK2)            N,N'-Bis(3-sulfopropyl)ethylenediamine                           9.80     - 
APS                   3-Aminopropanesulfonic acid                                      9.89     - 
Carbonate (pK2)       -                                                                10.33    -0.009 
CAPS                  3-(Cyclohexylamino)propanesulfonic acid                          10.40    0.032 
Piperidine            -                                                                11.12    - 
Phosphate (pK3)       -                                                                12.33    -0.026 


Reprinted, with permission, from Blanchard (1984). 

(a) The change in pK per degree centigrade. These values can be used 
to correct the pK values in the table to higher or lower temperatures. 
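
The temperature correction described in the table footnote amounts to a linear adjustment from the 25°C reference value. The sketch below assumes that the tabulated change in pK per degree is constant over the temperature range of interest, which is an approximation. The function name is illustrative.

```python
def pk_at_temperature(pk_25, dpk_per_degree, temperature_c):
    """Estimate a buffer pK at temperature_c from its 25°C value.

    pk_25:          pK listed in the table for 25°C
    dpk_per_degree: change in pK per degree centigrade from the same table
    """
    return pk_25 + dpk_per_degree * (temperature_c - 25.0)


# Example with the table entries for Tris (pK 8.06, change of -0.028 per degree).
tris_cold_room = pk_at_temperature(8.06, -0.028, 4.0)    # higher pK in the cold
tris_incubator = pk_at_temperature(8.06, -0.028, 37.0)   # lower pK at 37°C
```

With these inputs, the estimated pK of Tris rises by roughly 0.6 unit between 25°C and 4°C. For this reason, the pH of a Tris buffer is best adjusted at the temperature at which it will be used.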
TABLE A1.5. Approximate molarities and specific gravities of concentrated acids and ammonia 


                      Percentage     Approximate      Milliliters/liter to       Specific 
                      by weight      molarity (M)     prepare 1 M solution       gravity 
Acetic acid           99.6           17.4             57.5                       1.05 
Ammonia               25             13.3             75.1                       0.91 
                      35             18.1             55.2                       0.88 
Formic acid           90             23.6             42.4                       1.205 
                      98             25.9             38.5                       1.22 
Hydrochloric acid     36             11.6             85.9                       1.18 
Nitric acid           70             15.7             63.7                       1.42 
Perchloric acid       60             9.2              108.8                      1.54 
                      72             12.2             82.1                       1.70 
Phosphoric acid       90             16.0             62.4                       1.75 
                                     (N = 48.1)       (1 N = 20.8) 
Sulfuric acid         98             18.3             54.5                       1.835 
                                     (N = 36.7)       (1 N = 27.3) 


Adapted, with permission, from Dawson et al. (1969). 
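
The approximate molarities in the table of concentrated acids can be checked, or extended to other stock concentrations, from the percentage by weight and the specific gravity. The sketch below assumes that the specific gravity is numerically equal to the density in g/ml and that the molar masses are supplied by the user; the values 36.46 for HCl and 98.08 for sulfuric acid are standard figures that are not given in the table. Function names are illustrative.

```python
def stock_molarity(percent_by_weight, specific_gravity, molar_mass):
    """Approximate molarity of a concentrated stock solution.

    1 liter weighs 1000 * specific_gravity grams, of which
    percent_by_weight / 100 is solute; divide by the molar mass.
    """
    return 10.0 * percent_by_weight * specific_gravity / molar_mass


def ml_stock_per_liter(target_molarity, stock_molarity_value):
    """Milliliters of stock to dilute to 1 liter for the target molarity."""
    return 1000.0 * target_molarity / stock_molarity_value


# Example: 36% hydrochloric acid, specific gravity 1.18.
hcl_molarity = stock_molarity(36.0, 1.18, 36.46)        # about 11.6 M
hcl_ml_for_1m = ml_stock_per_liter(1.0, hcl_molarity)   # about 86 ml per liter
```

Small differences from the tabulated volumes are expected, because the table entries are themselves rounded approximations.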


TABLE A1.6. Mass changes due to some posttranslational modifications of peptides and proteins 


                                                            Monoisotopic     Average 
Modification                                                mass change      mass change 
Homoserine formed from Met by CNBr treatment                -29.9928         -30.0935 
Pyroglutamic acid formed from Gln                           -17.0265         -17.0306 
Disulfide bond formation                                    -2.0157          -2.0159 
Carboxy-terminal amide formed from Gly                      -0.9840          -0.9847 
Deamidation of Asn and Gln                                  0.9840           0.9847 
Methylation                                                 14.0157          14.0269 
Hydroxylation                                               15.9949          15.9994 
Oxidation of Met                                            15.9949          15.9994 
Proteolysis of a single peptide bond                        18.0106          18.0153 
Formylation                                                 27.9949          28.0104 
Acetylation                                                 42.0106          42.0373 
Carboxylation of Asp and Glu                                43.9898          44.0098 
Phosphorylation                                             79.9663          79.9799 
Sulfation                                                   79.9568          80.0642 
Cysteinylation                                              119.0041         119.1442 
Pentoses (Ara, Rib, Xyl)                                    132.0423         132.1161 
Deoxyhexoses (Fuc, Rha)                                     146.0579         146.1430 
Hexosamines (GalN, GlcN)                                    161.0688         161.1577 
Hexoses (Fru, Gal, Glc, Man)                                162.0528         162.1424 
Lipoic acid (amide bond to lysine)                          188.0330         188.3147 
N-acetylhexosamines (GalNAc, GlcNAc)                        203.0794         203.1950 
Farnesylation                                               204.1878         204.3556 
Myristoylation                                              210.1984         210.3598 
Biotinylation (amide bond to lysine)                        226.0776         226.2994 
Pyridoxal phosphate (Schiff base formed to lysine)          231.0297         231.1449 
Palmitoylation                                              238.2297         238.4136 
Stearoylation                                               266.2610         266.4674 
Geranylgeranylation                                         272.2504         272.4741 
N-acetylneuraminic acid (sialic acid, NeuAc, NANA, SA)      291.0954         291.2579 
Glutathionylation                                           305.0682         305.3117 
N-glycolylneuraminic acid (NeuGc)                           307.0903         307.2573 
5'-Adenosylation                                            329.0525         329.2091 
4'-Phosphopantetheine                                       339.0780         339.3294 
ADP-ribosylation (from NAD)                                 541.0611         541.3052 
Adventitious modifications 
  acrylamide                                                71.0371          71.0788 
  glutathione                                               304.0712         304.3038 
  β-mercaptoethanol                                         75.9983          76.1192 


Adapted, with permission, from Burlingame and Carr (1996). 
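
As an illustration of how the mass changes in the table are applied, the sketch below adds monoisotopic shifts to an unmodified peptide mass and converts the result to an expected m/z value. Only four table entries are included. The proton mass of 1.007276 is a standard value not taken from the table, and the dictionary and function names are hypothetical.

```python
PROTON_MASS = 1.007276  # mass of a proton in daltons (assumed standard value)

# Monoisotopic mass changes copied from the table above (subset only).
MONOISOTOPIC_SHIFT = {
    "methylation": 14.0157,
    "oxidation_met": 15.9949,
    "acetylation": 42.0106,
    "phosphorylation": 79.9663,
}


def modified_mass(peptide_mass, modifications):
    """Add the mass change of each modification to a neutral peptide mass.

    modifications: dict mapping a modification name to the number of sites.
    """
    shift = sum(MONOISOTOPIC_SHIFT[name] * count
                for name, count in modifications.items())
    return peptide_mass + shift


def mass_to_charge(neutral_mass, charge):
    """Expected m/z of the [M + zH] ion for a positive charge state z."""
    return (neutral_mass + charge * PROTON_MASS) / charge


# Example: a hypothetical 1000.0000-dalton peptide carrying two phosphates.
doubly_phosphorylated = modified_mass(1000.0, {"phosphorylation": 2})
mz_2plus = mass_to_charge(doubly_phosphorylated, 2)
```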
TABLE A1.7. Common amino acids not encoded in the genetic code 


                                                              Monoisotopic     Average 
Symbol       Name                              Composition    mass             mass 
Abu          2-aminobutyric acid               C4H7NO         85.05276         85.1057 
AECys        aminoethylcysteine                C5H10N2OS      146.05138        146.2133 
Aib          2-aminoisobutyric acid            C4H7NO         85.05276         85.1057 
Cmc          carboxymethylcysteine             C5H7NO3S       161.01466        161.1815 
Cys(O3H)     cysteic acid                      C3H5NO4S       150.99393        151.1430 
Dha          dehydroalanine                    C3H3NO         69.02146         69.0630 
Dhb          dehydroamino-2-butyric acid       C4H5NO         83.03711         83.0898 
Gla          4-carboxyglutamic acid            C6H7NO5        173.03242        173.1253 
Hse          homoserine                        C4H7NO2        101.04768        101.1051 
Hyl          hydroxylysine                     C6H12N2O2      144.08988        144.1736 
Hyp          hydroxyproline                    C5H7NO2        113.04768        113.1161 
Iva          isovaline                         C5H9NO         99.06841         99.1326 
Nle          norleucine                        C6H11NO        113.08406        113.1595 
Orn          ornithine                         C5H10N2O       114.07931        114.1473 
Pip          2-piperidinecarboxylic acid       C6H9NO         111.06841        111.1436 
Pyr          pyroglutamic acid                 C5H5NO2        111.03203        111.1002 
Sar          sarcosine                         C3H5NO         71.03711         71.0788 


Adapted, with permission, from Burlingame and Carr (1996). 
TABLE A1.8. Structures of the 20 common amino acid side chains found in proteins 


[Structural drawings not reproduced.] 

Alanine          Ala    A          Leucine          Leu    L 
Arginine         Arg    R          Lysine           Lys    K 
Asparagine       Asn    N          Methionine       Met    M 
Aspartic Acid    Asp    D          Phenylalanine    Phe    F 
Cysteine         Cys    C          Proline          Pro    P 
Glutamine        Gln    Q          Serine           Ser    S 
Glutamic Acid    Glu    E          Threonine        Thr    T 
Glycine          Gly    G          Tryptophan       Trp    W 
Histidine        His    H          Tyrosine         Tyr    Y 
Isoleucine       Ile    I          Valine           Val    V 


The full name, three-letter code, and one-letter code are given for each amino acid (e.g., alanine, Ala, A), as well as the 
chemical formula. Color code: (red) oxygen atoms; (dark blue) nitrogen atoms; (light blue) hydrogen atoms; (gray) carbon atoms; 
(yellow) sulfur atoms. For additional amino acid information, see http://prowl.rockefeller.edu/aainfo/contents.htm 
TECHNIQUE 1 


Quantifying Protein by Bicinchoninic Acid 


THIS METHOD IS A VARIATION OF THE LOWRY ASSAY developed by Smith et al. (1985), which uses 
bicinchoninic acid (BCA) to enhance the detection of Cu+ generated under alkaline conditions 
at sites of complexation between Cu2+ and protein. The resulting tetradentate 
BCA-cuprous ion chromophore absorbs at 562 nm. The speed with which color develops in 
the BCA assay is influenced by the temperature of the reaction. At 37°C, the Cu+-protein 
complexes develop chiefly by oxidation of cysteine, tyrosine, and tryptophan residues. At 
60°C, the temperature used in this protocol, additional Cu+-protein complexes arise from 
oxidation of peptide bonds (Smith et al. 1985; Wiechelman et al. 1988). The concentration 
limits of common reagents on the BCA assay are listed in Table A2.1. 

This technique is divided into four parts: Standard Procedure, Construction of a 
Calibration Curve for the BCA Protein Assay, Microprocedure, and 96-Well Microtiter Plate 
Procedure. 


TABLE A2.1. Concentration limits of common reagents on the BCA and Bradford assays 


Compound Bradford BCA 

Buffers 
acetate 0.6 Me 0.2 M, (.25 M [pH 5.58) 
cacodylate-Tris Q.1 wb - 
glycine 0.1 M IM(pH 11} 
HEPES H)Ó mM 100 mM 
MES 700 mM 50 mat 
MOPS 200 mM 50 mmi 
Nà'-citrate 50 mM «I mw 
PIPES 500 mm 50 mM” 
potassium phosphate IM 
sodium phosphate im DL Mt 
sodium acetate 0.6 Mt 0.2 M (pH 5.5) 
TES - 50 m? 
Tris 2M 0.1 M, 0.25 m (pH 11.25) 
Tricine OK - 
BES 2. 5M - 

Salts 
MgCl, lm 207 
ammonium sulfate IM interferes 
NaCl 5M LI 
urea 6 M AM 
KCl IM 
guanidine HCI OK am 

Detergents and Denaturants 
Lubrol - 1%" 
Brij 35 interferes: poe 
CHAPS 196 1% 
Na-deoxychalate 0.25% - 
Nonidet P-40 interferes‘ 1% 

206 1% 


g 


actyl glucoside 
SD5 0.196 195: 
sodium cholate interferes" - 
Triton X- 100 0. 1f 1% 
‘Tween-20 interferes 
glucopyranoside OKE - 
Media 
Eagle's MEM OK - 
Hank’s salt solution OK - 
Earle's salt solution OK 
Sugars 
glucose OK 10 mw 
sucrose Tw 1M 
Chelators 
EDTA 160 ms 10 mM 
EGTA 0.05 v - 
Reducing Agents 
2-mercaptoethanol iM 50 um” 
dithiothreital lm | mM 
Alcohols, Polar Compounds 
acetone OK - 
DMSO - 58065 
ethanol OK - 
glycerol 99% 10% 
methanol OK - 
Miscellaneous 
sodium azide - 0.2%" 
DNA ! mg/ml - 
lipids - interferes” 
MgCl, LM - 
RNA 0.3 mg/ml - 
ATP 1 mM - 
ampholytes 0,596, 196! interferes 
HCI 0.1M? 0.18 
NaOH 0.1 Me? GLK 
NAD 1 mM - 
phenol 5% = 
amino acids OK - 
polypeptides («3 kD) OK = 
3-ethanolamine interferes" interferes 


Adapted, with permission, from Bollag et al. (1996 [©Wiley]). 

A dash (-) indicates that this compound has not been tested. "OK" indicates that this compound was used successfully, but no 
concentration was indicated. Note that there may be some variation in permissible amounts due to differences in the assay protocol. 

(a) Stoscheck (1990). 

(b) Kessler and Fanestil (1986). 

(c) Smith et al. (1985). 

(d) Kaushal and Barnes (1986). 

(e) Fanger (1987). 

(f) Low concentration (0.008%) of Triton X-100 results in both improved sensitivity and protein-to-protein variability for the 
estimation of low-molecular-weight proteins (Friedenauer and Berlet 1989). 

(g) Unlike other detergents, the glucopyranoside detergents, when used to solubilize membrane-bound proteins, do not 
interfere with the Bradford assay (Fanger 1987). Certain ampholytes (e.g., 2D Pharmalyte [pH 3-10], Polybuffer [pH 6-4] from 
Sigma; and Servalyt [pH 7-9] from Serva), ammonium sulfate, and reducing agents (glucose, 2-mercaptoethanol, DTT) 
interfere with the BCA protein assay (Brown et al. 1989). These substances can be removed prior to performing the BCA assay by 
DOC/TCA precipitation (Brown et al. 1989). 

(h) Hill and Straka (1988). 

(i) Spector (1978). 

(j) Read and Northcote (1981); Peterson (1983). 

(k) Concentrations of 150 mM 3-ethanolamine interfere with both the Bradford and BCA protein assays. 
TECHNIQUE 1A: Standard Procedure 


THE STANDARD BCA ASSAY IS CARRIED OUT in a volume of ~1 ml, using 50 µl of a series of 
dilutions of the test sample. The concentration of protein in the sample is calculated by 
interpolation into a plot of concentration versus absorbance obtained using known amounts of a 
reference protein (in this example, bovine serum albumin; please see Technique 1B). 

The range of sensitivity for this protocol is 0.02-2.0 mg of protein/ml. 


MATERIALS 


CAUTION: See Appendix 3 for appropriate handling of materials marked with <!>. 


» Reagents 


Bovine serum albumin (BSA) stock solution 
Dissolve 100 mg of BSA in 10 ml of H2O. 


Sample buffer 

Standard reagent A 
1.0 g of sodium BCA (4,4'-dicarboxy-2,2'-biquinoline, disodium salt) (26 mM final) 
2.0 g of Na2CO3·H2O (0.16 M final) 
0.16 g of sodium tartrate·2H2O (7 mM final) 
0.4 g of NaOH (0.1 M final) <!> 
0.95 g of NaHCO3 (0.11 M final) 


Adjust the volume to 100 ml with H2O. Stir the solution and, if necessary, adjust the pH to 11.25 with 
10 M NaOH. 

Standard reagent B 
Dissolve 0.4 g of CuSO4·5H2O in 10 ml of H2O. 
Standard reagents A and B are stable for 1-3 weeks when stored in plastic containers at room 
temperature, and indefinitely at 4°C. Note that although the disodium salt of BCA is soluble at neutral 
pH, the free acid is not readily soluble even at basic pH. 
Although Standard reagents A and B are simple to prepare, more reproducible results are obtained 
using commercial reagents, which are available in a variety of kits from Pierce Chemical Company 
(Rockford, Illinois). Commercial reagents are compatible with commonly used biological buffers, 
have a higher sensitivity, and display an expanded linear range. 

Standard working reagent 
Mix 100 volumes of Standard reagent A with 2 volumes of Standard reagent B. This solution is 
stable for 2-3 days at room temperature and should be green in color. 


» Equipment 


Cuvettes (glass or polystyrene) 
Spectrophotometer 
Test tubes (small disposable) or microfuge tubes 
Water bath or heating block preset to 60°C 


» Biological Sample 


Test sample(s) 
METHOD 


When using the BCA method to measure the concentration of protein in solution, it is essential to 
assay in parallel a set of samples containing known amounts of a reference protein, such as BSA. 
The results obtained from the reference samples are used to construct a calibration curve (see 
Technique 1B). 


* Assay dilutions of test and reference samples in duplicate (preferably, in triplicate). 

* If the same cuvette is to be used throughout the assay, read the samples with the lower protein 
  content first in order to minimize errors arising from sample carryover due to incomplete rinsing. 
  After the cuvette has been rinsed with H2O between samples, tap the inverted cuvette (gently) 
  on Kimwipes to remove excess water droplets. 

* For optimal results, add the BCA working reagent to the protein standards and "unknowns" 
  sequentially and, following color reaction development, read the absorbance values of the samples in the 
  same order. 


Each assay is only as good as the calibration curve. Do not take short cuts: Assay of unknown 
protein solutions must be accompanied by the generation of a new calibration curve. 


1. Dilute aliquots of the test sample(s) with sample buffer to a concentration that will fall 
   within the range of the calibration curve (0.25-2.0 mg/ml). To do this, estimate the 
   protein concentration by measuring the absorbance of the test sample at 280 nm. 

   The BCA method can be used with a wide variety of buffers (see Table A2.1), but it is sound 
   practice to include a "buffer blank" in the assay to validate this assumption. 


2. Use small disposable test tubes or microfuge tubes to prepare dilutions of the reference 
   protein (BSA) ranging from 0.2 to 2.0 mg/ml. 


3. Aliquot 50 µl of each dilution of test sample and reference protein into microfuge tubes. 
   The amount of reference protein in the tubes ranges from 10 to 100 µg. Add 1 ml of 
   standard working reagent to each tube and mix the contents well by vortexing. 


4. Incubate the tubes for 30 minutes at 60°C (or for 2 hours at ambient temperature). 

   Incubating the samples for longer periods of time can increase the sensitivity of the assay. 
   Conversely, if the color is becoming too intense, terminate the incubation earlier. Take care 
   to treat samples and standards identically. Although the assay may also be performed at 
   room temperature, incubation at temperatures <60°C results in diminished sensitivity (due 
   to the inability of peptide bonds to react with the cupric ions at lower temperatures) and 
   greater sample-to-sample variability. A procedure for developing the color in seconds, using 
   a microwave oven, has been described by Akins and Tuan (1992). 


5. Cool the reactions to room temperature and then measure the absorbance at 562 nm, 
   using glass or polystyrene cuvettes. The color is stable for at least 1 hour. 


6. Plot A562 versus protein concentration to generate the calibration curve, and determine 
   the exact concentration of protein in the test samples by interpolation as shown in 
   Technique 1B. 
TECHNIQUE 1B: Construction of a Calibration Curve for the BCA Protein Assay 


DILUTIONS OF REFERENCE SAMPLES (PROTEIN STANDARDS) should be assayed in duplicate 
(preferably, in triplicate). Each assay is only as good as the calibration curve. Do not take 
short cuts: Assay of unknown protein solutions must be accompanied by the generation of a 
new calibration curve. It is essential that the calibration curve remains linear (i.e., adheres to 
Beer's law) over the range of protein concentrations tested; otherwise, a significant margin of 
error will be associated with the protein estimates of the unknown samples. Table A2.2 
illustrates how to arrange absorbance data in a format that facilitates construction of the 
calibration curve (Figure A2.1). 

Assuming that the test samples were assayed in parallel with the protein standards used 
to generate a calibration curve (see, e.g., Figure A2.1 and Tables A2.2 and A2.3), the protein 
concentration of the unknown samples can be calculated as follows: 


1. For each dilution set of the test protein, calculate the average absorbance at 562 nm for a 
given volume of test sample. 


2. Convert the average absorbances calculated in Step 1 to mass of protein by interpolation 
of the calibration curve. 


3. Calculate the protein concentration in each dilution set by dividing the mass of protein 
by the sample volume (i.e., the volume of test protein plus the volume of buffer). 


4. Disregard average absorbance values that are outside the linear portion of the calibration 
curve. 


TABLE A2.2. Sample data for a calibration curve for the BCA assay 


Standard BCA 
Sample Protein solution Experiment reagent Ave 
number (ug) 2 mg/ml (ul) buffer (Jl) (ml) Ase: Åsa 
l 0 o - 50 Ü 0 
2 i 6.25 43.75 1 0.21 
3 12.5 6.25 43,75 l 0.21 0.20 
4 6.25 43.75 aoo o at 
5 12.50 37.5 i 0.53 
6 25.0 12.50 37.5 0.45 0.49 
7 12,50 37.5 — 100 050 uu 
8 18.75 31.25 0.80 
8 37.5 18.75 31.25 i 0.72 0.76 
10 18.75 31.25 Lo o 07 ] 
11 25.00 25.00 1 1.06 
12 50.0 25.00 25.00 I 1.02 1.05 
13 25.00 25.00 bo — 107 EE 
l4 50,00 - 1 1.95 
15 100.0 50.00 - l 2.10 31.02 
16 50.00 - l 2.01 


Adjust the concentration of the protein standard to 20 ugiml by measuring the Asus OF the Aggie! Asso 10 determine the 
protein content, and dilute with sample buffer as necessary. 
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FIGURE A2.1. Calibration curve for the BCA protein assay. 


5. Calculate the average of the protein concentrations of appropriate dilution sets (Step 3)
to obtain the final estimate of protein concentration of the test sample. For example,
using the calibration curve in Figure A2.1 and the data in Table A2.3:


a. The average absorbance for samples 7, 8, and 9 is (0.88 + 0.87 + 0.88)/3 = 0.88.


b. Using the calibration curve (Figure A2.1), a horizontal line at A562 = 0.88 intersects the
calibration curve at a point that corresponds to 43 µg of protein.


c. The protein concentration is 43 µg/5 µl = 8.6 µg/µl = 8.6 mg/ml (listed as 8.7 mg/ml in
Table A2.3).


d. For the data set shown in Table A2.3, ignore the lowest and highest average readings
since they are unreliable (either off scale [i.e., >2] or too low [i.e., <0.1]).


e. Averaging the remaining two protein concentration values in Table A2.3 (i.e., 8.3 and
8.7 mg/ml) gives the final estimate of protein concentration of the unknown sample
([8.3 + 8.7]/2 = 8.5 mg/ml).
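
The interpolation in Steps 2 and 3 can also be done numerically rather than by reading the graph. The following is a minimal sketch, assuming that the average A562 values of Table A2.2 are used as the standards and that straight-line interpolation between neighboring standards is acceptable; the function names are illustrative and are not part of the original protocol.

```python
from bisect import bisect_left

# (protein mass in µg, average A562) pairs taken from Table A2.2.
STANDARDS = [(0.0, 0.00), (12.5, 0.20), (25.0, 0.49),
             (37.5, 0.76), (50.0, 1.05), (100.0, 2.02)]


def mass_from_absorbance(a562, standards=STANDARDS):
    """Interpolate protein mass (µg) from an average A562 reading."""
    masses = [m for m, _ in standards]
    readings = [a for _, a in standards]
    if not readings[0] <= a562 <= readings[-1]:
        raise ValueError("A562 lies outside the calibrated range")
    i = bisect_left(readings, a562)
    if readings[i] == a562:
        return masses[i]
    # Straight line between the two bracketing standards.
    fraction = (a562 - readings[i - 1]) / (readings[i] - readings[i - 1])
    return masses[i - 1] + fraction * (masses[i] - masses[i - 1])


def concentration_mg_per_ml(a562, test_sample_ul):
    """µg of protein per µl of test sample assayed (numerically mg/ml)."""
    return mass_from_absorbance(a562) / test_sample_ul


if __name__ == "__main__":
    # Samples 7-9 of Table A2.3: average A562 of 0.88 from 5 µl of test sample.
    print(round(mass_from_absorbance(0.88), 1))
    print(round(concentration_mg_per_ml(0.88, 5.0), 2))
```

Readings outside the range of the standards raise an error instead of being extrapolated, which mirrors the instruction in Step 4 to disregard such values.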


TABLE A2.3. Sample calculations for determining protein concentration using the BCA assay

Sample   Sample   Experiment    BCA reagent   A562    Ave A562   Protein      Protein
number   (µl)     buffer (µl)   (ml)                             (µg)         (mg/ml)
1        50       -             1             >2
2        50       -             1             >2      >2         unreliable
3        50       -             1             >2
4        10       40            1             1.66
5        10       40            1             1.70    1.67       83.0         8.3
6        10       40            1             1.65
7        5        45            1             0.88
8        5        45            1             0.87    0.88       43.0         8.7
9        5        45            1             0.87
10       2.5      47.5          1             0.09
11       2.5      47.5          1             0.07    0.08       unreliable
12       2.5      47.5          1             0.09
TECHNIQUE 1C: Microprocedure


THE STANDARD BCA ASSAY (TECHNIQUE 1A) USES LARGE VOLUMES of both reagents and samples
and cannot easily be automated. The following method is more parsimonious and can be
adapted for use with a microplate reader (see Technique 1D and Hinson and Webber 1988).
The range of sensitivity for this protocol is 0.5-20 µg protein/ml.


MATERIALS


CAUTION: See Appendix 3 for appropriate handling of materials marked with <!>.


» Reagents


Bovine serum albumin (BSA) stock solution
Dissolve 1 mg of BSA in 10 ml of H2O.


Microreagent A

0.8 g of Na2CO3·H2O

1.6 g of NaOH <!>

1.6 g of sodium tartrate·2H2O

Adjust the volume to 100 ml with H2O. Stir the solution and adjust the pH to 11.5 with 10 M NaOH.

Microreagent B

Dissolve 4.0 g of sodium BCA in 100 ml of H2O.

Microreagents A and B are stable for 1-3 weeks when stored in plastic containers at room
temperature, and indefinitely at 4°C. Note that although the disodium salt of BCA is soluble
at neutral pH, the free acid is not readily soluble even at basic pH.


Although Microreagents A and B are simple to prepare, more reproducible results are obtained
using commercial reagents, which are available in a variety of kits from Pierce Chemical Company
(Rockford, Illinois). Commercial reagents are compatible with commonly used biological buffers,
have a higher sensitivity, and display an expanded linear range.


Microreagent C
Dissolve 0.4 g of CuSO4·5H2O in 10 ml of H2O.


Microworking reagent
Mix 1 volume of Microreagent C with 25 volumes of Microreagent B, and then add 26 volumes of
Microreagent A. The microworking reagent is stable for 24 hours at room temperature.


Sample buffer


» Equipment


Cuvettes (glass or polystyrene)
Spectrophotometer

Test tubes (small disposable) or microfuge tubes
Water bath or heating block preset to 60°C


» Biological Sample


Test sample(s)


METHOD 
1. Dilute aliquots of the test sample(s) with sample buffer to a concentration that will fall
within the range of the calibration curve (0.2-30 µg/ml). To do this, estimate the protein
concentration by measuring the absorbance of the test sample at 280 nm.


The BCA method can be used with a wide variety of buffers (see Table A2.1), but it is sound
practice to include a "buffer blank" in the assay to validate this assumption.


2. Use small disposable test tubes or microfuge tubes to prepare dilutions of a stock solution
of BSA, ranging from 0.4 to 2.0 µg/ml of protein, to use as reference standards.


3. Transfer 500 µl of each test sample and each reference standard (the final range of
reference standards is 0.2-1.0 µg of protein) into microfuge tubes. Add 500 µl of
microworking reagent to each tube and mix well.


4. Incubate the tubes for 1 hour at 60°C.


5. Cool the reactions to room temperature and then measure the absorbance at 562 nm,
using glass or polystyrene cuvettes. The color developed is stable for at least 1 hour.


6. Plot A562 versus protein concentration to generate the calibration curve, and determine
the exact concentration of protein in the test samples by interpolation as shown in
Technique 1B.
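
When many tubes are assayed, the volumes of the three microreagents follow directly from the 1:25:26 (C:B:A) mixing ratio and the 500 µl of microworking reagent added per tube. The following is a minimal sketch of that arithmetic; the function name and the 10% pipetting excess are assumptions made for illustration and are not specified in the protocol.

```python
def microworking_reagent_volumes(n_tubes, ul_per_tube=500.0, excess=0.10):
    """Return the volume (µl) of each microreagent needed for n_tubes.

    The ratio 1 volume C : 25 volumes B : 26 volumes A comes from the
    Materials list; `excess` is an assumed allowance for pipetting losses.
    """
    parts = {"C": 1, "B": 25, "A": 26}
    total_ul = n_tubes * ul_per_tube * (1.0 + excess)
    per_part = total_ul / sum(parts.values())
    return {name: count * per_part for name, count in parts.items()}


if __name__ == "__main__":
    # Example: 6 standards and 4 samples, each in triplicate (30 tubes).
    for name, volume in microworking_reagent_volumes(30).items():
        print(f"Microreagent {name}: {volume:.0f} µl")
```

Because Microreagent C is first mixed with Microreagent B and Microreagent A is added afterwards, the order of addition still has to follow the Materials list; the sketch only computes the volumes.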
TECHNIQUE 1D: 96-Well Microtiter Plate Procedure


THE BCA "MICROPROCEDURE" FOR QUANTIFYING PROTEIN (Technique 1C) can be carried out in
microtiter plates, and the results read in microplate readers. This modification permits the
protein concentration of very large numbers of samples to be determined with ease. If the
microplate reader is interfaced with a computer, more than 1000 samples can be read per
hour. This protocol has been adapted from Hinson and Webber (1988).

The range of sensitivity for this protocol is 0.5-20 µg protein/ml.


MATERIALS


CAUTION: See Appendix 3 for appropriate handling of materials marked with <!>.


» Reagents


Bovine serum albumin (BSA) stock solution (500 µg/ml)
Dilution buffer
Generally, the dilution buffer is phosphate-buffered saline (PBS). Some detergents can be used if
solubility is a problem. For reagents that interfere with the BCA assay, see Table A2.1. For
consistency, the dilution buffer used for preparing samples should also be used for preparing
the standard.
Standard reagent A
5 g of sodium BCA (4,4'-dicarboxy-2,2'-biquinoline, disodium salt)
10 g of Na2CO3·H2O
0.8 g of sodium tartrate
2 g of NaOH <!>
4.75 g of NaHCO3
Adjust the volume to 500 ml with H2O. Stir the solution and, if necessary, adjust the pH to 11.25
using 10 M NaOH.


Standard reagent B
Dissolve 2 g of CuSO4·5H2O in 100 ml of H2O.
Standard reagents A and B are stable for 1-3 weeks when stored in plastic containers at room
temperature, and indefinitely at 4°C. Note that although the disodium salt of BCA is soluble at
neutral pH, the free acid is not readily soluble even at basic pH. These reagents may be purchased
ready prepared as a kit from Pierce Chemical Company (Rockford, Illinois).


Standard working reagent
Mix 50 volumes of Standard reagent A with 1 volume of Standard reagent B. This solution is stable
for 2-3 days at room temperature and should be green in color.


» Equipment


ELISA plate reader (e.g., Multiscan MCC/340, Pathtech Diagnostics Pty. Ltd.)
Microtiter plates (96-well disposable)

Oven preset to 60°C

Pipette (multichannel 20-200 µl)


» Biological Sample


Test sample(s)
METHOD


1. Use the multichannel pipette to add 30 µl of the dilution buffer to each well in the
microtiter plate.


It may not be necessary to dilute the sample, but it is useful to prepare a range of
concentrations to ensure that at least one will fall on the standard curve, so that the
protein concentration can be established.


EXPERIMENTAL TIP: Draw a schematic of the microtiter plate that illustrates the
distribution and concentration of the protein samples (see below).

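[Schematic of the microtiter plate: 30 µl of 500 µg/ml BSA is added to the top wells of the
first three lanes, test samples (e.g., Sample #3, Sample #4) are added to the top wells of the
remaining lanes, and each lane is serially diluted down the plate.]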

2. Add 30 µl of 500 µg/ml BSA solution to the top wells in the first three lanes (i.e., positions
A1, A2, and A3). Mix thoroughly by pipetting the solution up and down several times.
The standard is performed in triplicate to verify reproducibility.


As there was 30 µl of dilution buffer already present in the well, a 1:1 dilution has been
performed; therefore, the starting concentration for the BSA standard is 250 µg/ml.


3. Add 30 µl of test sample (in duplicate) to the top wells in the remaining lanes, i.e.,
positions A4 to A12. Mix thoroughly by pipetting the solution up and down several times.


4. Use the multichannel pipette to carry out a serial dilution of each test sample and the BSA
standard. Transfer 30 µl of sample from a well in row A (which has a total volume of 60
µl per well) to the corresponding well in row B. Pipette the solution up and down in the
well to mix thoroughly. Without changing pipette tips, transfer 30 µl of sample from row
B to the corresponding well in row C. Continue the serial dilution until all wells in the
microtiter dish contain the appropriate sample.


5. Use the multichannel pipette to transfer 20 µl of each sample from the original plate to a
fresh microtiter plate.


To minimize cross-contamination and avoid changing tips between rows, begin the
transfers from the bottom row, where the samples are most dilute, and work up the plate
row by row to row A.


6. Pour the working reagent into a reservoir that will accommodate the multichannel
pipette. Add 250 µl of the working reagent to each well containing 20 µl of sample. Mix
thoroughly by pipetting the solutions up and down.


7. Incubate the plate for 30 minutes at 60°C in an oven.
FIGURE A2.2. Calibration curve for the 96-well microtiter plate BCA protein assay. The R² value is an
indication of how well the trend-line fits the plotted values, and it can be interpreted as the proportion
of the variance in y attributable to the variance in x. The closer the R² value is to 1.0, the better the fit.


8. Remove the plate from the oven, and place it on the bench top. Allow the microtiter plate 
to cool to room temperature (~10 minutes). 


9. Ensure that the wells are free of air bubbles and that the bottom of the plate is clean. Read
the absorbance at 562 nm (or at the closest available wavelength) using an ELISA plate reader.


10. Plot the average of the three absorbance readings for the BSA standard to generate a
standard curve. Plot the absorbance (562 nm) as a function of BSA concentration (µg/ml). A
straight line should result. For a sample standard curve, see Figure A2.2.


11. Calculate the sample concentrations using the straight-line formula. Remember to take 
into account the dilution factors. 
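
Steps 10 and 11 can be sketched in a few lines of code. The example below is a minimal illustration only: it assumes a twofold serial dilution down the eight rows of the plate starting from 250 µg/ml BSA in row A (Steps 2 and 4), fits the straight line by ordinary least squares, and reports R² as described for Figure A2.2. The function names and the absorbance values in the usage example are hypothetical and are not data from the protocol.

```python
def serial_dilution(start_ug_per_ml, rows=8, factor=2.0):
    """Concentration in each row (A, B, C, ...) of a serial dilution."""
    return [start_ug_per_ml / factor ** i for i in range(rows)]


def fit_line(x, y):
    """Least-squares straight line; returns (slope, intercept, r_squared)."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    ss_res = sum((yi - (slope * xi + intercept)) ** 2 for xi, yi in zip(x, y))
    ss_tot = sum((yi - mean_y) ** 2 for yi in y)
    return slope, intercept, 1.0 - ss_res / ss_tot


def sample_concentration(a562, slope, intercept, row_index):
    """Concentration of the undiluted test sample (µg/ml).

    row_index 0 is row A, which already holds a 1:1 dilution of the sample,
    so the dilution factor for row r is 2 ** (r + 1).
    """
    in_well = (a562 - intercept) / slope
    return in_well * 2 ** (row_index + 1)


if __name__ == "__main__":
    standards = serial_dilution(250.0)
    # Hypothetical averaged A562 readings for the BSA standard, rows A to H.
    readings = [1.06, 0.55, 0.30, 0.17, 0.11, 0.08, 0.06, 0.05]
    slope, intercept, r2 = fit_line(standards, readings)
    print(f"slope={slope:.5f} intercept={intercept:.3f} R2={r2:.4f}")
    print(f"{sample_concentration(0.40, slope, intercept, row_index=2):.0f} µg/ml")
```

Only wells whose absorbance falls within the range spanned by the standards should be passed to `sample_concentration`; the sketch does not enforce this.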


TECHNIQUE 2 


Coomassie Blue Dye-binding Assay (Bradford Assay) 
for the Determination of Protein Concentration 


THE EASE, SPEED, AND SENSITIVITY OF THE BRADFORD ASSAY (Bradford 1976) no doubt account
for its continuing popularity. Binding of the Coomassie Blue dye is generally carried out in a
reaction mixture that contains sufficient phosphoric acid to ensure that most of the dye
molecules become protonated and absorb light at 465 nm. However, the pH of the assay must be
high enough to allow a small amount of dye to survive in the nonprotonated blue form,
which absorbs at 595 nm in solution and is available to bind to protein. Equilibrium shifts
between free forms of the dye prevent depletion of the unprotonated blue form as a
consequence of binding to protein.

The dye-protein complexes are traditionally detected by an increase in absorption at 595
nm, which results from conversion of protonated dye to unprotonated blue molecules bound
to protein. However, a small metachromatic shift occurs on binding of Coomassie to protein,
and the λmax of the dye-protein complex is 620 nm, where the sensitivity of the method is
maximal.

The chief disadvantage of the method is the large variability of color development among 
different proteins. The change in absorbance per unit mass of protein varies over a three- to 
fourfold range depending on the nature of the protein assayed (Pierce and Suelter 1977; Van 
Kley and Hale 1977). Ideally, the reference protein used to construct the standard curve 
should be the same as the test protein. However, in most cases, this is impractical and the dye 
response of a sample is generally quantified relative to that of a generic protein (e.g., BSA), 
which permits comparisons with previous studies using the same standard. Note, however, 
that BSA is atypical because it exhibits an unusually large response to the Coomassie Blue dye
and hence may result in an underestimate of the concentration of a test protein. Ovalbumin
and γ-globulin may be better choices for protein standards, because their dye-binding capacity
is similar to that of other proteins (Read and Northcote 1981). Because of the uncertainty
in relating the absorbance values obtained with the test protein to those obtained with the
reference protein, the Bradford assay has been largely displaced by the BCA method as the 
technique of choice to quantify protein in solution. 

The range of sensitivity for this protocol is 10-100 µg/ml for the standard assay and 1-10
µg/ml for the microassay.

MATERIALS


CAUTION: See Appendix 3 for appropriate handling of materials marked with <!>.


» Reagents


Bradford stock solution


Dissolve 350 mg of Coomassie Blue G250 (Serva Blue G) in 100 ml of 95% ethanol <!>. Mix the
resulting solution with 200 ml of 88% phosphoric acid <!> (stable indefinitely at room
temperature). Although the amount of soluble dye in various commercial preparations of Coomassie
Blue G250 varies markedly (Wilson 1979) (generally, Serva Blue G is regarded as having the
greatest dye content), the quality of the dye is not critical for routine protein determination.


Bradford working buffer


Mix together 425 ml of distilled H2O, 15 ml of 95% ethanol, 30 ml of 88% phosphoric acid, and 30
ml of Bradford stock solution. Pass the reagent through Whatman No. 1 filter paper, and store it in
an amber bottle at room temperature. The working buffer remains usable for several weeks.
However, during this time, dye may precipitate from the solution; hence, it is recommended that
the stored reagent be filtered before use.


Buffer or solvent used to prepare the sample
Reference protein stock solution (1 mg/ml)


Dissolve 10 mg of BSA in 10 ml of distilled H2O (store the stock solution at -20°C in small 200-µg
aliquots). Since the moisture content of lyophilized proteins varies markedly (e.g., 5-15%)
depending on the nature of their storage, the precise concentration of protein in the protein
standard stock solution should be determined by amino acid analysis or from its absorbance at
280 nm. The absorbance at 280 nm of a 1 mg/ml solution of BSA in a 1-cm light-path-length
cuvette is 0.70. The corresponding values for γ-globulin and ovalbumin are 1.38 and 0.79,
respectively.


» Equipment


Cuvettes


Plasticware and glassware should be absolutely clean and detergent-free. Disposable plastic cuvettes
are recommended because dye accumulates on cuvette walls and is difficult to remove. Traces of dye
bound to glassware or plasticware can be removed by a thorough rinsing with methanol <!> or
washing with concentrated glassware detergent, followed by H2O and ethanol. Quartz (silica)
cuvettes should not be used, since the dye sticks to them tenaciously.


Microfuge tubes
Spectrophotometer


» Biological Sample
Sample of test protein 
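
The absorbance check described above for the reference protein stock solution amounts to a one-line calculation. The following is a minimal sketch, assuming the A280 values quoted in the Materials list for 1 mg/ml solutions in a 1-cm cuvette; the function name, the dictionary keys, and the dilution-factor argument are illustrative assumptions.

```python
# A280 of a 1 mg/ml solution in a 1-cm light path, as quoted in the Materials list.
A280_PER_MG_PER_ML = {
    "BSA": 0.70,
    "gamma-globulin": 1.38,
    "ovalbumin": 0.79,
}


def stock_concentration_mg_per_ml(a280, protein="BSA", dilution_factor=1.0,
                                  path_length_cm=1.0):
    """Estimate the protein concentration of a stock from its A280.

    `dilution_factor` is how many fold the stock was diluted before reading
    (1.0 means it was read undiluted).
    """
    per_mg = A280_PER_MG_PER_ML[protein]
    return a280 * dilution_factor / (per_mg * path_length_cm)


if __name__ == "__main__":
    # A nominal 1 mg/ml BSA stock that reads 0.63 would actually be 0.9 mg/ml.
    print(round(stock_concentration_mg_per_ml(0.63, "BSA"), 2))
```

The corrected concentration, rather than the nominal 1 mg/ml, would then be used when the masses of reference protein in the dilution series are calculated.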


METHOD 


1. Prepare a dilution series of the reference protein in duplicate or triplicate, as follows:


Pipette 2.5, 5, 7.5, 10, 15, and 100 µl of a 1 mg/ml solution of the reference protein into
microfuge tubes, and adjust each to 100 µl with the buffer used to prepare the test sample.
Pipette 100 µl of the buffer into a microfuge tube to provide the reagent blank.


2. Pipette 2.5-20 µg of the test protein solution (maximum volume 100 µl) into microfuge
tubes. Adjust the volume in each tube to 100 µl with buffer. If not even an estimate of the
concentration of protein in the test solution is available, assay a range of dilutions (e.g., 1,
1/10, 1/100, 1/1000). Prepare duplicates (or triplicates) of each sample.


3. Add 1 ml of Bradford working buffer to each tube of sample and reference protein. Mix
the reactions well by inversion or gentle vortexing. Avoid foaming as this will lead to poor
reproducibility.


4. Incubate the reactions for 5 minutes at 25°C.


5. Measure the A595 of the samples and standards against the reagent blank between 2
minutes and 1 hour after mixing.


Color development is complete after 5 minutes. Prolonged incubation (10-15 minutes)
causes microprecipitation of protein that increases with time and can result in visible
aggregates. This phenomenon is especially noticeable at higher protein concentrations and is
believed to be due to the acidic conditions of the assay. To minimize any error due to color
loss, read the OD of test samples within 10 minutes of the standards. Error due to color loss
should then be <2% (Peterson 1983).


6. Prepare a calibration curve and estimate the protein content of the test samples as
described in Technique 1B for the BCA assay.


EXPERIMENTAL TIP: The standard curve for the Bradford assay becomes nonlinear at
high concentrations of protein due to the depletion of free dye. For this reason, it is
essential to include a calibration curve with each assay. If necessary, the linearity of the assay
can be improved by plotting the A620/A465 ratios, which corrects for free-dye depletion (Sedmak
and Grossberg 1977).


TECHNIQUE 3 


Purification of Urea 


THIS TECHNIQUE REMOVES CYANATE IONS THAT ARE PRODUCED IN UREA solutions upon standing;
the extent of formation varies with the pH of the solution (Hagel et al. 1971). Cyanate
ion formation in aqueous urea solutions occurs more rapidly at high pH and high temperatures.
Hence, unbuffered urea solutions should be stored under acidic conditions at 4°C.
Alternatively, after deionization and filtration, the water from the urea filtrate can be
evaporated, and the crystallized urea can be collected and stored in the dry state at 4°C. Under
these conditions, crystalline urea can be stored for several weeks without significant cyanate ion
formation. The concentration limits of common reagents in the Bradford assay are listed in
Table A2.1 in Technique 1.


MATERIALS 
CAUTION: See Appendix 3 for appropriate handling of materials marked with <!>. 
» Reagents
Deionizing resin (e.g., AG 501-X8 mixed-bed ion-exchange resin, Bio-Rad) <!>
AG 501-X8, which consists of equivalent amounts of AG 1-X8 (a strong anion-exchange resin) and
AG 50W-X8 (a strong cation-exchange resin), contains a dye that changes from blue-green to gold
when its exchange capacity is exhausted.
Heptane, warmed to 40-50°C <!>
Heptane is extremely flammable.
Urea, analar (or technical) grade <!>
» Equipment
Buchner funnel or sintered glass filtration funnel
Filter papers (e.g., Whatman No. 2 and Whatman cellulose Grade 1 Chr) or equivalent 
Ice bath 
METHOD 


Removal of Cyanate Ions from Urea Solutions


1. Prepare an aqueous solution of urea to the desired molarity.
2. Just prior to use, deionize the urea solution by either


* passing the solution through a column of mixed-bed deionizing resin


or

* swirling the urea solution with deionizing resin for several hours at room temperature.
Recover the urea solution by filtering it through cellulose filter paper, Grade 1
Chr (11 µm), or a sintered glass filtration funnel of high porosity.


The deionized urea solution should have a conductivity of less than 2 µS. Urea solutions
containing amine buffers such as Tris have an advantage in that the amine competes with the
protein for the cyanate.


Recrystallization of Urea


3. Dissolve urea (10 mg/ml) in warm heptane (40-50°C) and filter rapidly. 
4. Cool the mixture to 5°C in an ice bath, which will result in crystal formation. 


5. Use a Pasteur pipette to remove the mother liquor and set it aside at -20°C for further 
recrystallization. 


6. Harvest the crystals by filtration using Whatman No. 2 filter paper. Allow the recovered 
crystals to dry in the dark in a chemical fume hood. 


7. Store the recrystallized urea at -20°C in a dry container in the dark.


TECHNIQUE 4 


Separation of Peptides and Proteins by HPCEC


Thomas P. Hennessy," Marion I. Huber,’ Klaus K. Unger,’ Reinhard I. Boysen,* and Milton T.W. Hearn’ 


"Centre for Bioprocess Technology, Department of Biochemistry and Molecular Biology, Monash University, Clayton, Victoria 3800, 
Australia; Institut für Anorganische und Analytische Chemie, Johannes Gutenberg Universitat, Mainz, FOR 


ALTHOUGH CLOSELY RELATED TO HPLC AND HPCE, high-performance capillary electrochromatography
(HPCEC) is now claiming its own unique orthogonal place in the separation sciences,
particularly with regard to the nanoscale analysis of peptides and proteins. Moreover,
the technique capitalizes on existing mobile-phase technology and knowledge to improve 
separations. To extensively exploit the hybrid characteristics of CEC, the peptide or protein 
chemist, as well as the chromatographer, must understand how the background electrolyte 
simultaneously propagates flow and achieves the selectivity needed for high-resolution sepa- 
rations, particularly with biomolecules (Walhagen et al. 2000a; Zhang et al. 2000). 

In packed-capillary CEC, the electro-osmotic flow (EOF) is predominantly caused by the 
charged surface of the particles in the column bed (Ross et al. 1996). An analogous situation
occurs with chemically modified open tubular CEC, where the EOF is generated from the
residual silanol groups of the etched and surface-modified capillary (Walhagen et al. 2000b;
Matyska et al. 2001). The contributions to the EOF of the capillary wall and inner and outer 
surface of the particles, respectively, have been discussed elsewhere (Grimes et al. 2000; Liapis 
and Grimes 2000; Walhagen et al. 2000b). Thus, in CEC, the EOF velocity depends on the 
surface charge of the sorbent, the temperature, viscosity, and composition, such as dielectric 
constant, ionic strength, and pH, of the eluent (Walhagen et al. 2001). Moreover, in the 
implementation of CEC analytical procedures, the unique biophysical properties exhibited 
by peptides and proteins must be taken into consideration. A particular peptide or protein 
will display a defined isoelectric point (pI) value whereby the net charge of the molecule
reaches zero with a buffer of specific pH value. The nature of the amino acid sequence and 
the extent of structure/folding that has occurred in the presence of a particular background 
electrolyte determine the effective charge and its relationship to the pI value (Walhagen et al. 
2001). Apart from chromatographic interactions, which can be most conveniently adjusted 
via the organic modifier content in the background electrolyte, the electrophoretic migration 
velocity can also be tuned by varying the pH and/or the ionic strength of the buffer. It must 
be emphasized that the separation in CEC is governed by both chromatographic retention 
(thermodynamic process) and electrophoretic migration (kinetic process) (Walhagen et al.
2000a, 2001; Zhang et al. 2000). However, this book is not the platform to discuss in theoret- 
ical depth all of the relevant variables and their dependencies; instead, some general consid- 
erations toward the practical understanding of retention/migration processes are detailed. 
The major task throughout this technique will be to provide familiarization with instrumen- 
tal aspects of this separation technique and how it is applied in peptide analysis (for a sketch 
of the instrumental setup, see Figure A2.3). 
FIGURE A2.3. Schematic showing the instrumental setup for CEC separation of peptides or proteins.


MATERIALS 


CAUTION: See Appendix 3 for appropriate handling of materials marked with <!>. 


» Reagents
H3PO4 <!> (or other acids and bases to prepare buffers)
HPLC-grade acetonitrile <!>, 2-propanol <!>, methanol <!>, or other suitable organic
solvent with excellent transparency in the UV range down to 210 nm
Na2HPO4 (or other suitable salts) <!>
Thiourea <!> or sodium nitrate <!>


» Equipment

Analytical C18 column
Other n-alkylsilica sorbents can be used in place of C18. The average particle diameter of the
sorbent should be 0.5-5 µm, porosity 60-300 Å, and a suitable phase ratio should be established.
Pack the sorbent in analytical capillaries of the appropriate dimensions, e.g., 3 µm, 80 Å,
340 mm x 100 µm I.D. Sorbents of different surface chemistries can also be employed, such as ion
exchange or mixed mode, to achieve alternative selectivities, provided the surface allows
generation of a relatively high EOF under the experimental conditions.

CEC apparatus
The CEC equipment consists of a high-voltage supply, spectrophotometer with interface, analytical
column, cartridge thermoelement, autosampler, PC, printer and software, CE system with
attendant data management systems, and system automation controllers. The equipment system may be
constructed in the lab from modular components or a commercially available instrument (e.g., the
Agilent 3D CE system).

Nitrogen, pressurized gas <!> 
Nitrogen is used for autosampler operation or pressurization of background electrolyte vials. 


Sonicator, benchtop (e.g., Eltrosonic Type 07, Kühn & Bayer) 


» Biological Sample
Proteins or peptides of interest 
PREPARATION OF BACKGROUND ELECTROLYTE


1. Prepare 1 liter of each buffer stock solution at 100 mM. For example, weigh 14.196 g of Na2HPO4
and dissolve in 900 ml of H2O. Adjust the pH with concentrated H3PO4 and then add H2O to
the final volume. Store the stock solutions at 4°C.


2. Prepare background electrolyte by diluting an aliquot of the buffer stock solution with the
appropriate amount of H2O and adding the desired amount of organic modifier, e.g., 100 ml of
acetonitrile. As an example, 5 ml of stock solution, 45 ml of H2O, and 50 ml of acetonitrile yield
a 5 mM background electrolyte.

The molarity refers to the overall composition. The volumes of organic solvent, stock solution, and
H2O must be measured independently and then combined because of the volume contraction
that occurs upon mixing, which may be up to 30 ml/liter of prepared solvent (depending on the
nature of the organic solvent). Failure to do so can lead to substantial errors in mobile phase
composition. Be aware that changes in the organic modifier content will also influence the pH of
the eluent. By convention, the definition of the pH value used in the procedures described here
only applies for H2O-only solutions. For the background electrolytes, the pH detailed is really
the pH of the buffer used in the background electrolyte.


3. Mix the solvent either by stirring with a Teflon-coated magnetic flea or by shaking in a stoppered 
cylinder (time depends on volume). 


WARNING: Parafilm should not be used under any circumstances to cover flasks or cylinders con- 
taining the eluents used in CEC, because the organic solvents in combination with buffer compo- 
nents may dissolve components in the Parafilm, yielding extraneous peaks in the chromatogram. 


4. Filter all eluents through a 0.2-µm PTFE filter.


Filtering of eluents increases the column lifetime and contributes to degassing.


5. Degas the background electrolyte by sonication.
IMPORTANT: Degas solvents for use as background electrolyte extraordinarily thoroughly.


6. Close unused eluent bottles with a stopper to avoid evaporation of the organic solvent. 
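
The arithmetic behind Steps 1 and 2 can be sketched as follows. This is a minimal illustration, assuming a molecular weight of 141.96 g/mole for anhydrous Na2HPO4 (the value implied by 14.196 g per liter at 100 mM) and taking the molarity as the nominal value for the sum of the separately measured volumes, as stated above; the function names are illustrative.

```python
NA2HPO4_MW = 141.96  # g/mole, anhydrous; implied by 14.196 g/liter at 100 mM


def grams_for_stock(molarity_mM, volume_liters, mol_weight=NA2HPO4_MW):
    """Mass of salt (g) needed for a buffer stock solution."""
    return molarity_mM / 1000.0 * volume_liters * mol_weight


def nominal_molarity_mM(stock_mM, stock_ml, water_ml, organic_ml):
    """Nominal buffer molarity of the background electrolyte.

    The three volumes are those measured separately before mixing; the
    volume contraction that occurs on mixing is deliberately ignored.
    """
    return stock_mM * stock_ml / (stock_ml + water_ml + organic_ml)


if __name__ == "__main__":
    print(round(grams_for_stock(100.0, 1.0), 3))               # mass for Step 1
    print(round(nominal_molarity_mM(100.0, 5.0, 45.0, 50.0), 2))  # Step 2 example
```

The same two functions apply to other buffer salts if the appropriate molecular weight is supplied.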


METHOD 


Preparation of the Sample 


1. Dissolve the protein sample in H2O (weak mobile phase) to achieve the desired
concentration. If the sample is insoluble, add small amounts (typically <25% of the total
volume) of organic modifier (strong mobile phase).


IMPORTANT: Do not dissolve the sample in background electrolyte, because buffers/addi- 
tives may be changed frequently especially during method development. 


2. Inspect the sample for clarity. Filter the sample through a 0.2-µm PTFE filter if it is
not fully dissolved, i.e., if opalescence or solid particles are present. Alternatively,
centrifuge the sample and use the supernatant for injection.

Samples that are not fully dissolved should not be injected into the column because they can
block the column.

3. Store the sample at 4°C or -20°C (depending on the planned storage time and usage) 
until ready for use. Peptides and protein samples can degrade at room temperature and 
during storage. 

Avoid repeated freezing/thawing of the sample. Instead, prepare small aliquots of the
peptide or protein sample, store them at -20°C, and use each aliquot only once.
ISOCRATIC SEPARATIONS OF PEPTIDES AND PROTEINS WITH CEC


The elution window for peptides and proteins is known to be very narrow in comparison to other 
molecules; thus, it may prove helpful to construct a separation around data acquired in HPLC 
and/or capillary electrophoresis. However, if such data are not available, the investigator must
anticipate the appropriate separation conditions, e.g., the stationary-phase properties, peptide
composition, and conformational propensities. In the case of peptide samples of unknown composition or
structure, such as would arise from the enzymatic digestion of a new protein, follow the strategy 
that mimics a course similar to that followed with micro-HPLC with narrow-bore columns. 

The ionic strength of the background electrolytes applied most commonly ranges from 5 to 20
mM, and the volume fraction of organic modifier encompasses the wide range of 20-70%, depend- 
ing on peptide charge and hydrophobicity for isocratic elution. The pH can be adjusted with vari- 
ous buffers from pH 3 to 8. Phosphate, acetate, and Tris buffers are popular for these adjustments, 
with acetate buffers of relatively low ionic strengths particularly suitable for CEC-MS applications 
due to their volatility. 

A good starting point for finding a set of separation conditions is to utilize a 10 mM ammonium 
acetate buffer and to vary the organic modifier (e.g., acetonitrile) content in the background elec- 
trolyte. Once the elution window is determined, optimize the separation by varying other condi- 
tions. If the variation of conditions is performed too drastically in terms of EOF velocity alteration, 
cavity formation is likely to occur and the capillary must be reconditioned externally on an HPLC
pump.


Testing the Instrumental CEC 


1. Install the capillary in the instrument as detailed by the manufacturer or as appropriate 
if a lab-made instrument is utilized. 


2. Supply the instrument with background electrolyte and sample(s). 


. Produce a blank run (inject background electrolyte) and run under the same conditions 


as those intended for the separation of the peptide or protein sample. Repeat if "ghost 
peaks occur. 


. Produce a background run (inject the solution in which the sample is dissolved when dif- 


ferent from the background electrolyte) 


Since buffers are frequently changed during the optimization procedure, dissolve samples in 
H,O if possible or H,O and an organic modifier to help solvation; do uot dissolve samples 
in the buffer/background electrolyte. The importance of the background run in establishing 
an acceptable separation with the peptide or protein mixture cannot be averemphasized. 


. Measure the dead time of the column with thiourea or sodium nitrate (or any other non- 


interactive solute). 


The choice of the dead time marker is dictated by the separation conditions. Neutrality must 
be secured for the conditions in question if accurate values of the EOF are to be determined. 


6. Test the column performance with an isocratic run and an appropriate test mixture. For
example, a suitable HPCEC test mixture includes 0.15% (w/v) dimethylphthalate, 0.15% (w/v)
diethylphthalate, 0.01% (w/v) diphenyl, 0.03% (w/v) o-terphenyl, and 0.32% (w/v)
dioctylphthalate in methanol (Walhagen et al. 2000a,b, 2001).

First, this test allows the evaluation of the column bed integrity (low integrity will be
associated with split, fronting, or tailing peaks) and column performance (in terms of plate
numbers). Second, this test allows, if repeated at regular intervals, the monitoring of the
performance during the lifetime of a column and the assessment of batch-to-batch differences
of column fillings.
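The two quantities recorded in these tests, the EOF velocity from the dead time marker and the plate number from the test mixture, can be computed as in the following minimal sketch. It assumes the standard half-height relation N = 5.54 (tR/w)^2 for the plate number and takes the EOF velocity as the length to the detector divided by the dead time; the function names and all numerical readings are hypothetical and serve only to illustrate the arithmetic.

```python
def eof_velocity_mm_per_s(effective_length_mm: float, dead_time_s: float) -> float:
    """Linear EOF velocity: distance to the detector divided by the dead time."""
    return effective_length_mm / dead_time_s


def plate_number(retention_time_s: float, half_height_width_s: float) -> float:
    """Plate number from the peak width at half height: N = 5.54 * (tR / w)^2."""
    return 5.54 * (retention_time_s / half_height_width_s) ** 2


# Hypothetical readings (not taken from the manual).
u_eof = eof_velocity_mm_per_s(effective_length_mm=80.0, dead_time_s=100.0)
n_plates = plate_number(retention_time_s=300.0, half_height_width_s=3.0)

print(f"EOF velocity: {u_eof:.2f} mm/s")
print(f"Plate number: {n_plates:.0f}")
```

Logging both values each time the test mixture is run gives a simple record for following column performance over its lifetime.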


Analytical RPCEC of Peptides and Proteins 


1. Prior to sample analysis, conduct at least one blank run under the same conditions as
those intended for the peptide sample. Repeat if "ghost" peaks occur.


2. Separate the sample with an analytical RPCEC procedure in order to assess the sample in
terms of purity, peak profile, and elution conditions with the experimental conditions as
outlined below.


3. Perform peak assignment.


SELECTION OF THE ELECTROCHROMATOGRAPHIC CONDITIONS FOR THE ANALYTICAL SEPARATION OF PEPTIDE
AND PROTEIN MIXTURES


The following electrochromatographic conditions are representative of the type of standard
analytical protocol that can be employed. Numerous variations in terms of choice of
reversed-phase sorbent, applied voltage, run duration, background electrolyte composition
(including the choice of different organic solvent modifiers, buffer species, or ion-pairing
reagents) and pH, temperature, and sample size (controlled by injection mode/duration) can be
contemplated. The following isocratic procedure represents a robust protocol around which
these variations can be constructed.

* Capillary column: A C4, C8, or C18 reversed-phase sorbent packed into a suitable column
(e.g., 3 µm, 300 Å, 330 mm x 100 µm I.D.)

* Background electrolyte: Acetonitrile/5 mM Na2HPO4 buffer (pH 7.5) (50:50 v/v)

* Isocratic elution

* Injection: 5 kV, 3-second sample; 5 kV, 3-second background electrolyte (see also the panel on
ADDITIONAL REMARKS AND CONCLUSIONS below)

* Electric: 15 kV

* Detection: 214 nm

* Temperature: Room temperature


FIGURE A2.4. Schematic illustrating the different injection modes based on pressure-driven
hydrodynamic flow as used in µ-HPLC and electrokinetic flow as used in CEC. The direction of
liquid flow is highlighted by arrows.


Additional Remarks and Conclusions


* Injection: There are several choices for CEC injection (see Figure A2.4): hydrodynamic injection 
(by pressure) and electrokinetic injection (applying voltage, current, or power). As opposed to cap- 
illary electrophoresis, injection by voltage is the most frequently used injection technique in CEC, 
despite the obvious problem with differences in injection concentrations for molecules with differ- 
ent mobilities. Reproducible hydrodynamic injection is not applicable due to the nature of the sor- 
bent immobilized in the capillary. High pressures cannot currently be delivered accurately enough 
to the capillary to favor this injection mode. In the various examples found in the literature, the
injection consists of two individual parts. The first is the injection of the sample itself, followed by
injection of background electrolyte, thus exploiting a focusing effect if the electrophoretic velocity
of the background electrolyte is higher than that of the components of the sample.

* Electrical polarity effects: Most instruments allow for reversal of the electric field polarity. This 
feature can be exploited to reverse the direction of flow. Voltage, current, and power are related 
to one another by the resistance of the capillary/background electrolyte system; thus, mechani- 
cally stable frits may experience disintegration in an electric field applied across the capillary 
inner diameter. In the constant voltage mode, current can be utilized to indicate the stability of
the system. A rapid alteration or decrease of current indicates cavity formation or a clogged or
broken capillary and can thus be used to identify unstable or inappropriate conditions.

* Operation mode: Most instruments (this applies for lab-made as well as commercially available 
instruments) require capillaries designed to consist of a packed bed and an open tubular
segment. In instruments with fixed locations for the detection capillaries, these instruments can be
operated either in the "regular" operation mode or in the mode referred to as "short end"
operation mode, with the injection and flow direction adapted appropriately (see conditions for
CASE STUDY and the legend to Figure A2.5).


FIGURE A2.5. Electrochromatogram of three synthetic TRAP (thrombin receptor antagonist peptide)
analogs (intensity in mAU plotted against time in minutes). Note that the peptides assigned to
peaks 2 and 3 differ only in the position of the alanine residue substituted for the corresponding
amino acid residue in the sequence. Hydrodynamic injection results in the first peak (an eigen
peak, not assigned) in the electrochromatogram (for other instrumental details, see text).
Sequences:
Peptide peak 1: H-Ser-Ala-Leu-Leu-Arg-Asn-Pro-OH
Peptide peak 2: H-Ser-Phe-Ala-Leu-Arg-Asn-Pro-OH
Peptide peak 3: H-Ser-Phe-Leu-Ala-Arg-Asn-Pro-OH


Case Study


Three synthetic thrombin receptor antagonist peptide analogs (TRAP analogs) were analyzed by 
CEC. The capillary column employed was operated in the short-end operational mode (see also 
ADDITIONAL REMARKS AND CONCLUSIONS above). The electrochromatogram is displayed in Figure 
A2.5. The separation conditions are detailed below: 


* Instrument: Agilent Technologies 3DCE modified with external pressure supply

* Stationary phase: Noncommercial reversed-phase (C18) porous silica (3 µm, 80 Å)

* Column: 330 mm x 100 µm I.D.; effective length (packed bed length including frits) 8 cm

* Background electrolyte: Acetonitrile/5 mM Na2HPO4 buffer (pH 7.5) (20:80 v/v)

* Elution mode: Isocratic

* Operation mode: Short end

* Injection: 12 bar, 1 second

* Electric: +10 kV

* Detection: 200 nm

* Temperature: 15°C (cartridge), 25°C (sample tray)

* Pressurization: 9 bar to both vials (inlet and outlet)
Automated Two-dimensional LC-MS/MS for Large-scale
Protein Analysis


AMONG THE PROTEIN SEPARATION AND IDENTIFICATION METHODS currently in use, two-dimensional
polyacrylamide gel electrophoresis (2D-PAGE) followed by mass spectrometry (MS) is the most
widely used. 2D-PAGE resolves thousands of proteins in a single gel. However, the obstacles to
high-throughput large-scale protein analysis, a key issue in proteomics, by 2D-PAGE are
manifold.

* The identification of these proteins requires time-consuming steps to digest the proteins
within the gel, extract individual protein "spots" from the gel, and prepare the extracted
samples for MS analysis.


* Experimental 2D-PAGE has not been fully automated.


* Some cellular proteins are rarely isolated in 2D-PAGE studies, such as membrane-associated
proteins and those with extremes in either pI or molecular weight.


Nearly two decades ago, we presented an automated 2D high-performance liquid chro- 
matography (2D-HPLC) technique for systematic separation of very complex peptide or pro- 
tein mixtures (Takahashi et al. 1985, 1991; Isobe et al. 1991b), and applied the technique to 
the sequencing of very large proteins and protein genetic variants (Takahashi et al. 1984, 
1987a,b), and for the profiling of proteins expressed in developing rat cerebella (Isobe et al. 
1991a; Taoka et al. 1992, 1994). This technique has now been refined by incorporating new
MS technology, such as replacing a conventional UV detector by a mass spectrometer with an
electrospray ionization (ESI) source. This technique describes a new platform for proteomics:
on-line 2D-LC-MS (MS/MS), which provides a fully integrated analytical system for non-gel- 
based large-scale protein identification. 


THE PRINCIPLE OF 2D-LC IS SIMILAR TO THAT OF 2D-PAGE 


2D-PAGE separates proteins according to their intrinsic charge and molecular weight by the 
combination of isoelectric focusing (IEF) and SDS-PAGE. The two-dimensional chromato- 
graphic technique described here employs two columns having different separation specifici- 
ties: 

* An ion-exchange column that separates proteins and peptides according to charge.


* A reversed-phase column that separates molecules on the basis of hydrophobicity.

Chromatography is performed sequentially by the two columns under time-dependent
control of the flow system for each column. The resolution power of the chromatography is
multiplied by the combination of the two columns. For example, if each column has a peak
capacity of 50, then the theoretical separation capacity of the 2D-LC system is 50 x 50 = 2500.
Integration of MS adds significant advantages to the 2D-LC technique. With the ability of MS
to separate biological samples very precisely according to molecular mass, direct coupling of
MS with 2D-LC further increases the resolution of the 2D-LC system. When combined with
automated tandem mass spectrometry (MS/MS), the system also provides structural
information on protein samples simultaneously upon detection. Thus, ~10,000 MS/MS spectra
can be obtained in a typical 2D-LC-MS/MS analysis of extremely complex peptide mixtures,
such as a tryptic digest of crude cellular extracts, from which thousands of peptides can be
assigned using the available gene and protein databases.
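
The multiplication of peak capacities quoted above can be written out as a short sketch. It assumes the two separations are fully independent (orthogonal), which is the idealized case behind the 50 x 50 figure; real systems generally fall short of this theoretical value. The function name is hypothetical.

```python
def combined_peak_capacity(first_dimension: int, second_dimension: int) -> int:
    """Theoretical 2D peak capacity, assuming fully orthogonal separations."""
    return first_dimension * second_dimension


# The example from the text: each column resolves 50 peaks.
capacity = combined_peak_capacity(50, 50)
print(f"Theoretical 2D-LC peak capacity: {capacity}")
```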


INSTALLATION OF THE 2D-LC SYSTEM 


Several installations are possible for performing automated 2D-LC-MS (MS/MS). For 
instance, Link et al. (1999) have developed a system coupled with ESI-MS, in which a single 
capillary column is packed with two independent chromatography phases in series (also see 
Chapter 8, Protocols 4 and 5). The system presented here is composed of two independent, 
conventional HPLC assemblies with an ion exchange or a reversed-phase column connected 
in tandem through a six-way column-switching valve, and a mass spectrometer with an ESI
source. In this system, the elution of solutes from the ion-exchange column (IXC) is stopped
during the chromatography in order to perform the reversed-phase chromatography (RPC), 
to which the eluent from the IXC is applied. After the completion of the chromatography of
the RPC, chromatography of the IXC is started again; the eluent is applied to the RPC, and
the RPC is repeated as in the first step. These separation cycles are repeated over and over
again by a PC-based control program (see Figure A2.6). Figure A2.7 shows a flow diagram of
the system, together with the multiple column assembly connected to the column-switching
valve. This system is equipped with a small reversed-phase "trap" column inserted between
the two analytical columns that serves to remove salts from peptides and proteins eluted from
the IXC. This installation is important for reproducible analysis because the salts prevent
efficient ionization of samples on MS analysis and often cause trouble in the system. A PC
program controls the two HPLC assemblies and the column-switching valve to synchronize the
ion-exchange and the reversed-phase chromatography. The flow system is followed by a series of
PCs for MS data collection and handling.


FIGURE A2.6. Time-dependent elution program of the 2D-LC system. The program synchronizes
ion-exchange and reversed-phase chromatography by controlling two LC assemblies (LC-1 and LC-2)
and an electrical six-way column-switching valve (E-valve).


FIGURE A2.7. Schematic diagram of the 2D-LC-MS (MS/MS) system. The assemblies are explained in the
text. (M) Manifold; (D) drain.


MATERIALS


CAUTION: See Appendix 3 for appropriate handling of materials marked with <!>.


Reagents


Ammonia, aqueous <!>
Ammonium bicarbonate buffer (10 mM, pH 8.0) <!>
Column buffers and solvents:
Buffer B1: 0.025 M Tris-Cl buffer (pH 8.0)
Buffer B2: 0.025 M Tris-Cl buffer (pH 8.0) containing 0.4 M NaCl
Solvent B3: 0.2% formic acid <!> in H2O
Solvent B4: 0.2% formic acid in acetonitrile <!>
Solvent B5: 1% formic acid in H2O


Dithiothreitol (DTT) <!>
HCl (6 M) <!>
Iodoacetamide (IAA) <!>


TPCK-treated trypsin (Worthington Biochemicals)
Tris-Cl (0.5 M, pH 8.5) containing 7 M guanidinium hydrochloride <!> and 10 mM EDTA


Equipment


Columns
First dimension: Bioassist Q (5 µm, 2 mm I.D. x 35 mm) (Tosoh Corp.)
Second dimension: Mighty-sil C18 (3 µm, 0.32 mm I.D. x 100 mm) (Kanto Chemicals, Tokyo)
Trap column: Mighty-sil C18 (15 µm, 1 mm I.D. x 5 mm) (Kanto Chemicals, Tokyo)


HPLC assemblies (Shimadzu) with an ion-exchange or a reversed-phase column connected
in tandem through a six-way column-switching valve (VICI)


Mass spectrometer with ESI source (QTOF 2, Micromass)
Nitrogen gas <!>


Biological Sample
Protein mixture, lyophilized


METHOD 


Preparation of the Protein Sample 


1. Dissolve 200-500 µg of protein in 200 µl of 0.5 M Tris-Cl (pH 8.5) containing 7 M
guanidinium hydrochloride and 10 mM EDTA. Bubble the solution with nitrogen gas for 10
minutes.


2. Add 100 µg of DTT (~3 mM final concentration) to the protein solution at room
temperature with gentle mixing. Incubate the solution for 2 hours at room temperature.


3. Add 250 µg of IAA (~7 mM final concentration) and leave the solution in the dark for 1
hour at room temperature.


4. Dialyze the solution for 8 hours against four changes of 10 mM ammonium bicarbonate
buffer (pH 8.0).


If the amount of protein in the solution is below 100 µg, the solution can be dialyzed using
a small dialysis unit (Pierce Chemical Company) to minimize adsorption to the dialysis
membrane. Because the sample solution is often viscous, neither ultrafiltration nor gel
filtration is recommended for desalting.


5. Add 1-2 µg of TPCK-treated trypsin to the protein solution and digest overnight at
37°C.


6. Acidify the digest to pH 2 by adding an aliquot of 6 M HCl and, where necessary, remove
any precipitates by centrifugation.


7. Neutralize the supernatant with aqueous ammonia to pH 8, dilute the peptide mixture
with an equal volume of H2O, and apply to the 2D-LC-MS/MS system.
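
The approximate final concentrations quoted for the reducing and alkylating reagents can be checked with a few lines of arithmetic. The sketch below is illustrative only: it assumes molecular weights of 154.25 for DTT and 184.96 for iodoacetamide, treats the reaction volume as the original 200 µl (ignoring any volume added with the reagents), and uses a hypothetical function name.

```python
def final_concentration_mM(mass_ug: float, mol_weight: float, volume_ul: float) -> float:
    """Convert a reagent mass added to a given volume into a millimolar concentration."""
    micromoles = mass_ug / mol_weight          # ug / (g/mol) = umol
    molar = micromoles / volume_ul             # umol / ul = mol/l
    return molar * 1000.0                      # mol/l -> mmol/l


SAMPLE_VOLUME_UL = 200.0  # volume of the protein solution prepared first

dtt_mM = final_concentration_mM(100.0, 154.25, SAMPLE_VOLUME_UL)
iaa_mM = final_concentration_mM(250.0, 184.96, SAMPLE_VOLUME_UL)

print(f"DTT: {dtt_mM:.1f} mM")   # about 3 mM
print(f"IAA: {iaa_mM:.1f} mM")   # about 7 mM
```

Both values agree with the rounded concentrations given in the protocol, and the alkylating reagent is in roughly twofold molar excess over the reducing agent.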


Two-dimensional Liquid Chromatography 


Refer to Figure A2.7 while carrying out Steps 8-18.


8. Activate the computer program that controls the HPLC systems. Steps 9-18 are
performed automatically. The total time required to analyze the protein sample using this
protocol is 12 hours.


9. Equilibrate columns C1 (ion-exchange) and C2 (reversed-phase) for 20 minutes with
Buffer B1 and Solvent B3, respectively, at flow rates of 100 µl/min (for column C1) and 5
µl/min (for column C2).


10. Apply the protein sample to column C1 through the sample injector (In), and elute with
Buffer B1 for 5 minutes at a flow rate of 100 µl/min.


11. Mix the eluent on-line with Solvent B5, which is continuously supplied by pump 5 (P5).
The mixture flows into a reversed-phase "trap" column, where salts are separated from
peptides and proteins by continuous washing with Solvent B5.

12. LC-1 is automatically stopped, and after 2 minutes, the column-switching valve moves to 
connect the trap column to column C2. 

13. Introduce the salt-free samples adsorbed on the trap column into the reversed-phase
column (column C2) and separate them for 40 minutes with a linear gradient formed
from Solvents B3 and B4 (5-60% acetonitrile in 0.2% formic acid) at a flow rate of 5
µl/min.

14. Load the eluted samples directly into the ion source of the mass spectrometer. MS and 
MS/MS data are collected automatically. 

15. Equilibrate column C2 again with Solvent B3 for 15 minutes after the linear gradient
elution is finished. The first cycle of the 2D-LC is completed.

16. The computer reactivates the LC-1 system, so that proteins are again eluted from the
ion-exchange column C1 for 5 minutes at a flow rate of 100 µl/min, this time with a mixture
of Buffers B1 and B2 (95% B1:5% B2).

17. Repeat Steps 11-15. 

18. Repeat the separation cycles eight more times, each time changing the ratio of Buffers B1
and B2 used to elute proteins from the ion-exchange column. Below is the percentage of
Buffer B2 in each of the ten cycles.

Cycle    1    2    3    4    5    6    7    8    9    10
% B2     0    5    10   15   20   25   30   40   50   100
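
The automated cycles in Steps 9-18 can be laid out as a simple schedule. The sketch below is a rough illustration, not the control program itself: the constant and function names are hypothetical, the Buffer B2 percentages for cycles 3 to 5 are assumed to be 10, 15, and 20, and only the durations stated explicitly in the steps are counted. Trap-column washing and valve switching times are not specified, so the total comes out below the 12 hours quoted in Step 8.

```python
PERCENT_B2 = [0, 5, 10, 15, 20, 25, 30, 40, 50, 100]  # cycles 3-5 are assumed values
NACL_IN_B2_MM = 400.0            # Buffer B2 contains 0.4 M NaCl
INITIAL_EQUILIBRATION_MIN = 20   # Step 9

# Durations stated in the protocol for one cycle (minutes).
CYCLE_STEPS_MIN = {
    "IXC step elution": 5,           # Steps 10 and 16
    "pause before valve switch": 2,  # Step 12
    "RPC gradient": 40,              # Step 13
    "RPC re-equilibration": 15,      # Step 15
}


def nacl_mM(percent_b2: float) -> float:
    """NaCl concentration of a B1/B2 mixture (Buffer B1 contains no NaCl)."""
    return NACL_IN_B2_MM * percent_b2 / 100.0


cycle_min = sum(CYCLE_STEPS_MIN.values())
total_min = INITIAL_EQUILIBRATION_MIN + cycle_min * len(PERCENT_B2)

for cycle, percent in enumerate(PERCENT_B2, start=1):
    start = INITIAL_EQUILIBRATION_MIN + (cycle - 1) * cycle_min
    print(f"cycle {cycle:2d}: start {start:3d} min, {percent:3d}% B2, {nacl_mM(percent):5.0f} mM NaCl")

print(f"Stated steps add up to {total_min} min ({total_min / 60:.1f} h)")
```

Expressing the salt steps as NaCl concentrations makes it easier to adapt the step gradient to samples that elute earlier or later from the ion-exchange column.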

DATA COLLECTION AND DATABASE SEARCHING


MS/MS data are generally converted to text files listing mass values and intensities of the fragment 
ions. The files are analyzed using appropriate search engines that scan gene and protein databases and 
report back with peptide assignments and putative identification of the proteins. Figure A2.8 and 
Table A2.4 show the results of an analysis of an insoluble protein fraction from Caenorhabditis
elegans. The collected MS/MS data were converted to text files by MassLynx software (Micromass,
Manchester, UK) and analyzed by the Mascot Daemon algorithm (Matrix Science, London, UK) to 
assign peptides and identify proteins from the C. elegans database (National Center for Biotechnology 
Information). The database search was performed with the following parameters: 

* Fixed modification: carbamidomethyl (Cys)

* Variable modification: N-acetyl (Protein), oxidation (Met), phospho (Ser, Thr, Tyr), pyro-Glu
(amino terminal)

* Maximum missed cleavages: 3

* Peptide molecular weight tolerance: ±500 ppm

* Peptide charge number: +2 and +3

* MS/MS tolerance: ±0.8 dalton
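
To make the peptide mass tolerance concrete, the following sketch converts a relative tolerance in ppm into an absolute mass window. It is a generic illustration of how a 500 ppm tolerance translates into daltons, not a description of how the search engine applies it internally; the function names and the example mass are hypothetical.

```python
def ppm_window(mass_da: float, tolerance_ppm: float) -> tuple[float, float]:
    """Return the (low, high) mass limits for a relative tolerance given in ppm."""
    delta = mass_da * tolerance_ppm / 1e6
    return mass_da - delta, mass_da + delta


def within_ppm(observed_da: float, theoretical_da: float, tolerance_ppm: float) -> bool:
    """True if the observed mass lies within the ppm tolerance of the theoretical mass."""
    return abs(observed_da - theoretical_da) <= theoretical_da * tolerance_ppm / 1e6


# Hypothetical peptide of 1500 Da searched with a 500 ppm tolerance.
low, high = ppm_window(1500.0, 500.0)
print(f"Accepted window: {low:.2f} to {high:.2f} Da")
print(within_ppm(1500.6, 1500.0, 500.0))
print(within_ppm(1501.0, 1500.0, 500.0))
```

Because the window scales with mass, a fixed ppm tolerance is wider in absolute terms for large peptides than for small ones.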


TABLE A2.4. Portions of a small sample of proteins identified by 2D-LC-MS/MS analysis of the insoluble
protein fraction of C. elegans


Protein name                                                             Accession number   Number of peptides
(U18546) NaK-ATPase alpha subunit [Caenorhabditis elegans]               gilé4^15           15
(LAE148U53) myotactin form B [Caenorhabditis elegans]                    gi|5002308         14
hypothetical protein K11D9.2a - Caenorhabditis elegans                   gi|7505778         13
hypothetical protein W04D2.1a - Caenorhabditis elegans                   gi|7508956         5
hypothetical protein Y57G11C.15 - Caenorhabditis elegans                 gi|7510283         3
hypothetical protein C47E12.5 - Caenorhabditis elegans                   gi|7497568         3
hypothetical protein T22D1.4 - Caenorhabditis elegans                    gi|7508220         3
hypothetical protein D2023.7 - Caenorhabditis elegans                    gi|7498188         2
(Y14949) SURE-4 protein [Caenorhabditis elegans]                         gi|2440040         2
hypothetical protein F33D11.11 - Caenorhabditis elegans                  gil? 50G388        2
(AF036692) Hypothetical protein C44B12.1 [Caenorhabditis elegans]        gi|4573941         2
NADH-UBIQUINONE OXIDOREDUCTASE CHAIN 1                                   gi|128633          2
(AE040646) Similar to sugar transporter coded for by C. elegans cDNA     gi13592445         2
cytochrome b [Caenorhabditis elegans]                                    gi|5854888         1
hypothetical protein E57F4.3 - Caenorhabditis elegans                    gi|7504539         1
hypothetical protein C12D12.2 - Caenorhabditis elegans                   gi[74195879        1
hypothetical protein C14F11.3 - Caenorhabditis elegans                   gi|7496015         1
PUTATIVE GLUCOSYLTRANSFERASE C08H9.3                                     gi|1751299         1
INTEGRIN BETA PAT-3 PRECURSOR                                            gi|13431773        1
synaptobrevin SNB-1 - Caenorhabditis elegans                             gil751 166!        1


The table lists protein name, gene identification number, and number of peptides assigned to identify
each protein. Note that all of the hypothetical proteins were predicted to be membrane-associated
proteins by the SOSUI algorithm (http://sosui.proteome.bio.tuat.ac.jp/sosuiframe0.html).

FIGURE A2.8. A three-dimensional visualization of a 2D-LC-MS/MS profile of the insoluble protein
fraction of C. elegans.


CONCLUSIONS


Technique 5 detailed an analytical platform for proteomics, 2D-LC-MS (MS/MS), which
provides a fully integrated approach to non-gel-based protein profiling and identification. The
high resolution of the system is achieved by sequential multidimensional chromatography on
ion-exchange and reversed-phase columns combined with high-resolution MS, while repro- 
ducibility is maintained by the automation of the total system. The technique is amenable for 
analysis of expression profiles of cellular proteins and identification of proteins in highly 
complex protein mixtures. In particular, it is very powerful for large-scale analysis of protein 
components in biological complexes, such as functional membrane domains and cellular 
organelles. We are currently applying this technique for the analysis of functional 
protein/membrane complexes, rafts, and cellular organelles such as the endoplasmic reticu- 
lum. However, several obstacles must be overcome during future development of this LC- 
based technology. 


* The resolution of large proteins is limited when cellular proteins are directly analyzed. This
is probably because large proteins bind very tightly to the reversed-phase column and elute
with low recovery during the chromatography.


* The peptide-based 2D-LC-MS/MS analysis following protease digestion of a protein mixture
allows mass identification of proteins regardless of their physicochemical characteristics,
molecular size, and solubility, but it makes quantitative information on cellular levels of
proteins ambiguous. To analyze quantitative changes in proteins, the method must therefore
be combined with an appropriate technology for mass spectrometric quantification,
such as isotope labeling of cells (Gygi et al. 1999).

* The informatics for data handling, processing, and evaluation must be improved. The
2D-LC-MS/MS technology produces a large amount of MS/MS data in 12 hours and typically
reports back with a list of more than 1000 peptides and several hundred candidate proteins.
At present, this data processing step is not fully integrated and usually requires several days 
to complete. The evaluation of the final data is extremely time-consuming not only because 
of the lack of a complete knowledge database, but also because of a lack of a common stan- 
dard among protein and gene entries in the current genome and protein databases. In 
many cases, there are more than ten entries for one protein and each has a different acces- 
sion number in the database. Far more confusing is that one cannot determine if a slight 
difference in amino acid sequence between two protein entries indicates different proteins 
or merely artifacts due to experimental error. 


A CASE STUDY: ANALYSIS OF A PORTION OF THE C. ELEGANS PROTEOME


The system presented here is applicable for protein and peptide separations. In MS mode, proteins 
and peptides are separated by 2D-LC and detected by MS, giving rise to the elution profile of pro- 
teins and peptides with additional information on the molecular mass. As reported earlier on our 
prototype system, this profile could be used for analysis of expression levels of cellular proteins and 
their dynamics (Isobe et al. 1991b; Taoka et al. 1992). In general, the 2D-LC-MS system can resolve
hundreds of proteins expressed in cells and is therefore a good supplement to 2D-PAGE. For this
purpose, a simple mass spectrometer with an electrospray ionization source, such as ESI-TOF-MS
without a collision cell, can be used, and an LC-based "protein profiling" system can be constructed
that is fully automated from sample injection to data collection. However, this system cannot
vide information on protein identification, because the mass data of a protein molecule is insuffi- 
cient to specify a particular protein, unless candidates are predicted. 

The 2D-LC-MS/MS system is extremely powerful for peptide analysis. It can separate peptides to
much higher resolution than proteins, with simultaneous collection of the precise molecular mass
of each peptide and MS/MS data for internal amino acid sequence. In fact, the system can separate 
and identify all of the tryptic fragments of human serum albumin in a single chromatographic run, 
except for several small peptides that are not adsorbed to the reversed-phase column. The most sig- 
nificant application of this system is mass identification of proteins in crude extracts of cells or iden- 
tification of protein components in functional complex and cellular organelles. Combined with the 
automated generation of collision-induced dissociation (CID) of peptide fragments and computer- 
assisted retrieval of the spectra, one can expect to assign thousands of peptides in a highly complex 
peptide mixture generated with protease digests of crude protein mixture, from which hundreds of 
proteins are generally identified from the gene and protein databases. This is possible because a very 
precise peptide mass value with a portion of the amino acid sequence (sequence tag) is often suffi- 
cient for unequivocal assignment of one corresponding peptide and identification of one particular 
protein in the database. Because the identification depends on a part of the protein structure (e.g.,
peptide mass and sequence tag), this technique is applicable to any protein mixture, even intractable 
ones, such as those that include membrane-bound proteins. Figure A2.8 shows one such example, 
in which tryptic digests of an insoluble protein fraction of the nematode C. elegans are analyzed 
directly by this system. Although the figure illustrates total ion chromatogram (TIC), a plot of base 
peak intensities during total ion monitoring of chromatography, the automated MS/MS analysis
actually generated almost 8000 MS/MS spectra, which allowed ~1200 peptide and 400 protein
identifications to be assigned (Yamauchi et al., unpubl.). Table A2.4 illustrates a portion of the
proteins identified by this system. The table includes many "hypothetical" proteins presumably
located in the membrane that are identified in the nematode genome but have not yet been
characterized as functional proteins.


TECHNIQUE 6 


Modification of Phosphoamino Acids to Facilitate 
Identification 


Alastair Aitken 


Division of Biomedical and Clinical Laboratory Sciences, University of Edinburgh, Scotland


IN ADDITION TO DIRECT SEQUENCING OF PHOSPHOPEPTIDES, methods have been developed that
convert the phosphoamino acid to a more stable derivative. These are generally applicable to
phosphoserine and, to a lesser extent, phosphothreonine. Only one of these methods has
proved particularly useful in practice when multiple phosphorylation sites are encountered
(as is frequently the case).

* Phosphoserine can be identified after conversion to β-methylaminoalanine by β-elimination
and addition of methylamine (Annan et al. 1982).

* The conversion of phosphoserine to alanine by β-elimination followed by reduction with
sodium borohydride has also been used (Richardson et al. 1978). O-glycosylated serine
residues are similarly converted to alanine.

* During these reactions, the corresponding threonine derivatives are converted, with lower
efficiency, to 2-aminobutanoic acid. Therefore, independent confirmation of the presence of
phosphate in a given peptide should be sought, because O-glycosylated serine would also
undergo β-elimination and reduction during the derivatization procedure. In this case, however,
sequence analysis of the unmodified peptide would not give a PTH-serine at the cycle
of appearance of the glycosylated residue, whereas a substantial amount of the dithiothreitol
(DTT) adduct of dehydroserine would be expected in the case of a phosphorylated residue.

* A particularly useful method, which is detailed in this technique, exists for the derivatization
of phosphoserine to the stable adduct S-ethylcysteine (Meyer et al. 1986) by β-elimination
of the phosphate followed by addition of ethanethiol. Due to the alkaline conditions
required for the conversion to S-ethyl-Cys, Asn and Gln may be deamidated and identified
as PTH-Asp or PTH-Glu. This procedure may be used for the selective isolation of
phosphoseryl peptides (Holmes 1987). A derivatized peptide from a doubly phosphorylated
species will elute correspondingly later than the singly derivatized species, indicating the
applicability of the method to multiple phosphoseryl peptides. HPLC before and after
derivatization should produce highly purified peptides even from a very complex mixture
because the elution position of all others will be unaffected. The dehydroalanine intermediate
from amino-terminal phosphoserine peptides will rearrange to pyruvate, and
carboxy-terminal phosphoserine peptides will give ethylamine.

* It has been possible to raise antiphosphoamino acid antibodies to particular phosphopeptide
sequences in proteins. Although there are commercial antibodies claimed to be directed against
phosphoserine or phosphothreonine, these do not appear to be generally successful for
identification of these derivatives. In contrast, the use of antiphosphotyrosine antisera is
well established.

* The development of metal-ion affinity chromatography to specifically isolate phosphoproteins
has also been useful (see Chapter 9, Protocols 1, 3, and 4).

MATERIALS


CAUTION: See Appendix 3 for appropriate handling of materials marked with <!>.


Reagents
Dimethylsulfoxide (DMSO) <!>
Ethanethiol (10 M) <!>
Ethanol <!>
Glacial acetic acid <!> 
NaOH (5 M) <!> 


Equipment
Heat block preset to 50°C


HPLC solvents
The HPLC solvents are used to prepare linear gradients of H2O and acetonitrile <!> in 0.1%
trifluoroacetic acid <!> (for additional details, see Chapter 5).


Nitrogen <!>


RP-HPLC column (e.g., Vydac C18)
See text preceding Step 1.


Biological Sample
Phosphopeptides of interest


METHOD 


Selective Isolation of Phosphoseryl Peptides from a Complex Mixture


Steps 1, 2, 8, and 9 are additional and necessary if the phosphopeptides are part of a mixture
of proteins (e.g., a cell extract). If purified phosphopeptides are being modified, only Steps
3-7 need be carried out.


1. (Optional) Apply the underivatized peptides to an RP-HPLC column. 


2. (Optional) Elute the peptides under standard conditions (see Chapter 5). Collect frac- 
tions containing the phosphopeptide(s). 


Derivatization of Phosphoserine Residues 


3. Prepare incubation mixture in a microfuge tube by combining: 


H2O                   200 µl
DMSO                  200 µl
ethanol               100 µl
5 M NaOH               65 µl
10 M ethanethiol       60 µl


WARNING: Ethanethiol has an extremely noxious odor. Carry out manipulations in a
chemical fume hood.

4. Dissolve the peptide in the tube of incubation mixture.


5. Flush the tube with nitrogen and incubate the reaction for 1 hour at 50°C. 


An extended reaction time (18 hours at 50°C) may be used because the β-elimination step
of derivatization of a phosphoserine adjacent to a proline residue (particularly if the proline
is carboxy-terminal) is slow. This can be carried out with acetonitrile, which minimizes
subsequent manipulations (Aitken et al. 1995).


6. After cooling the tube on ice to minimize potential hazard, add 10 pl of glacial acetic acid 
in small aliquots to neutralize the solution. 


7. Either apply the derivatized peptide directly to a protein sequencer or concentrate it by 
vacuum centrifugation. 


The elution position of this PTH derivative on HPLC during automated sequencing is well- 
characterized. 
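
The composition of the incubation mixture in Step 3 can be checked with a short calculation. The sketch below is illustrative only: it assumes the volumes are simply additive (any volume change on mixing is ignored), and the function name is a placeholder rather than part of the protocol.

```python
# Volumes (µl) and stock molarities (M) as listed in Step 3.
# Components without a stated stock molarity carry None.
components = {
    "H2O": (200.0, None),
    "DMSO": (200.0, None),
    "ethanol": (100.0, None),
    "NaOH": (65.0, 5.0),
    "ethanethiol": (60.0, 10.0),
}


def final_concentrations(mix):
    """Return (total volume in µl, final molarity of each component with a stock molarity).

    Assumption: volumes are additive.
    """
    total = sum(volume for volume, _ in mix.values())
    finals = {
        name: stock * volume / total
        for name, (volume, stock) in mix.items()
        if stock is not None
    }
    return total, finals


total_ul, finals = final_concentrations(components)
print(f"total volume: {total_ul:.0f} µl")
for name, molarity in finals.items():
    print(f"{name}: {molarity:.2f} M")
```

Under the additivity assumption, the mixture has a total volume of 625 µl, with NaOH at about 0.5 M and ethanethiol at about 1 M.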


Separation of S-ethylcysteinyl Peptides from Underivatized Peptides 


8. Apply the impure S-ethylcysteinyl peptides to the same RP-HPLC column used in Step 1. 


9. Elute under the same conditions as in Step 2. The derivatized peptide(s) can be expected 
to emerge at 4-5% acetonitrile later than the native phosphopeptide. 


Identification of Phosphoamino Acids by Thin-layer 
Electrophoresis 


Alastair Aitken 


Division of Biomedical and Clinical Laboratory Sciences, University of Edinburgh, Scotland 


THE ANALYSIS OF PHOSPHOAMINO ACIDS is frequently confined to qualitative or semiquantitative 
determinations based on their separation by 1D or 2D thin-layer or paper electrophoresis. 
This is still an extremely useful technique because only inexpensive equipment is 
required, and the spots corresponding to phosphoamino acids may be excised and their 
radioactivity (³²P) quantitated in a scintillation counter. 


MATERIALS 


CAUTION: See Appendix 3 for appropriate handling of materials marked with <!>. 


Reagents 

HCl (6 N) <!> 
HPLC-grade H2O 
pH 1.9 electrophoresis buffer (acetic acid <!>:formic acid <!>:H2O, 4:1:45 v/v/v) 
pH 3.5 electrophoresis buffer (pyridine <!>:acetic acid:H2O, 1:10:89 v/v/v) 
Phenol (0.5% v/v) <!> 
Phosphoserine, phosphothreonine, and phosphotyrosine (1 µg/µl [i.e., ~5 nmole/µl]) (Sigma) 
Ninhydrin spray (0.2% ninhydrin w/v in ethanol) <!> 


Equipment 

Argon (pure, sequencer grade) <!> 
Centrifugal evaporator (e.g., SpeedVac, Thermo Savant, or equivalent) 
Electrophoresis tank 

Glass test tubes, small 
Number the hard-glass test tubes with a diamond scriber. 

Liquid scintillation counter 
Oven or block heater preset to 110°C 
Plastic-backed thin-layer cellulose sheets (e.g., Camlab, Macherey-Nagel) 
Screw-top glass or PTFE containers for hydrolysis of peptides (Waters-Milligen or Ciba-Corning) 


Biological Sample 

Phosphopeptide samples of interest, radiolabeled with ³²P <!> 


METHOD 


1. After desalting in a volatile buffer, transfer the phosphopeptide samples to small test 
tubes, previously numbered. Remove the volatile sample buffer in vacuo, either in a desiccator 
or a centrifugal evaporator. 


2. Place the numbered tubes in the hydrolysis container(s) with 2 ml of 6 N HCl and 0.5% 
(v/v) phenol. Flush the container with pure argon for 2 minutes, close the container, and 
evacuate it if possible. 


3. Heat the container in an oven or block heater for 1-4 hours at 110°C to hydrolyze the 
³²P-labeled phosphopeptides. 


The recovery of phosphoamino acids varies with the primary structure around the phosphorylation 
site as well as the acid lability of the phosphoamino acid. The order of lability 
of free phosphoamino acids is phosphotyrosine > phosphoserine > phosphothreonine. 


4. Cool the hydrolysis tubes, and then quickly transfer them to a centrifugal evaporator and 
evacuate the tubes without initially turning on the rotation (to prevent washing down 
traces of HCl into the tubes). Dissolve the residues in 50 µl of HPLC-grade H2O and 
evaporate them in a centrifugal evaporator (in the conventional manner) to dryness to 
remove traces of HCl. 


5. Dissolve the dried residue in a solution containing 5-10 nmoles each of unlabeled 
phosphoserine, phosphothreonine, and phosphotyrosine. 


6. Subject the samples to electrophoresis at pH 1.9 in the first dimension and pH 3.5 in the 
second dimension on plastic-backed thin-layer cellulose sheets. 


7. Stain the electrophoretogram with ninhydrin to locate the unlabeled phosphoamino 
acids by spraying or dipping it in the ethanolic ninhydrin solution and allowing the 
electrophoretogram to develop. Heat in a warm oven to hasten the process (see Figure A2.9). 


8. Cut out the regions of the electrophoretogram containing phosphoamino acids and 
count them in a liquid scintillation counter. 
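

The counts obtained in the final step are often summarized as the share of recovered radioactivity in each phosphoamino acid. The sketch below shows one way to do this; the function name, the single background value, and the example counts are all hypothetical and not part of the protocol. Because the phosphoamino acids differ in acid lability, the result is semiquantitative at best.

```python
def phosphoamino_acid_distribution(spot_cpm, background_cpm):
    """Percent of background-corrected counts found in each excised spot.

    spot_cpm: mapping of spot name -> counts per minute.
    background_cpm: counts from a blank region of the same plate (assumed
    to be representative of every spot).
    """
    corrected = {
        name: max(cpm - background_cpm, 0.0) for name, cpm in spot_cpm.items()
    }
    total = sum(corrected.values())
    if total == 0:
        raise ValueError("no counts above background")
    return {name: 100.0 * cpm / total for name, cpm in corrected.items()}


# Hypothetical counts (cpm), not measured data.
example_counts = {"P.Ser": 8050.0, "P.Thr": 1550.0, "P.Tyr": 550.0}
shares = phosphoamino_acid_distribution(example_counts, background_cpm=50.0)
for name, percent in shares.items():
    print(f"{name}: {percent:.1f}%")
```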


FIGURE A2.9. Cellulose TLC plate indicating sample and dye origins and ³²P-labeled products 
from a sample applied to origin 1. Four samples are typically applied on a TLC plate at the 
indicated sample origins, shown as 1, 2, 3, and 4. Before electrophoresis in the first dimension, 
a marker dye mixture of ε-DNP-lysine and xylene cyanol FF is applied to the dye origin depicted 
in the upper right corner. The plate is rotated 90° counterclockwise for second-dimension 
electrophoresis at pH 3.5. The positions of phosphoserine (P.Ser), phosphothreonine (P.Thr), 
phosphotyrosine (P.Tyr), released orthophosphate (P_i), and partially hydrolyzed phosphopeptides 
are shown for a sample applied to origin 1. (Reproduced, with permission, from Blume-Jensen and 
Hunter 2001.) 


TECHNIQUE 8 


Isothermal Titration Calorimetry: Measuring 
Intermolecular Interactions 


Adrian Velazquez-Campoy and Ernesto Freire 


Department of Biology and Biocalorimetry Center, The Johns Hopkins University, Baltimore, Maryland 21218 


THE ASSOCIATION OF BIOLOGICAL MACROMOLECULES with one another or with their ligands has 
a central role in the structural assembly and functional regulation of biological systems (e.g., 
binding of hormones or toxins to receptors, allosteric enzymatic control, and regulation of 
transcription and replication). In a time when proteomics is considered as the next necessary 
step toward a more advanced understanding in diverse fields such as cell biology, develop- 
mental biology, and genetics, it is imperative to have access to a fast, robust, and reliable tech- 
nique to measure the energetics of intermolecular interactions (e.g., protein-ligand, protein- 
protein, or protein-nucleic acid interactions) and to develop a quantitative description of the 
driving forces that govern molecular associations and regulatory processes. Recent reviews 
highlight isothermal titration calorimetry (ITC) as a powerful tool to accomplish these goals 
(Jelesarov and Bosshard 1999; Leavitt and Freire 2001). 

The simplest binding process corresponds to a macromolecule with a single ligand-binding 
site (1:1 stoichiometry), M + L ⇌ ML, where M and L are the free reactant species, macromolecule 
and ligand, and ML represents the binary complex. More complex processes, like 
the one corresponding to a macromolecule with n independent and equivalent ligand-binding 
sites (1:n stoichiometry, with noninteracting identical binding sites), M + nL ⇌ ML_n, can 
be easily generalized from the simple case. This model and other extensions to sets of independent 
or interacting sites (positive and negative cooperativity) have been discussed in the 
literature (Cantor and Schimmel 1980; Wyman and Gill 1990; Freire et al. 1990; Van Holde 
et al. 1998). 

The strength of the association complex is given in terms of the association constant, K_a, 
or the dissociation constant, K_d: 


$$K_a = \frac{[ML]}{[M][L]} = \frac{1}{K_d} \qquad (1)$$


which are related to the Gibbs energy of binding, ΔG: 

$$\Delta G = -RT \ln K_a = RT \ln K_d \qquad (2)$$

which, in turn, can be expressed in terms of the enthalpy, ΔH, and entropy, ΔS, of binding: 

$$\Delta G = \Delta H - T \Delta S \qquad (3)$$


For a binding process to occur, the Gibbs energy of binding must be negative. Basically, ΔH 
reflects the interactions between macromolecule and ligand (hydrogen bonds, electrostatic 
and van der Waals interactions) taking as a reference their respective interactions with the 
solvent. The more negative ΔH, the more favorable the interactions in the binary complex. 
The entropy, ΔS, represents the balance between two phenomena with opposite contributions. 
A large desolvation of hydrophobic surfaces upon binding is accompanied by an 
extremely large increase in entropy corresponding to the release of water molecules. On the 
other hand, the binding imposes conformational and roto-translational restrictions, result- 
ing in a loss of degrees of freedom of the individual reactants and, consequently, a loss of con- 
formational and translational entropy. 
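
As a worked illustration of Equations 2 and 3, the sketch below converts an association constant and a binding enthalpy into ΔG and the entropic term −TΔS. The numerical inputs are hypothetical values chosen only for the example, and the function name is a placeholder.

```python
import math

R_KCAL = 1.987e-3  # gas constant in kcal/(mol K)


def binding_energetics(k_a, delta_h, temperature=298.15):
    """Return (dG, -T*dS) in kcal/mol.

    k_a: association constant in 1/M; delta_h: binding enthalpy in kcal/mol.
    """
    delta_g = -R_KCAL * temperature * math.log(k_a)  # Equation 2
    minus_t_delta_s = delta_g - delta_h              # Equation 3 rearranged
    return delta_g, minus_t_delta_s


# Hypothetical inputs: K_a = 1e6 1/M and dH = -5 kcal/mol at 25 °C.
dg, minus_tds = binding_energetics(k_a=1.0e6, delta_h=-5.0)
print(f"dG = {dg:.2f} kcal/mol, -T*dS = {minus_tds:.2f} kcal/mol")
```

In this example both terms are negative, so enthalpy and entropy each contribute favorably to binding.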

A variety of experimental techniques and analytical methods exist to study binding 
processes. Most of the traditional methods used to study binding are based on partitioning 
the system into free and bound species (e.g., equilibrium dialysis, ultrafiltration, gel exclusion 
chromatography, ultracentrifugation, and radioligand binding) and are both time- and mate- 
rial-consuming. Spectroscopic methods that detect a binding event by measuring changes in 
an observable parameter proportional to the extent of binding saturation fail to provide the 
stoichiometry directly. These methods require special reporter groups (e.g., chromophores or 
fluorophores) and therefore cannot be used to measure every binding event. All of these 
techniques offer a partial description of the binding reaction and only allow the determina- 
tion of the association constant and evaluation of the Gibbs energy of binding, but they do 
not permit the dissection of the latter into enthalpic and entropic contributions. This analy- 
sis has been proven to be fundamental to precisely describe and understand the interactions 
underlying different binding processes (Gomez and Freire 1995; Parker et al. 1999; Todd et 
al. 2000; Velazquez-Campoy et al. 2000a). 

Among the techniques used to study binding, ITC is unique because, in a single experi- 
ment, it provides a complete thermodynamic characterization of the energetics of a binding 
process, i.e., the determination of all the thermodynamic quantities describing that process: 
Gibbs energy, association constant, enthalpy and entropy of binding, as well as the stoi- 
chiometry of binding. Like any other experimental technique, ITC exhibits very important 
advantages and some limitations. 


Advantages of ITC 


Heat is a universal signal. Any binding process is accompanied by a heat effect susceptible to 
being measured by ITC. Even if under certain experimental conditions the reaction heat is 
near zero, it is possible to slightly change the experimental setup (with a change in tempera- 
ture, pH, or buffer) in order to have a measurable heat effect. 

Given that the measured signal is the reaction heat, it is possible to determine directly the 
enthalpy of binding in addition to the binding affinity using ITC. In addition, ITC does not 
require the presence of reporter groups (very often nonnatural), because the signal propor- 
tional to the extent of binding is the heat effect. 

ITC provides a complete model-independent thermodynamic characterization of the 
binding process, because it directly measures the enthalpy of binding (reaction heat at constant 
pressure) and the association constant. Unlike spectroscopic techniques, there is no need to 
obtain the temperature dependence of the association constant to estimate the so-called van't 
Hoff enthalpy. The van't Hoff method lacks accuracy because (1) only a narrow range of 
temperatures is allowed in experiments, (2) the association constant very often appears 
temperature-independent, due to the so-called enthalpy/entropy compensation phenomenon, (3) poor 
precision in the association constant determination may be translated into large errors for ΔH and 
ΔS, and (4) the van't Hoff analysis is model-dependent. 

ITC is a nondestructive technique. After running the experiment, it is possible to recover 
and recycle the reactants. 
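
The sensitivity of the van't Hoff estimate to errors in the association constant can be illustrated numerically. The sketch below uses the two-temperature form of the van't Hoff relation, which assumes a temperature-independent ΔH (itself one of the model assumptions mentioned above). All numbers are hypothetical.

```python
import math

R_KCAL = 1.987e-3  # gas constant in kcal/(mol K)


def vant_hoff_enthalpy(k1, t1, k2, t2):
    """Two-point van't Hoff enthalpy (kcal/mol), assuming dH is constant with T."""
    return -R_KCAL * math.log(k2 / k1) / (1.0 / t2 - 1.0 / t1)


# Hypothetical system: dH = -10 kcal/mol, K_a = 1e6 1/M at 15 °C.
true_dh = -10.0
t1, t2 = 288.15, 308.15
k1 = 1.0e6
k2 = k1 * math.exp(-(true_dh / R_KCAL) * (1.0 / t2 - 1.0 / t1))

exact = vant_hoff_enthalpy(k1, t1, k2, t2)
# Same calculation with a 10% overestimate of the constant at 35 °C.
perturbed = vant_hoff_enthalpy(k1, t1, 1.10 * k2, t2)
print(f"error-free: {exact:.2f}; with 10% error in K_a: {perturbed:.2f} kcal/mol")
```

With only a 20-degree window, a 10% error in a single association constant shifts the estimated enthalpy by close to 1 kcal/mol in this example.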


Limitations of ITC 


In ITC, global heat effects are measured. The identification of specific contributions and 
interactions in the binding process requires ad hoc experimental designs. A more detailed 
study, usually involving different environmental conditions, a number of modifications of 
the reactants (e.g., mutant proteins and/or small library of ligands), and a structure-based 
thermodynamic dissection (Luque and Freire 1998), might reveal the nature of the specific 
contributions and individual interactions driving the binding process. 

As described below, there is a well-defined range (dependent on the macromolecule con- 
centration in the reaction cell) for the accurate determination of the association constant 
(roughly, 30* m | S K, $ 10? 4^), even though the range can be extended by using displace- 
ment titrations {see below). 


OUTLINE OF THE PROCEDURE 


Principle of the Technique 


A calorimetric titration measures the heat released or absorbed by the stepwise addition of a 
molecule (ligand) into the reaction cell of the calorimeter containing a solution of the inter- 
acting macromolecule while maintaining constant temperature and pressure. Changing the 
chemical composition of the sample as a result of the addition of ligand, using a precision 
computer-controlled syringe, triggers the reaction. The content of the sample cell is contin- 
uously stirred to ensure rapid mixing of the reactants. Thus, the system goes through a 
sequence of equilibrium states, and the direct thermodynamic observable is the heat associated 
with every change of state. A second cell, where no binding reaction takes place, is used 
as a reference. 

The measured heat is related to both the enthalpy and the extent of reaction. For each 
injection, the heat released or absorbed, q_i, is given by: 


$$q_i = V \Delta H \left( [ML]_i - [ML]_{i-1} \right) \qquad (4)$$


where V is the reaction cell volume, ΔH is the enthalpy of binding, and [ML]_i − [ML]_(i−1) is the 
change in complex concentration produced by the ith injection. A typical ITC experiment is 
illustrated in Figure A2.10. Modern ITC instruments are designed on the dynamic compensation 
principle, in which the measured signal is the amount of thermal power necessary to maintain null, 
at any time, the temperature difference between the reaction and reference cells, i.e., the 
amount of thermal power generated by the reaction triggered by the ligand injection. The 
time evolution of the signal exhibits characteristic deflections from the baseline in the form 
of peaks, each corresponding to one injection. The heat after each injection is obtained by 
calculating the area under each peak. Because q_i is proportional to the increase in bound 
macromolecule concentration, its magnitude will decrease as the macromolecule is titrated 
until reaching complete saturation. After saturation, additional injections produce small 
identical peaks, due to nonspecific phenomena such as dilution and mixing effects, whose 
heat effect must be subtracted from all the injection peaks before performing data analysis. 
Data analysis of the heat effect for each injection leads to the calculation of the thermodynamic 
binding parameters (n, K_a, ΔH). 
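
The peak integration and dilution correction described above can be sketched in a few lines. This is a simplified illustration, not the algorithm of any instrument software: it assumes a flat, known baseline and uses trapezoidal integration, and the function names and numbers are hypothetical.

```python
def injection_heat(times_s, power_ucal_per_s, baseline=0.0):
    """Trapezoidal area (µcal) under one peak after subtracting a flat baseline."""
    heat = 0.0
    for k in range(1, len(times_s)):
        dt = times_s[k] - times_s[k - 1]
        p_left = power_ucal_per_s[k - 1] - baseline
        p_right = power_ucal_per_s[k] - baseline
        heat += 0.5 * (p_left + p_right) * dt
    return heat


def subtract_dilution(heats, n_last=3):
    """Subtract the mean of the last n_last (post-saturation) heats from every injection."""
    dilution = sum(heats[-n_last:]) / n_last
    return [h - dilution for h in heats]


# Hypothetical exothermic peak sampled every 10 s (power in µcal/s).
times = [0.0, 10.0, 20.0, 30.0, 40.0]
power = [0.0, -2.0, -1.0, 0.0, 0.0]
peak_heat = injection_heat(times, power)

# Hypothetical series of integrated heats ending in three dilution peaks.
corrected = subtract_dilution([-30.0, -20.0, -2.0, -2.0, -2.0])
```

Averaging the last peaks is appropriate only when saturation has clearly been reached; otherwise a blank experiment or a fitted constant term is needed.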


Type of Calorimetric Experiments: Information Available by ITC 


Simultaneous determination of the association constant and the enthalpy of binding 

As in every equilibrium-binding technique, reactant concentrations must be within an 
appropriate range in order to obtain reliable estimates of the association constant. A practical 
rule of thumb is given by the parameter c, where c = K_a × [M], which must lie between 
0.1 and 1000, thus imposing a limit to the lowest and largest association constant measurable 
at a given macromolecule concentration (Wiseman et al. 1989). This phenomenon is illustrated 
in Figure A2.11. 


FIGURE A2.10. (Panel a) Illustration of the configuration of an ITC reaction cell. The cell volume is 1.4 ml 
and initially is filled with the macromolecule solution (blue). The injection syringe is filled with the ligand 
solution (black). At specified time intervals (400 seconds), a small volume (10 µl) of the ligand solution is 
injected into the cell, triggering the binding reaction and producing the characteristic peak sequence in the 
recorded signal (panel b). After saturating the macromolecule, the residual heat effects (the so-called 
"dilution peaks"), if any, are due to mechanical and dilution phenomena. The experiment corresponds to the 
titration of HIV-1 protease with acetyl-pepstatin (a universal inhibitor of aspartic proteases), performed in 
10 mM sodium acetate, 2% DMSO (pH 5.0) at 25°C. The concentrations of reactants are 20 µM HIV-1 
protease and 300 µM acetyl-pepstatin. After integration of the area under each peak (and subtraction of the 
dilution heat effects and normalization per mole of injected ligand), the individual heats are plotted against 
the molar ratio (panel c) from which, through nonlinear regression, it is possible to estimate the 
thermodynamic parameters n, K_a, and ΔH. The solid line corresponds to the theoretical curve with n = 1.03, 
K_a = 2.3 x 10^6 M⁻¹, and ΔH = 7.7 kcal/mole. 

As the c value increases, the transition from low to high ligand concentration is more 
abrupt. In the case of very high association constants (macromolecule concentration much 
higher than the dissociation constant), all of the ligand added in any injection will bind to the 
macromolecule until saturation occurs and all the peaks, except the last ones, therefore exhibit 
the same heat effect. For low association constants (macromolecule concentration far 
below the dissociation constant), from the very first injection, only a fraction of the ligand 
added will bind, producing a less steep titration in which saturation is hardly reached. To 
obtain accurate estimates of the association constant, an intermediate case is desirable. 

To obtain a satisfactory titration curve, the concentration of ligand should be enough to 
exceed the stoichiometric binding after completion of the injection sequence (i.e., ligand 
concentration in the syringe should be 10-20 times the concentration of macromolecule in the 
cell). In case of poor solubility of the ligand, the reverse titration should be attempted. 
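
The two rules of thumb above (the window for c and the 10-20-fold syringe concentration) lend themselves to a quick planning check. The sketch below is illustrative; the function names are placeholders and the association constant is a hypothetical value.

```python
def c_value(k_a, macromolecule_molar):
    """Parameter c = K_a x [M] (dimensionless)."""
    return k_a * macromolecule_molar


def within_c_window(k_a, macromolecule_molar, c_min=0.1, c_max=1000.0):
    """True if c lies inside the window quoted in the text."""
    return c_min <= c_value(k_a, macromolecule_molar) <= c_max


def syringe_concentration_range(macromolecule_molar, low=10.0, high=20.0):
    """Suggested ligand concentration range in the syringe (same units as the input)."""
    return low * macromolecule_molar, high * macromolecule_molar


# Hypothetical case: K_a = 1e6 1/M with 20 µM macromolecule in the cell.
cell_conc = 20e-6
c = c_value(1.0e6, cell_conc)
ok = within_c_window(1.0e6, cell_conc)
too_tight = within_c_window(1.0e10, cell_conc)
low_m, high_m = syringe_concentration_range(cell_conc)
print(f"c = {c:.0f}, syringe ligand: {low_m * 1e6:.0f}-{high_m * 1e6:.0f} µM")
```

For 20 µM macromolecule, the suggested syringe range of 200-400 µM brackets the 300 µM ligand concentration used in the experiment of Figure A2.10.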

The constraints dictated by parameter c impose an experimental limitation. For very high 
association constants, optimal concentrations are too small to be practical. For very low 
association constants, the concentrations may be prohibitively high, either because there is a 
possibility of aggregation or the economic/production costs are simply too great. 


FIGURE A2.11. Illustration of the effect of the association constant value on the shape of a titration curve. The plots represent 
three titrations simulated using the same parameters (concentrations of reactants and binding enthalpy), but different association 
constants. Low (panel A), moderate (panel B), and high affinity (panel C) binding processes are shown. To obtain accurate 
estimates of the association constant, an intermediate case is desirable (center). The limit value of the curve when the molar ratio 
reaches zero, taking as a reference the heat effect associated to the dilution peaks, provides a good estimate of the enthalpy only 
when the parameter c is large enough, because such limit value is given by (c/(c + 1)) × ΔH (Indyk and Fisher 1998). 

It is sometimes possible to change the experimental conditions (temperature or pH, 
without compromising stability against aggregation or unfolding) in order to modify the 
association constant toward accessible experimental values (Doyle et al. 1995; Baker and 
Murphy 1996; Doyle and Hensley 1998; Velazquez-Campoy et al. 2000b). In this case, appropriate 
equations are required to extrapolate to the original conditions. However, there is an 
extension of the ITC protocol aimed at overcoming such drawbacks. Without changing 
experimental conditions at all, displacement ITC extends without limits the useful range for 
the association constant determination (Sigurskjold 2000; Velazquez-Campoy et al. 2001). 
Basically, a displacement experiment consists of a titration of the high-affinity ligand into a 
solution of the macromolecule prebound to a weaker ligand, therefore decreasing the appar- 
ent affinity of the potent ligand. The thermodynamic parameters for the binding of the high- 
affinity ligand are calculated from the apparent binding parameters characterizing the dis- 
placement titration, and the known binding parameters for the weak ligand. A similar exper- 
imental design can be used to measure the binding of weak ligands. 
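
The calculation behind a displacement titration can be sketched as follows. The relations used are the standard competitive-binding expressions found in the displacement-ITC literature, not equations given in this text, and they assume the weak ligand is in sufficient excess that its free concentration is close to its total concentration. All numbers are hypothetical.

```python
def true_parameters_from_displacement(k_app, dh_app, k_weak, dh_weak, weak_conc):
    """Recover K_a and dH of the high-affinity ligand from a displacement titration.

    Assumed relations (free weak ligand ~ total weak ligand):
        K_app  = K_a / (1 + K_w [W])
        dH_app = dH - dH_w * K_w [W] / (1 + K_w [W])
    Constants in 1/M, concentrations in M, enthalpies in kcal/mol.
    """
    occupancy = k_weak * weak_conc / (1.0 + k_weak * weak_conc)
    k_a = k_app * (1.0 + k_weak * weak_conc)
    delta_h = dh_app + dh_weak * occupancy
    return k_a, delta_h


# Hypothetical inputs: apparent parameters measured in the presence of
# 200 µM of a weak ligand with K_w = 1e6 1/M and dH_w = +7 kcal/mol.
k_a, delta_h = true_parameters_from_displacement(
    k_app=1.0e7, dh_app=-12.0, k_weak=1.0e6, dh_weak=7.0, weak_conc=200e-6
)
print(f"K_a = {k_a:.2e} 1/M, dH = {delta_h:.2f} kcal/mol")
```

In this example the weak ligand lowers the apparent affinity by a factor of about 200, bringing a constant near 2 x 10^9 M⁻¹ into the measurable range.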


Determination of the enthalpy of binding 


Binding enthalpy is usually determined from titrations in the optimal range of reactant 
concentrations. However, a greater accuracy can be achieved if enthalpy is measured by 
performing injections of ligand into a large excess of macromolecule (far above the dissociation 
constant). Therefore, although there will be no saturation, all the ligand injected will be fully 
associated, giving a good measurement of the binding reaction heat. Blank experiments (see 
below) must be included in order to eliminate the contribution of different phenomena 
(dilution of reactant, solution mixing, and other nonspecific effects) occurring simultane- 
ously during the injections and to obtain the heat effect associated with the binding process. 


Blank experiment 


To estimate the heat effect associated with mechanical and dilution phenomena (where binding 
does not take place), blank experiments can be performed. In this case, the ligand is 
injected into a buffer solution without the macromolecule. However, this may not be the ideal 
way to estimate such heat effects, because the degree of solvation of the ligand and the chem- 
ical composition would be different compared to the titration with the macromolecule pre- 
sent. For this reason, some investigators usually consider the average effect of the last injec- 
tion peaks as the reference heat effect. If, during the experiment, it is not possible to reach 
complete saturation, precluding such averaging, it is possible to include in the fitting func- 
tion a constant term accounting for such dilution heat effects. 

Blank experiments, also called “heat of dilution experiments,” are required to measure 
binding enthalpies under excess macromolecule concentration, as mentioned previously. In 
addition, these types of experiments can be used as a test to detect possible self-association of 
the ligand (especially very hydrophobic ones) at high concentrations. On the basis of this con- 
sideration, protein oligomerization processes can be studied calorimetrically by performing 
dilution experiments in which a concentrated solution of protein is injected into a solution 
containing buffer. When injected, the complex partially dissociates giving rise to a heat effect 
proportional to the enthalpy of association. From the data analysis, it is possible to estimate 
the association constant and the enthalpy for the oligomerization process (Lovatt et al. 1996). 


Determination of heat capacity change of binding 


The temperature derivative (at constant pressure) of the binding enthalpy, i.e., the heat 
capacity change upon binding or binding heat capacity, ΔC_p, is defined as 


$$\Delta C_p = \left( \frac{\partial \Delta H}{\partial T} \right)_P \qquad (5)$$


and can be determined by repeating the same experiment at several temperatures. The heat 
capacity is given by the slope of an enthalpy versus temperature plot. 

ΔC_p has been shown to originate from surface dehydration upon binding and, to a lesser 
extent, from the difference in vibrational modes between the complex and the free species. It 
provides information about the nature of the interactions driving the binding, in addition to 
the temperature dependence of the thermodynamic functions (Gibbs energy, enthalpy, and 
entropy of binding). 
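
Obtaining ΔC_p from the slope of an enthalpy versus temperature plot amounts to a straight-line fit. The sketch below uses ordinary least squares and assumes ΔC_p is constant over the temperature range; the data points are hypothetical, and the function name is a placeholder.

```python
def heat_capacity_change(temperatures, enthalpies):
    """Least-squares slope of dH versus T, i.e., dCp, plus the fitted intercept.

    Assumes dCp is constant over the temperature range used.
    """
    n = len(temperatures)
    mean_t = sum(temperatures) / n
    mean_h = sum(enthalpies) / n
    sxx = sum((t - mean_t) ** 2 for t in temperatures)
    sxy = sum((t - mean_t) * (h - mean_h) for t, h in zip(temperatures, enthalpies))
    slope = sxy / sxx
    intercept = mean_h - slope * mean_t
    return slope, intercept


# Hypothetical binding enthalpies (kcal/mol) measured at four temperatures (°C).
temps_c = [15.0, 20.0, 25.0, 30.0]
dh_values = [-8.0, -9.5, -11.0, -12.5]
delta_cp, dh_at_zero_c = heat_capacity_change(temps_c, dh_values)
print(f"dCp = {delta_cp * 1000:.0f} cal/(K mol)")
```

Because only temperature differences enter the slope, the fit gives the same ΔC_p whether temperatures are entered in °C or in K.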


Determination of coupled proton transfer process 


ITC is one of the more suitable techniques for the assessment of the existence of 
protonation/deprotonation processes coupled to binding. When a binding process is coupled to 
proton transfer between the bulk solution and the bound complex, the enthalpy of binding will 
depend on the ionization enthalpy of the buffer molecule, ΔH_ion, as shown in Equation 6. 


$$\Delta H = \Delta H_0 + n_H \Delta H_{ion} \qquad (6)$$


where ΔH_0 is the buffer-independent enthalpy of binding and n_H is the number of protons 
being exchanged. Therefore, by repeating the titration under the same conditions, but using 
several buffers with different ionization enthalpies, there will be a linear dependence of ΔH 
versus ΔH_ion from which it is possible to estimate n_H (slope) and ΔH_0 (intercept with the axis) 
(Biltonen and Langerman 1979; Gomez and Freire 1995). 


* If n_H is zero, there is no net proton transfer. 


* If n_H is positive, there is a protonation, i.e., a proton transfer from solution to complex. 


* If n_H is negative, there is a deprotonation. 
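

The analysis based on Equation 6 can be sketched as a straight-line fit of the measured enthalpy against the buffer ionization enthalpy, followed by the sign test listed above. The data are hypothetical (the ionization enthalpies are illustrative numbers, not tabulated values for particular buffers), and the tolerance used to call n_H zero is an arbitrary choice for the example.

```python
def proton_linkage(buffer_ionization_dh, observed_dh):
    """Fit dH = dH_0 + n_H * dH_ion by least squares; return (n_H, dH_0)."""
    n = len(buffer_ionization_dh)
    mean_x = sum(buffer_ionization_dh) / n
    mean_y = sum(observed_dh) / n
    sxx = sum((x - mean_x) ** 2 for x in buffer_ionization_dh)
    sxy = sum(
        (x - mean_x) * (y - mean_y)
        for x, y in zip(buffer_ionization_dh, observed_dh)
    )
    n_h = sxy / sxx
    dh_0 = mean_y - n_h * mean_x
    return n_h, dh_0


def interpret(n_h, tolerance=0.05):
    """Classify the coupled proton transfer; the tolerance is an arbitrary example value."""
    if abs(n_h) <= tolerance:
        return "no net proton transfer"
    return "protonation" if n_h > 0 else "deprotonation"


# Hypothetical data (kcal/mol): four buffers with different ionization enthalpies.
dh_ion = [0.1, 3.5, 4.9, 11.3]
dh_obs = [-5.95, -4.25, -3.55, -0.35]
n_h, dh_0 = proton_linkage(dh_ion, dh_obs)
print(f"n_H = {n_h:.2f}, dH_0 = {dh_0:.2f} kcal/mol: {interpret(n_h)}")
```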


Amount of Sample Required 


Compared with older instruments, modern calorimeters require only small amounts of sample to 
complete an experiment. Advances in material sciences and electronics, and improvements in the 
operation design principle, allow measurements of far less than 1 µcal, making it possible to 
conduct experiments at macromolecule concentrations close to typical dissociation constants 
in biological processes. Considering that the reaction cell volume is ~1 ml, the amount of 
macromolecule required is in the range of 1-100 nanomoles (~1 mg for an average protein). 
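
The amount of macromolecule needed to fill the cell follows directly from the cell volume, the working concentration, and the molecular weight. The sketch below is a simple unit conversion with hypothetical inputs; it ignores the extra volume needed for loading and rinsing.

```python
def sample_required(cell_volume_ml, concentration_um, molecular_weight_da):
    """Return (nanomoles, milligrams) of macromolecule needed to fill the cell."""
    nanomoles = cell_volume_ml * concentration_um           # ml x µmol/liter = nmol
    milligrams = nanomoles * molecular_weight_da * 1.0e-6   # nmol x g/mol = ng; ng -> mg
    return nanomoles, milligrams


# Hypothetical case: 1-ml cell, 50 µM protein of 20 kD.
nmol, mg = sample_required(cell_volume_ml=1.0, concentration_um=50.0,
                           molecular_weight_da=20000.0)
print(f"{nmol:.0f} nmol, {mg:.1f} mg")
```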


Preparation of the Sample for Calorimetry 


To obtain an accurate thermodynamic description of the binding process under study, reac- 
tants must be of the highest purity achievable. However, minor traces of contaminants will 
sometimes not distort the interactions enough to affect the results. When considering puri- 
ty, it is important to distinguish between chemical and conformational homogeneity. Even if 
the macromolecule solution is 100% chemically pure, it is essential that it is correctly folded 
and that none of the protein is denatured. 

It is also important that the solutions of macromolecule and ligand are chemically 
equilibrated, perfectly matched in salt content, pH, and cosolvents (e.g., organic solvents such as 
ethanol and DMSO). Because ITC detects global heat effects, the thermal effect associated 
with the mixing of different buffers will be included in every injection. Therefore, if possible, 
reactants should be dialyzed against the same buffer solution (if the molecular weight of the 
ligand is too low, at least it should be dissolved in the same buffer solution and then pH- 
adjusted). Extreme care should be taken when using additional organic cosolvents (e.g., 
ethanol and DMSO), because their heat of dilution is exceptionally large. 

When performing experiments at a given temperature, samples should be cooled slight- 
ly below the temperature, because the handling of the sample while loading the calorimetric 
cell will raise the sample temperature. If the final cell temperature after loading is higher than 
the experimental temperature, the calorimeter will go through a long cooling process. 


Analysis of the Results 


Before the widespread use of computers, it was common to utilize mathematical transfor- 
mations (e.g., Scatchard plots) to linearize binding equations in order to obtain thermody- 
namic parameters from linear least-square regression. However, such manipulations often 
incurred a systematic propagation of errors and uneven distribution of statistical weights 
over the binding curve. Nowadays, it is customary to implement nonlinear fitting procedures 
to directly analyze the raw calorimetric data. Considering Equation 4, the total cumulative 
heat released or absorbed, Q_i, is proportional to the total concentration of binary complex: 


$$Q_i = V \Delta H [ML]_i \qquad (7)$$


The analysis of the calorimetric titration data can be performed by estimating the variable 
model parameters (n, K_a, and ΔH) by fitting either the cumulative heat, Q_i, or the individual 
heat, q_i, for each injection. The analysis in terms of the individual heat is more convenient 
because it eliminates error propagation associated with cumulative data. 

Even when assuming a model obeying 1:1 stoichiometry, it is useful to introduce a parameter 
n, which functions as the number of binding sites, as in the general 1:n model. When 
estimating the parameters, n should be equal to unity, within the experimental error; otherwise, 
one of the following statements would hold: 


n > 1  There is an error in the determination of the reactant concentrations (real ligand 
       concentration is lower and/or real macromolecule concentration is higher). 
       There is more than one binding site (specific binding or not) per molecule. 


n < 1  There is an error in the determination of the reactant concentrations (real ligand 
       concentration is higher and/or real macromolecule concentration is lower). 
       There is less than one binding site per molecule, i.e., the sample is not chemically 
       or conformationally homogeneous. 


In considering the general binding model 1:n, Equations 4 and 7 can be expressed in 
terms of the free concentration of ligand: 


$$q_i = V \Delta H [M]_T \left( \frac{n K_a [L]_i}{1 + K_a [L]_i} - \frac{n K_a [L]_{i-1}}{1 + K_a [L]_{i-1}} \right)$$

$$Q_i = V \Delta H [M]_T \frac{n K_a [L]_i}{1 + K_a [L]_i} \qquad (8)$$


where [M]_T is the total concentration of macromolecule. Although the binding equations, 
Equation 8, acquire an easy form in terms of the free ligand concentration, the independent 
known variable in ITC experiments is the total ligand concentration 

$$[L]_T = [L] + [ML] \qquad (9)$$


and the 1:n model permits an explicit analytical solution (Wiseman et al. 1989; Freire et al. 
1990): 


$$Q_i = V \Delta H \, \frac{1 + [M]_T n K_a + K_a [L]_{T,i} - \sqrt{\left( 1 + [M]_T n K_a + K_a [L]_{T,i} \right)^2 - 4 [M]_T n K_a^2 [L]_{T,i}}}{2 K_a}$$

$$q_i = Q_i - Q_{i-1} \qquad (10)$$

In practice, a modified version of Equation 10, where the dilution effect on the macromolecule 
concentration due to the addition of the ligand is considered, is used: 

$$Q_i = V \Delta H \, \frac{1 + [M]_{T,i} n K_a + K_a [L]_{T,i} - \sqrt{\left( 1 + [M]_{T,i} n K_a + K_a [L]_{T,i} \right)^2 - 4 [M]_{T,i} n K_a^2 [L]_{T,i}}}{2 K_a}$$

$$q_i = Q_i - Q_{i-1} \frac{[M]_{T,i}}{[M]_{T,i-1}} \qquad (11)$$


and the total concentrations of reactants after any injection i are given by: 


$$[M]_{T,i} = [M]_0 \left( 1 - \frac{v}{V} \right)^i$$

$$[L]_{T,i} = [L]_0 \left( 1 - \left( 1 - \frac{v}{V} \right)^i \right) \qquad (12)$$


where [M]_0 is the initial concentration of macromolecule in the cell, [L]_0 is the concentration 
of ligand in the syringe, v is the injection volume, and V is the cell volume. Nonlinear fitting 
of q_i as a function of [L]_T,i provides n, K_a, and ΔH as adjustable parameters. It is possible 
to avoid performing blank experiments to estimate the dilution heat effect and include in 
Equations 10-11 a constant term, q_d, representing the dilution heat effect contribution. Note 
that in the fitting procedure, q_d floats. 

In general, models with different sets of binding sites require a numerical approach in 
which, via an iterative process, the model parameters (n_j, K_a,j, and ΔH_j, where j stands for each 
set of binding sites) are determined. 
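
For the single-set 1:n model, Equations 11 and 12 can be evaluated directly. The sketch below simulates the heat per injection for a given parameter set; it is meant as an illustration of the equations rather than as fitting software. The function name is a placeholder and K_a and ΔH are hypothetical; the cell volume, injection volume, and concentrations mirror those quoted for Figure A2.10. In a fit, this function would be called repeatedly by a nonlinear least-squares routine with n, K_a, and ΔH as adjustable parameters.

```python
import math


def simulate_titration(n, k_a, delta_h, m0, l0, v_inj, v_cell, n_inj):
    """Heat of each injection for the 1:n model with dilution (Equations 11 and 12).

    Concentrations in M, volumes in liters, delta_h in kcal/mol; heats in kcal.
    """
    heats = []
    q_cum_prev = 0.0
    dilution = 1.0 - v_inj / v_cell          # equals [M]_T,i / [M]_T,i-1
    for i in range(1, n_inj + 1):
        m_t = m0 * dilution ** i             # Equation 12, macromolecule
        l_t = l0 * (1.0 - dilution ** i)     # Equation 12, ligand
        b = 1.0 + m_t * n * k_a + k_a * l_t
        bound = (b - math.sqrt(b * b - 4.0 * m_t * n * k_a ** 2 * l_t)) / (2.0 * k_a)
        q_cum = v_cell * delta_h * bound     # Q_i
        heats.append(q_cum - q_cum_prev * dilution)   # q_i, Equation 11
        q_cum_prev = q_cum
    return heats


# Hypothetical parameters: n = 1, K_a = 1e6 1/M, dH = -10 kcal/mol;
# 20 µM macromolecule, 300 µM ligand, 10-µl injections, 1.4-ml cell.
heats = simulate_titration(n=1.0, k_a=1.0e6, delta_h=-10.0,
                           m0=20e-6, l0=300e-6,
                           v_inj=10e-6, v_cell=1.4e-3, n_inj=25)
for i, q in enumerate(heats[:3], start=1):
    print(f"injection {i}: {q * 1e9:.1f} µcal")
```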


ITC: MEASURING INTERMOLECULAR INTERACTIONS TECHNIQUE 


This technique was designed for experiments performed using the VP-ITC titration 
calorimeter from MicroCal, LLC (Northampton, Massachusetts), which, at the present time, 
is the most sensitive and easy to use ITC instrument. With a VP-ITC titration calorimeter or 
similar instrument, it is possible to perform an experiment in 1 hour (including data analy- 
sis) with 1 mg of sample. If using a calorimeter other than the VP-ITC, the method described 
here may be of help as a guide. 

Pancreatic ribonuclease A (EC 3.1.27.5) and the nucleotides 2’- and 3’-cytidine mono- 
phosphate, inhibitors of its endonuclease activity, provide a model system for ITC. The bind- 
ing of these compounds, extremely exothermic and with moderate affinity, has been exten- 
sively characterized (Flogel and Biltonen 1975; Wiseman et al. 1989; Straume and Freire 
1992). 


CAUTION: See Appendix 3 for appropriate handling of materials marked with <!>.


MATERIALS

Reagents

2'CMP and 3'CMP
Prepare 2'CMP (C 7137, Sigma) and 3'CMP (C 1133, Sigma) at 1.5 mM in 15 mM potassium acetate
(pH 5.5), dissolving the compounds directly in the buffer (~1 ml). Measure the concentration of
each monophosphate by diluting it 1:20 in 100 mM sodium phosphate (pH 7.0) and using an
extinction coefficient of 7400 M⁻¹cm⁻¹ at 260 nm.

Potassium acetate (15 mM, pH 5.5) <!>

Ribonuclease A
Prepare ribonuclease A (R 5500, Sigma) at 0.1 mM in 15 mM potassium acetate (pH 5.5), dissolving
the protein in the buffer (~5 ml) and dialyzing it overnight at 4°C against 4 liters of the same buffer.
Measure the concentration of the enzyme using an extinction coefficient of 9800 M⁻¹cm⁻¹ at 278
nm.

Sodium phosphate (100 mM, pH 7.0) <!>

Equipment

Calorimeter
Glass vials (12 x 75 mm and 6 x 50 mm)
Loading syringe (a long-needle syringe)
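
The concentration measurements called for in the reagent preparations follow the Beer-Lambert law. A minimal sketch is given below, assuming a 1-cm path length. The function name and the absorbance readings are hypothetical; only the extinction coefficients and the 1:20 dilution come from the text.

```python
def concentration_from_absorbance(absorbance, extinction_coefficient,
                                  dilution_factor=1.0, path_length_cm=1.0):
    """Molar concentration of the undiluted stock from a measured absorbance."""
    if extinction_coefficient <= 0 or path_length_cm <= 0:
        raise ValueError("extinction coefficient and path length must be positive")
    return absorbance * dilution_factor / (extinction_coefficient * path_length_cm)


# Hypothetical readings: nucleotide diluted 1:20 and read at 260 nm,
# enzyme read undiluted at 278 nm.
nucleotide = concentration_from_absorbance(0.555, 7400.0, dilution_factor=20.0)
enzyme = concentration_from_absorbance(0.980, 9800.0)
print(f"nucleotide stock: {nucleotide * 1e3:.2f} mM")
print(f"ribonuclease A:   {enzyme * 1e3:.2f} mM")
```

The concentrations obtained this way, rather than the nominal 1.5 mM and 0.1 mM, are the values to enter as run parameters.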
METHOD


1. Equilibrate the calorimeter at 24.5°C.

2. Prepare 2.2 ml of ribonuclease A, 0.5 ml of nucleotide (in a small vial), and 10 ml of
buffer. Degas all solutions with a vacuum pump for 15 minutes.

3. While the solutions are degassing, thoroughly clean the calorimetric reaction cell.

4. Use the loading syringe to fill the reference cell with H2O or 15 mM potassium acetate (pH
5.5) (the latter may help to reduce noise level).

5. Rinse the reaction cell with 15 mM potassium acetate (pH 5.5). Slowly inject the ribonu-
clease solution into the reaction cell. Carefully remove bubbles.

6. Fill the 250-µl injection syringe with the nucleotide solution. Rinse the outside of the
syringe tip with 15 mM potassium acetate (pH 5.5) or H2O and dry it.

Although the syringe nominally indicates 250 µl, the available total injection volume is 309
µl. Filling the syringe incompletely will generate nonuniform injections.

7. Carefully insert the injection syringe into the reaction cell. Avoid bending the needle or
mixing the liquids inside and outside the needle.

8. Equilibrate the calorimeter at 25°C.


9. Set the running parameters for the experiment as follows:

Number of injections: 30
Reference power^a: 10 µcal/sec
Concentration in syringe: ~1.5 mM
Stirring speed: 490 rpm
Feedback mode^b: high
Comments: ad lib
Duration: automatically set according to the injection volume
Filter: 2 seconds
Temperature: 25°C
Initial delay: 180 seconds
Concentration in cell: ~100 µM
File name: *.itc
Equilibration options^c: fast
Injection volume: first 2 µl, the rest 8 µl
Spacing between injections: 400 seconds

Experiments with other molecules may require different parameter values.

^a If the reaction is expected to be so exothermic and/or the concentration of ligand is so high
that the heat effect in the first injections exceeds ~200 µcal, raise the reference power to pre-
vent the signal from going below zero.

^b Set the feedback to high feedback gain; otherwise a drop in performance will occur. Given
a heat effect, the peak corresponding to the injection will be smaller and broader (i.e., there
will be a loss in sensitivity and an increase in the time for running an experiment).

^c In the equilibration options, compared to the other choices, fast equilibration is more con-
venient because it permits simultaneous thermal and mechanical equilibration, and it
allows the user to manually start the injection sequence whenever the baseline is good (see
Step 10).
FIGURE A2.12. Titrations of ribonuclease A with 2'CMP (left) and 3'CMP (right). Experiments were per-
formed in 15 mM potassium acetate (pH 5.5) at 25°C. The concentrations of reactants are 80 µM ribonu-
clease A and 1.3 mM 2'CMP and 3'CMP. Because the titrations were carried out using the same reactant
concentrations and experimental conditions, from the steeper titration for 2'CMP it is possible to conclude
that this compound is a more potent ligand than 3'CMP. The solid lines correspond to theoretical curves
with n = 1.02, K_a = 2.9 x 10* M⁻¹, and ΔH = -19.3 kcal/mole for 2'CMP and n = 0.98, K_a = 2.4 x 10* M⁻¹, and
ΔH = -18.6 kcal/mole for 3'CMP.


10. Start the experiment. There will be thermal and mechanical equilibration stages. Initiate
the injection sequence after a stable, drift-free, noise-free baseline occurs (as seen in the
1 µcal/sec scale).
The first injection is usually erroneous because there is some mixing of the solutions inside
and outside of the needle when inserting the injection syringe and/or during the time
required to reach thermal and mechanical equilibration. Therefore, set the first injection to
be a smaller volume than subsequent injections to minimize the amount of wasted ligand.
In the data analysis, the first injection will be taken into consideration for the calculation of
the total ligand concentration in the cell. However, its associated heat effect will not be
included in the fitting procedure.


11. Use the software provided by MicroCal for data analysis according to the equations
described previously. Results should be consistent with the information shown in Figure
A2.12.
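
The injection schedule in step 9 and the treatment of the first injection in step 10 can be checked with a short script before analysis. The sketch below is illustrative only: the function names, the 1.4-ml cell volume, and the list of heats are hypothetical, and the script is not part of the MicroCal software. It confirms that the planned injections fit within the nominal syringe volume, tracks the total ligand concentration after every injection (including the first), and drops only the heat of the first injection from the points passed to the fit.

```python
def run_volumes(injections=30, first_ul=2.0, other_ul=8.0):
    """Injection volumes in microliters: a small first injection, then equal ones."""
    return [first_ul] + [other_ul] * (injections - 1)


def ligand_after_each_injection(volumes_ul, syringe_conc, cell_volume_ul):
    """Total ligand concentration in the cell after each injection (dilution included)."""
    ligand = 0.0
    totals = []
    for v in volumes_ul:
        fraction = v / cell_volume_ul
        # Existing ligand is diluted by the injection; fresh ligand is added.
        ligand = ligand * (1.0 - fraction) + syringe_conc * fraction
        totals.append(ligand)
    return totals


def fitting_points(heats, volumes_ul, syringe_conc, cell_volume_ul):
    """Pair heats with ligand totals, omitting only the heat of the first injection."""
    totals = ligand_after_each_injection(volumes_ul, syringe_conc, cell_volume_ul)
    return list(zip(totals, heats))[1:]


volumes = run_volumes()
assert sum(volumes) <= 250.0  # nominal syringe volume from step 6

# Hypothetical measured heats in ucal, one per injection.
heats = [-40.0] + [-220.0 + 7.0 * i for i in range(29)]
points = fitting_points(heats, volumes, syringe_conc=1.5e-3, cell_volume_ul=1400.0)
print(f"total injected: {sum(volumes):.0f} ul, points used in fit: {len(points)}")
```

Because the ligand totals are accumulated before the first point is discarded, the ligand delivered by the first injection still contributes to the concentration assigned to every later injection.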
http://prowl.rockefeller.edu/aainfo/contents.htm Amino acid information, PROWL, protein chemistry
and mass spectrometry resource developed in collaboration between ProteoMetrics and Rockefeller
University.

http://sosui.proteome.bio.tuat.ac.jp/sosuiframe0.html SOSUI system, Department of Biotechnology,
Tokyo University of Agriculture and Technology.
Cautions


THE FOLLOWING GENERAL CAUTIONS should always be observed.


Become completely familiar with the properties of substances used before beginning the proce- 
dure. 
The absence of a warning does not necessarily mean that the material is safe, since information may
not always be complete or available.
If exposed to toxic substances, contact your local safety office immediately for instructions. 
Use proper disposal procedures for all chemical, biological, and radioactive waste. 
For specific guidelines on appropriate gloves, consult your local safety office. 
Handle concentrated acids and bases with great care. Wear goggles and appropriate gloves. Wear a 
face shield when handling large quantities. 
Do not mix strong acids with organic solvents as they may react. Sulfuric acid and nitric acid espe- 
cially may react highly exothermically and cause fires and explosions. 
Do not mix strong bases with halogenated solvents as they may form reactive carbenes which can
lead to explosions.
Never pipette solutions using mouth suction. This method is not sterile and can be dangerous. Always
use a pipette aid or bulb.
Keep halogenated and nonhalogenated solvents separate (e.g., mixing chloroform and acetone
can cause unexpected reactions in the presence of bases). Halogenated solvents are organic solvents
such as chloroform, dichloromethane, trichlorotrifluoroethane, and dichloroethane. Some nonhalo-
genated solvents are pentane, heptane, ethanol, methanol, benzene, toluene, N,N-dimethylfor-
mamide (DMF), dimethylsulfoxide (DMSO), and acetonitrile.
Laser radiation, visible or invisible, can cause severe damage to the eyes and skin. Take proper pre-
cautions to prevent exposure to direct and reflected beams. Always follow the manufacturer's safety
guidelines and consult your local safety office. See caution below for more detailed information.
Flash lamps, due to their light intensity, can be harmful to the eyes. They also may explode on occa-
sion. Wear appropriate eye protection and follow the manufacturer's guidelines.
Photographic fixatives and developers also contain chemicals that can be harmful. Handle them
with care and follow manufacturer's directions.
Power supplies and electrophoresis equipment pose serious fire and electrical shock hazards
if not used properly.
Microwave ovens and autoclaves in the lab require certain precautions. Accidents have occurred 
involving their use (e.g., to melt agar or bacto-agar stored in bottles or to sterilize). If the screw top
is not completely removed and there is not enough space for the steam to vent, the bottles can 
explode and cause severe injury when the containers are removed from the microwave or autoclave. 
Always completely remove bottle caps before microwaving or autoclaving. An alternative method for 
routine agarose gels that do not require sterile agar is to weigh out the agar and place the solution in 
a flask. 
Ultra-sonicators use high-frequency sound waves (16-100 kHz) for cell disruption and other pur-
poses. This "ultrasound," conducted through air, does not pose a direct hazard to humans, but the
associated high volumes of audible sound can cause a variety of effects, including headache, nausea,
and tinnitus. Avoid direct contact of the body with high-intensity ultrasound (not medical imaging equip-
ment). Use appropriate ear protection and display signs on the door(s) of laboratories where the units are used.
Use extreme caution when handling cutting devices such as microtome blades, scalpels, razor blades, or nee-
dles. Microtome blades are extremely sharp! Use care when sectioning. If unfamiliar with their use, have some-
one demonstrate proper procedures. For proper disposal, use the "sharps" disposal container in the lab. Discard
used needles unshielded, with the syringe still attached. This prevents injuries (and possible infections; see
Biological Safety) while manipulating used needles since many accidents occur while trying to replace the nee-
dle shield. Injuries may also be caused by broken Pasteur pipettes, coverslips, or slides.


GENERAL PROPERTIES OF COMMON CHEMICALS 


The hazardous materials list can be summarized in the following categories:


Inorganic acids, such as hydrochloric, sulfuric, nitric, or phosphoric, are colorless liquids with stinging vapors.
Avoid spills on skin or clothing. Spills should be diluted with large amounts of water. The concentrated forms 
of these acids can destroy paper, textiles, and skin as well as cause serious injury to the eyes. 

Inorganic bases such as sodium hydroxide are white solids that dissolve in water with the evolution of heat.
Concentrated solutions will slowly dissolve skin and even fingernails.

Salts of heavy metals are usually colored powdered solids which dissolve in water. Many of them are potent
enzyme inhibitors and therefore toxic to humans and to the environment (e.g., fish and algae).


Most organic solvents are flammable volatile liquids. Avoid breathing the vapors, which can cause nausea or
dizziness. Also avoid skin contact.

Other organic compounds, including organosulphur compounds such as mercaptoethanol or organic amines, 
can have very unpleasant odors. Others are highly reactive and should be handled with appropriate care. 

If improperly handled, dyes and their solutions can stain not only the sample, but also skin and clothing. Some
of them are also mutagenic (e.g., ethidium bromide), carcinogenic, and toxic.

All names ending with "ase" (e.g., catalase, β-glucuronidase, or zymolase) refer to enzymes. There are also other
enzymes with nonsystematic names like pepsin. Many of them are provided by manufacturers in preparations
containing buffering substances, etc. Be aware of the individual properties of materials contained in these sub-
stances.

Toxic compounds are often used to manipulate cells. They can be dangerous and should be handled appropri-
ately. 

Be aware that several of the compounds listed have not been thoroughly studied with respect to their toxico-
logical properties. Handle each chemical with the appropriate respect. Although the toxic effects of a compound
can be quantified (e.g., LD50 values), this is not possible for carcinogens or mutagens where one single exposure
can have an effect. Also realize that dangers related to a given compound may also depend on its physical state
(fine powder vs. large crystals/diethylether vs. glycerol/dry ice vs. carbon dioxide under pressure in a gas bomb).
Anticipate under which circumstances during an experiment exposure is most likely to occur and how best to
protect yourself and your environment.
HAZARDOUS MATERIALS


In general, proprietary materials are not listed here. Follow the manufacturer's safety guidelines that accompany
the product. 

Acetic acid (glacial) is highly corrosive and must be handled with great care. Liquid and mist cause severe burns 
to all body tissues. It may be harmful by inhalation, ingestion, or skin absorption. Wear appropriate gloves and 
goggles and use in a chemical fume hood. Keep away from heat, sparks, and open flame. 

Acetic anhydride is extremely destructive to the skin, eyes, mucous membranes, and upper respiratory tract. It 
may be harmful by inhalation, ingestion, or skin absorption. Wear appropriate gloves and safety glasses and use
in a chemical fume hood. 

Acetone causes eye and skin irritation and is irritating to mucous membranes and upper respiratory tract. Do not
breathe the vapors. It is also extremely flammable. Wear appropriate gloves and safety glasses. 

Acetonitrile (Methyl cyanide) is very volatile and extremely flammable. It is an irritant and a chemical asphyxi-
ant that can exert its effects by inhalation, ingestion, or skin absorption. Treat cases of severe exposure as cyanide
poisoning. Wear appropriate gloves and safety glasses and use only in a chemical fume hood. Keep away from heat, 
sparks, and open flame. 

Acrylamide (unpolymerized) is a potent neurotoxin and is absorbed through the skin (the effects are cumulative).
Avoid breathing the dust. Wear appropriate gloves and a face mask when weighing powdered acrylamide and 
methylene-bisacrylamide. Use in a chemical fume hood. Polyacrylamide is considered to be nontoxic, but it 
should be handled with care because it might contain small quantities of unpolymerized acrylamide.

AgNO3, see Silver nitrate

ε-Amino-n-caproic acid (6-Aminocaproic acid; Hexanoic acid) may be harmful by inhalation, ingestion, or skin
absorption. Wear appropriate gloves and safety glasses. 

γ-Amino-n-butyric acid (GABA), see Butyric acid

3-Aminopropyltriethoxysilane, see Silane 

Ammonia, NH3, is corrosive, toxic, and can be explosive. It may be harmful by inhalation, ingestion, and skin
absorption. Use only with mechanical exhaust. Wear appropriate gloves and safety glasses.

Ammonium bicarbonate, NH4HCO3, may be harmful by inhalation, ingestion, or skin absorption. Wear appro-
priate gloves and safety glasses and use in a chemical fume hood.

Ammonium carbonate, (NH4)2CO3, may be harmful by inhalation, ingestion, or skin absorption. Wear appro-
priate gloves and safety glasses and use in a chemical fume hood.

Ammonium hydrogen carbonate, see Ammonium bicarbonate 

Ammonium hydroxide, NH4OH, is a solution of ammonia in water. It is caustic and should be handled with great
care. As ammonia vapors escape from the solution, they are corrosive, toxic, and can be explosive. Use only with
mechanical exhaust. Wear appropriate gloves and use only in a chemical fume hood.

Ammonium molybdate, (NH4)6Mo7O24·4H2O (or its tetrahydrate), may be harmful by inhalation, ingestion, or
skin absorption. Wear appropriate gloves and safety glasses and use in a chemical fume hood.

Ammonium persulfate, (NH4)2S2O8, is extremely destructive to tissue of the mucous membranes and upper res-
piratory tract, eyes, and skin. Inhalation may be fatal. Wear appropriate gloves, safety glasses, and protective cloth-
ing and use only in a chemical fume hood. Wash thoroughly after handling.

Ammonium sulfate, (NH4)2SO4, may be harmful by inhalation, ingestion, or skin absorption. Wear appropriate
gloves and safety glasses. 

Aprotinin may be harmful by inhalation, ingestion, or skin absorption. It may also cause allergic reactions. 
Exposure may cause gastrointestinal effects, muscle pain, blood pressure changes, or bronchospasm. Wear appro- 
priate gloves and safety glasses and use only in a chemical fume hood. Do not breathe the dust. 

Arc lamps are potentially explosive. Follow manufacturer's guidelines. When turning on arc lamps, make sure
nearby computers are turned off to avoid damage from electromagnetic wave components. Computers may be
restarted once the arc lamps are in operation. 

Argon is a nonflammable high-pressure gas. It may be harmful by inhalation, ingestion, or skin absorption. Wear
appropriate gloves and safety goggles. Use with sufficient ventilation and do not breathe the gas.

p-Azidophenacyl bromide is highly flammable and corrosive. It may be harmful by inhalation, ingestion, and skin
absorption. Wear appropriate gloves and safety goggles and use only in a chemical fume hood. Do not breathe the
dust. Keep away from heat, sparks, and open flame.
Bisacrylamide is a potent neurotoxin and is absorbed through the skin (the effects are cumulative). Avoid breath-
ing the dust. Wear appropriate gloves and a face mask when weighing powdered acrylamide and methylene-
bisacrylamide.

Bleach (sodium hypochlorite), NaOCl, is poisonous, can be explosive, and may react with organic solvents. It
may be fatal by inhalation and is also harmful by ingestion and destructive to the skin. Wear appropriate gloves 
and safety glasses and use in a chemical fume hood. 

Blood (human) and blood products and Epstein-Barr virus. Human blood, blood products, and tissues may
contain occult infectious materials such as hepatitis B virus and HIV that may result in laboratory-acquired infec- 
tions. Investigators working with EBV-transformed lymphoblast cell lines are also at risk of EBV infection. Any 
human blood, blood products, or tissues should be considered a biohazard and should be handled accordingly. 
Wear disposable appropriate gloves, use mechanical pipetting devices, work in a biological safety cabinet, protect 
against the possibility of aerosol generation, and disinfect all waste materials before disposal. Autoclave contami-
nated plasticware before disposal; autoclave contaminated liquids or treat with bleach (10% [v/v] final concen-
tration) for at least 30 minutes before disposal. Consult the local institutional safety officer for specific handling
and disposal procedures. 

Bromophenol blue may be harmful by inhalation, ingestion, or skin absorption. Wear appropriate gloves and 
safety glasses and use in a chemical fume hood. 

n-Butanol is irritating to the mucous membranes, upper respiratory tract, skin, and especially the eyes. Avoid 
breathing the vapors. Wear appropriate gloves and safety glasses and use in a chemical fume hood. n-butanol is 
also highly flammable. Keep away from heat, sparks, and open flame. 

sec-Butanol is irritating to the mucous membranes, upper respiratory tract, the skin, and especially the eyes. 
Avoid breathing the vapors. Wear appropriate gloves and safety glasses and use in a chemical fume hood. sec- 
butanol is also highly flammable. Keep away from heat, sparks, and open flame. 

Butyl chloride is highly flammable. Keep away from heat, sparks, and open flame. It may be harmful by inhala-
tion, ingestion, and skin absorption. Wear appropriate gloves and safety goggles. Do not breathe vapor.

Calcium chloride is hygroscopic and may cause cardiac disturbances. It may be harmful by inhalation, ingestion, 
or skin absorption. Do not breathe the dust. Wear appropriate gloves and safety goggles. 

CAPS, see 3-(Cyclohexylamino)-1-propanesulfonic acid 

Carbon dioxide, CO2, in all forms may be fatal by inhalation, ingestion, or skin absorption. In high concentra-
tions, it can paralyze the respiratory center and cause suffocation. Use only in well-ventilated areas. In the form of
dry ice, contact with carbon dioxide can also cause frostbite. Do not place large quantities of dry ice in enclosed 
areas such as cold rooms. Wear appropriate gloves and safety goggles. 

Cetyltrimethylammonium bromide (CTAB) is toxic and an irritant and may be harmful by inhalation, ingestion,
or skin absorption. Wear appropriate gloves and safety glasses. Avoid breathing the dust.

CHAPS, see 3-[(3-Cholamidopropyl)dimethyl-ammonio]-1-propanesulfonate

Chloroform, CHCl3, is irritating to the skin, eyes, mucous membranes, and respiratory tract. It is a carcinogen
and may damage the liver and kidneys. It is also volatile. Avoid breathing the vapors. Wear appropriate gloves and
safety glasses and always use in a chemical fume hood.

3-[(3-Cholamidopropyl)dimethyl-ammonio]-1-propanesulfonate (CHAPS) is an irritant and may be harmful
by inhalation, ingestion, or skin absorption. Wear appropriate gloves and safety glasses. 

Citraconic anhydride is highly toxic and may be harmful by inhalation, ingestion, or skin absorption. Wear 
appropriate gloves and safety goggles. Avoid inhalation. 

Citric acid is an irritant and may be harmful by inhalation, ingestion, or skin absorption. It poses a risk of seri- 
ous damage to the eyes. Wear appropriate gloves and safety goggles. Do not breathe the dust. 

Coomassie Brilliant Blue may be harmful by inhalation, ingestion, or skin absorption. Wear appropriate gloves 
and safety glasses. 

p-Cresol may be fatal if inhaled, ingested, or absorbed through the skin. It is corrosive and is extremely destruc- 
tive to the eyes, skin, mucous membranes, and upper respiratory tract. Wear appropriate gloves and safety glass- 
es and use in a chemical fume hood. 

Crystal Violet can cause severe burns. It may be harmful by inhalation, ingestion, and skin absorption. Wear 
appropriate gloves and safety goggles and use in a chemical fume hood. Do not breathe the dust.


CTAB, see Cetyltrimethylammonium bromide 


β-Cyanoethyl diisopropylchlorophosphoramidite (2-Cyanoethyl diisopropylchlorophosphoramidite) is high-
ly corrosive and causes burns. Heating may cause an explosion. It is extremely destructive to mucous membranes
and upper respiratory tract, eyes, and skin. Wear appropriate gloves and safety goggles and use in a chemical fume
hood. Do not breathe the vapor. Keep away from heat, sparks, and open flame.

α-Cyano-4-hydroxycinnamic acid may cause cardiac disturbances. Chronic effects may be delayed. It may be
harmful by inhalation, ingestion, or skin absorption. Wear appropriate gloves and safety glasses. 

Cyanogen bromide is extremely toxic and is volatile. It may be fatal by inhalation, ingestion, or skin absorption.
Do not breathe the vapors. Wear appropriate gloves and always use in a chemical fume hood. Keep away from
acids.

3-(Cyclohexylamino)-1-propanesulfonic acid (CAPS) is an irritant and may be harmful by inhalation, ingestion,
or skin absorption. Wear appropriate gloves and safety glasses. Do not breathe the dust. 

DCM, see Dichloromethane 

DHB, see 2,5-dihydroxybenzoic acid 

Dichloromethane (DCM), CH2Cl2 (also known as Methylene chloride), is toxic if inhaled, ingested, or absorbed
through the skin. It is also an irritant and is suspected to be a carcinogen. Wear appropriate gloves and safety glass-
es and use in a chemical fume hood. Do not breathe the vapors.

Digitonin may be fatal if inhaled, ingested, or absorbed through the skin. Wear appropriate gloves and safety 
glasses and use in a chemical fume hood. 

2,5-Dihydroxybenzoic acid (DHB) may be harmful by inhalation, ingestion, or skin absorption. Wear appropri-
ate gloves and safety glasses. Do not breathe the dust.

Diisopropylethylamine (DIEA) is extremely destructive to the mucous membranes, upper respiratory tract, skin, 
and eyes. It may be harmful by ingestion or skin absorption. Inhalation may be fatal. Wear appropriate gloves and 
safety glasses and always use in a chemical fume hood. Keep away from heat, sparks, and open flame.

N,N-Dimethylformamide (DMF), HCON(CH3)2, is a possible carcinogen and is irritating to the eyes, skin, and
mucous membranes. It can exert its toxic effects through inhalation, ingestion, or skin absorption. Chronic
inhalation can cause liver and kidney damage. Wear appropriate gloves and safety glasses and use in a chemical 
fume hood. 

Dimethyl sulfate (DMS), (CH3)2SO4, is extremely toxic and is a carcinogen. Avoid breathing the vapors. Wear
appropriate gloves and safety glasses and use only in a chemical fume hood. Dispose of solutions containing
dimethyl sulfate by pouring them slowly into a solution of sodium hydroxide or ammonium hydroxide and allow-
ing them to sit overnight in the chemical fume hood. Contact your local safety office before reentering the lab to
clean up a spill. 

Dimethyl sulfoxide (DMSO) may be harmful by inhalation or skin absorption. Wear appropriate gloves and safe-
ty glasses and use in a chemical fume hood. DMSO is also combustible. Store in a tightly closed container. Keep
away from heat, sparks, and open flame.

Dioctylphthalate is toxic and is suspected of being a carcinogen. It may be harmful by inhalation, ingestion, or
skin absorption. Wear appropriate gloves and safety goggles.

1,4-Dioxane is highly flammable both in liquid and vapor form. It is a possible carcinogen and is highly toxic by 
inhalation, ingestion, or skin absorption. Do not breathe the vapor. Wear appropriate gloves and safety glasses. 
Keep away from heat, sparks, and open flame. 

Dithiothreitol (DTT) is a strong reducing agent that emits a foul odor. It may be harmful by inhalation, inges-
tion, or skin absorption. When working with the solid form or highly concentrated stocks, wear appropriate gloves
and safety glasses and use in a chemical fume hood. 

DMF, see N,N-Dimethylformamide

Dry ice, see Carbon dioxide 

EDC, see N-ethyl-N'-(dimethylaminopropyl)-carbodiimide

Epoxy and acrylic resins, see Resins 

Ethanol (EtOH), CH3CH2OH, may be harmful by inhalation, ingestion, or skin absorption. Wear appropriate
gloves and safety glasses. 

Ethanolamine, HOCH2CH2NH2, is toxic and harmful by inhalation, ingestion, or skin absorption. Handle with
care and avoid any contact with the skin. Wear appropriate gloves and goggles and use in a chemical fume hood.
Ethanolamine is highly corrosive and reacts violently with acids.

Ether, see 1,4-Dioxane 

1-Ethyl-3-[3-dimethylaminopropyl]carbodiimide (EDC) is irritating to the mucous membranes and upper res-
piratory tract. It may be harmful by inhalation, ingestion, or skin absorption. Wear appropriate gloves and safety
glasses. Handle with care.
N-Ethylmorpholine may be harmful by inhalation, ingestion, or skin absorption. Wear appropriate gloves and
safety glasses. 


Ferric chloride, FeCl3, may be harmful by inhalation, ingestion, or skin absorption. Wear appropriate gloves and
safety glasses and use only in a chemical fume hood.

Formaldehyde, HCHO, is highly toxic and volatile. It is also a possible carcinogen. It is readily absorbed through
the skin and is irritating or destructive to the skin, eyes, mucous membranes, and upper respiratory tract. Avoid
breathing the vapors. Wear appropriate gloves and safety glasses and always use in a chemical fume hood. Keep
away from heat, sparks, and open flame.

Formamide is teratogenic. The vapor is irritating to the eyes, skin, mucous membranes, and upper respiratory
tract. It may be harmful by inhalation, ingestion, or skin absorption. Wear appropriate gloves and safety glasses
and always use a chemical fume hood when working with concentrated solutions of formamide. Keep working
solutions covered as much as possible.

Formic acid, HCOOH, is highly toxic and extremely destructive to tissue of the mucous membranes, upper res-
piratory tract, eyes, and skin. It may be harmful by inhalation, ingestion, or skin absorption. Wear appropriate
gloves and safety glasses (or face shield) and use in a chemical fume hood.

Glutaraldehyde is toxic. It is readily absorbed through the skin and is irritating or destructive to the skin, eyes,
mucous membranes, and upper respiratory tract. Wear appropriate gloves and safety glasses and always use in a
chemical fume hood.

Glycine may be harmful by inhalation, ingestion, or skin absorption. Wear gloves and safety glasses. Avoid breath-
ing the dust.

Guanidine hydrochloride is irritating to the mucous membranes, upper respiratory tract, skin, and eyes. It may
be harmful by inhalation, ingestion, or skin absorption. Wear appropriate gloves and safety glasses. Avoid breath-
ing the dust.

Guanidinium hydrochloride, see Guanidine hydrochloride

HCOOH, see Formic acid

Helium (gas) may cause frostbite and may be harmful by inhalation, ingestion, or skin absorption. Wear appro-
priate gloves and safety goggles. Do not wear contact lenses.

Heptafluorobutyric acid is corrosive and may be harmful by inhalation, ingestion, or skin absorption. Wear
appropriate gloves and safety goggles.

Heptane may be harmful by inhalation, ingestion, or skin absorption. Wear appropriate gloves and safety glasses.
It is extremely flammable. Keep away from heat, sparks, and open flame.

Hexane is extremely flammable and may be harmful by inhalation, ingestion, or skin absorption. Wear appropri-
ate gloves and safety glasses and use only in a chemical fume hood. Keep away from heat, sparks, and open flame.

HNO3, see Nitric acid

H2NOH, see Hydroxylamine

H3PO4, see Phosphoric acid

H2SO4, see Sulfuric acid

Human tissues/fluids, see Blood (human) and blood products

Hydrochloric acid, HCl, is volatile and may be fatal if inhaled, ingested, or absorbed through the skin. It is
extremely destructive to mucous membranes, upper respiratory tract, eyes, and skin. Wear appropriate gloves and
safety glasses and use with great care in a chemical fume hood. Wear goggles when handling large quantities.

Hydrogen fluoride is extremely toxic, corrosive, and can cause severe burns. It may be harmful by inhalation,
ingestion, and skin absorption. Wear appropriate gloves and safety goggles and use only in a chemical fume hood.

Hydroxylamine, H2NOH, is corrosive and toxic. It may be harmful by inhalation, ingestion, or skin absorption.
Wear appropriate gloves and safety glasses and use only in a chemical fume hood.

Imidazole is corrosive and may be harmful by inhalation, ingestion, or skin absorption. Wear appropriate gloves
and safety glasses and use in a chemical fume hood.

Iodoacetamide (IAA), C2H4INO, can alkylate amino groups in proteins and can therefore cause problems if the
antigen is being purified for amino acid sequencing. It is toxic and harmful by inhalation, ingestion, or skin
absorption. Wear appropriate gloves and safety glasses and use only in a chemical fume hood. Do not breathe the
dust.

Iodoacetic acid, see Acetic acid

Isopropanol is flammable and irritating. It may be harmful by inhalation, ingestion, or skin absorption. Wear
appropriate gloves and safety glasses. Do not breathe the vapor. Keep away from heat, sparks, and open flame.
K3Fe(CN)6, see Potassium ferricyanide

KH2PO4/K2HPO4/K3PO4, see Potassium phosphate

Leupeptin (or its hemisulfate) may be harmful by inhalation, ingestion, or skin absorption. Wear appropriate
gloves and safety glasses and use in a chemical fume hood. 


Magnesium acetate tetrahydrate may be harmful by inhalation, ingestion, or skin absorption. Wear appropriate
gloves and safety glasses.


Magnesium chloride, MgCl2, may be harmful by inhalation, ingestion, or skin absorption. Wear appropriate
gloves and safety glasses and use in a chemical fume hood. 

β-Mercaptoethanol (2-Mercaptoethanol), HOCH₂CH₂SH, may be fatal if inhaled or absorbed through the skin
and is harmful if ingested. High concentrations are extremely destructive to the mucous membranes, upper
respiratory tract, skin, and eyes. β-Mercaptoethanol has a very foul odor. Wear appropriate gloves and safety glasses
and always use in a chemical fume hood.


MES, see 2-(N-Morpholino)ethanesulfonic acid

Methanol, MeOH or H₃COH, is poisonous and can cause blindness. It may be harmful by inhalation, ingestion,
or skin absorption. Adequate ventilation is necessary to limit exposure to vapors. Avoid inhaling these vapors.
Wear appropriate gloves and goggles and use only in a chemical fume hood.


Methyl acetate is extremely flammable in both liquid and vapor form. Vapor may cause flash fire. It may be harmful
by inhalation, ingestion, or skin absorption. Wear appropriate gloves and safety glasses. Keep away from heat,
sparks, and open flame.

Methyl cyanide, see Acetonitrile 


Methylene blue is irritating to the eyes and skin. It may be harmful by inhalation, ingestion, or skin absorption.
Wear appropriate gloves and safety glasses. 


Methyl green may be harmful by inhalation, ingestion, or skin absorption. Wear appropriate gloves and safety 
glasses and use in a chemical fume hood. 


Methyl iodide may be fatal and can affect the central nervous system. It is also a carcinogen and an irritant. It is
harmful by inhalation, ingestion, or skin absorption. Wear appropriate gloves and safety goggles and use only in 
a chemical fume hood. 


2-(N-Morpholino)ethanesulfonic acid (MES) may be harmful by inhalation, ingestion, or skin absorption. Wear
appropriate gloves and safety glasses. 


3-(N-Morpholino)propanesulfonic acid (MOPS) may be harmful by inhalation, ingestion, or skin absorption.
It is irritating to mucous membranes and upper respiratory tract. Wear appropriate gloves and safety glasses and
use in a chemical fume hood.


Na₂CO₃, see Sodium carbonate

NaF, see Sodium fluoride

NaH₂PO₄/Na₂HPO₄/Na₃PO₄, see Sodium phosphate

NaN₃, see Sodium azide

Na₃VO₄, see Sodium orthovanadate

(NH₄)₂CO₃, see Ammonium carbonate

NH₄HCO₃, see Ammonium bicarbonate

(NH₄)₆Mo₇O₂₄·4H₂O, see Ammonium molybdate

(NH₄)₂SO₄, see Ammonium sulfate

(NH₄)₂S₂O₈, see Ammonium persulfate

Nitric acid, HNO₃, is volatile and must be handled with great care. It is toxic by inhalation, ingestion, and skin
absorption. Wear appropriate gloves and safety goggles and use in a chemical fume hood. Do not breathe the
vapors. Keep away from heat, sparks, and open flame.

Nitrogen, liquid or gaseous, can cause severe damage due to extreme temperature. Handle frozen samples with
extreme caution. Do not breathe the vapors. Seepage of liquid nitrogen into frozen vials can result in an exploding
tube upon removal from liquid nitrogen. Use vials with O-rings when possible. Wear cryo-mitts and a face mask.

Pepstatin may be harmful by inhalation, ingestion, or skin absorption. Wear appropriate gloves and safety
glasses and use in a chemical fume hood.

Phenyl isocyanate is highly toxic, combustible, and flammable. It may be harmful by inhalation, ingestion, or skin
absorption. It can cause burns. Wear appropriate gloves and safety goggles. Keep away from heat, sparks, and open
flame.

Phenylmethylsulfonyl fluoride (PMSF), C₇H₇FO₂S or C₆H₅CH₂SO₂F, is a highly toxic cholinesterase inhibitor.
It is extremely destructive to the mucous membranes of the respiratory tract, eyes, and skin. It may be fatal by
inhalation, ingestion, or skin absorption. Wear appropriate gloves and safety glasses and always use in a chemical
fume hood. In case of contact, immediately flush eyes or skin with copious amounts of water and discard
contaminated clothing.

Phosphoric acid, H₃PO₄, is highly corrosive and is harmful by inhalation, ingestion, or skin absorption. Wear
appropriate gloves and safety glasses. Do not inhale the vapor. 

α-Picoline is flammable and the effects from exposure are acute. It is harmful by inhalation, ingestion, or skin
absorption. Wear appropriate gloves and safety goggles. 


β-Picoline is flammable and the effects from exposure are acute. It is harmful by inhalation, ingestion, or skin
absorption. Wear appropriate gloves and safety goggles. 


PMSF, see Phenylmethylsulfonyl fluoride


Polyethylenimine is highly corrosive and causes eye and skin burns. It may be harmful by inhalation, ingestion,
or skin absorption. Wear appropriate gloves and safety goggles.

Ponceau S is an irritant and may be harmful by inhalation, ingestion, or skin absorption. Wear appropriate gloves
and safety glasses. 

Potassium carbonate may be harmful by inhalation, ingestion, or skin absorption. Wear appropriate gloves and
safety glasses and use in a chemical fume hood. 

Potassium chloride, KCl, may be harmful by inhalation, ingestion, or skin absorption. Wear appropriate gloves 
and safety glasses. 

Potassium ferricyanide, K₃Fe(CN)₆, may be fatal by inhalation, ingestion, or skin absorption. Wear appropriate
gloves and safety glasses and always use with extreme care in a chemical fume hood. Keep away from strong acids.

Potassium hydroxide, KOH and KOH/methanol, is highly toxic and may be fatal if swallowed. It may be harmful
by inhalation, ingestion, or skin absorption. Solutions are corrosive and can cause severe burns. It should be
handled with great care. Wear appropriate gloves and safety goggles.


Potassium phosphate, KH₂PO₄/K₂HPO₄/K₃PO₄, may be harmful by inhalation, ingestion, or skin absorption.
Wear appropriate gloves and safety glasses. Do not breathe the dust. K₂HPO₄·3H₂O is dibasic and KH₂PO₄ is
monobasic.


Propane is a flammable high-pressure liquid and gas and may form explosive mixtures with air. Vapor may trav- 
el a considerable distance to source of ignition and flash back. Keep away from heat, sparks, and flame. Wear 
appropriate gloves and safety glasses and use a mechanical exhaust. 

Pyridine is highly toxic and extremely destructive to the mucous membranes, upper respiratory tract, skin, and
eyes. It may be harmful by inhalation, ingestion, or skin absorption. It is a possible mutagen and may cause male
infertility. Keep away from heat, sparks, and open flame. Wear appropriate gloves and safety glasses and always use
in a chemical fume hood.


Pyronin Y may be mutagenic. It may be harmful by inhalation, ingestion, or skin absorption. Wear appropriate
gloves and safety glasses. 

Radioactive substances: When planning an experiment that involves the use of radioactivity, include the
physicochemical properties of the isotope (half-life, emission type and energy), the chemical form of the radioactivity,
its radioactive concentration (specific activity), total amount, and its chemical concentration. Order and use only
as much as really needed. Always wear appropriate gloves, lab coat, and safety goggles when handling radioactive
material. X-rays and gamma rays are electromagnetic waves of very short wavelengths either generated by
technical devices or emitted by radioactive materials. They may be emitted isotropically from the source or may be
focused into a beam. Their potential dangers depend on the time period of exposure, the intensity experienced,
and the wavelengths used. Be aware that appropriate shielding is usually of lead or other similar material. The
thickness of the shielding is determined by the energy(s) of the X-rays or gamma rays. Consult the local safety
office for further guidance in the appropriate use and disposal of radioactive materials. Always monitor
thoroughly after using radioisotopes. A convenient calculator to perform routine radioactivity calculations can be
found at: http://www.graphpad.com/calculators/radcalc.cfm
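
As an illustration of the kind of routine calculation such a calculator performs, the following minimal Python sketch corrects an activity for decay using the standard half-life relation A = A0 × (1/2)^(t / t½). The function name and the example half-life are assumptions introduced here for illustration and are not taken from this manual or from the calculator above; always use the half-life given on the isotope's data sheet.

```python
def remaining_activity(initial_activity, half_life_days, elapsed_days):
    """Return the activity left after elapsed_days.

    Uses A = A0 * (1/2) ** (t / t_half). The result is in the same
    units as initial_activity (for example, microcuries).
    """
    if half_life_days <= 0:
        raise ValueError("half-life must be positive")
    if elapsed_days < 0:
        raise ValueError("elapsed time must not be negative")
    return initial_activity * 0.5 ** (elapsed_days / half_life_days)


if __name__ == "__main__":
    # Assumed example: 32P, half-life of roughly 14.3 days.
    half_life = 14.3
    for day in (0, 7, 14.3, 28.6):
        left = remaining_activity(250.0, half_life, day)
        print(f"day {day:>5}: {left:.1f} uCi remaining")
```

A calculation like this may help when deciding how much material to order, in keeping with the advice above to order and use only as much as is really needed.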

Resins are suspected carcinogens. The unpolymerized components and dusts may cause toxic reactions, including
contact allergies with long-term exposure. Avoid breathing the vapors and dusts. Wear appropriate gloves and
safety glasses and always use in a chemical fume hood. Sensitivity to these chemicals may develop with repeated
contact.

SDS, see Sodium dodecyl sulfate

Silane is extremely flammable and corrosive. It may be harmful by inhalation, ingestion, or skin absorption. Keep
away from heat, sparks, and open flame. The vapor is irritating to the eyes, skin, mucous membranes, and upper 
respiratory tract. Wear appropriate gloves and safety goggles and always use in a chemical fume hood. 


Silica is an irritant and may be harmful by inhalation, ingestion, or skin absorption. Wear appropriate gloves and 
safety glasses. Do not breathe the dust.


Silver nitrate, AgNO₃, is a strong oxidizing agent and should be handled with care. It may be harmful by
inhalation, ingestion, or skin absorption. Avoid contact with skin. Wear appropriate gloves and safety glasses. It can
cause explosions upon contact with other materials.

Sinapinic acid may be harmful by inhalation, ingestion, or skin absorption. Wear appropriate gloves and safety 
glasses. 


Sodium acetate, see Acetic acid 

Sodium azide, NaN₃, is highly poisonous. It blocks the cytochrome electron transport system. Solutions containing
sodium azide should be clearly marked. It may be harmful by inhalation, ingestion, or skin absorption. Wear
appropriate gloves and safety goggles and handle it with great care. Sodium azide is an oxidizing agent and should 
not be stored near flammable chemicals. 

Sodium borodeuteride is a flammable solid, corrosive, and causes burns. It is water-reactive and is harmful by 
inhalation, ingestion, or skin absorption. Wear appropriate gloves and safety goggles and use in a chemical fume 
hood. 

Sodium carbonate, Na₂CO₃, may be harmful by inhalation, ingestion, or skin absorption. Wear appropriate gloves
and safety glasses and use in a chemical fume hood. 

Sodium citrate, see Citric acid 

Sodium dihydrogen phosphate, NaH₂PO₄ (sodium phosphate, monobasic), may be harmful by inhalation,
ingestion, or skin absorption. Wear appropriate gloves and safety glasses and use in a chemical fume hood.

Sodium dodecyl sulfate (SDS) is toxic, an irritant, and poses a risk of severe damage to the eyes. It may be harmful
by inhalation, ingestion, or skin absorption. Wear appropriate gloves and safety goggles. Do not breathe the
dust.

Sodium fluoride, NaF, is highly toxic and causes severe irritation. It may be fatal by inhalation, ingestion, or skin 
absorption. Wear appropriate gloves and safety glasses and use only in a chemical fume hood. 

Sodium hydrogen phosphate, Na₂HPO₄ (sodium phosphate, dibasic), may be harmful by inhalation, ingestion,
or skin absorption. Wear appropriate gloves and safety glasses and use in a chemical fume hood. 

Sodium hydroxide, NaOH, and solutions containing NaOH are highly toxic and caustic and should be handled 
with great care. Wear appropriate gloves and a face mask. All other concentrated bases should be handled in a sim- 
ilar manner. 

Sodium hypochlorite, NaOCl, see Bleach 

Sodium iodide, NaI, may be harmful by inhalation, ingestion, or skin absorption. Wear appropriate gloves and
safety glasses and use in a chemical fume hood. 

Sodium nitrate, NaNO₃, may be harmful by inhalation, ingestion, or skin absorption. Wear appropriate gloves
and safety glasses and use in a chemical fume hood. 

Sodium orthovanadate, Na₃VO₄, may be harmful by inhalation, ingestion, or skin absorption. Wear appropriate
gloves and safety glasses and use in a chemical fume hood.

Sodium phosphate, NaH₂PO₄/Na₂HPO₄/Na₃PO₄, is an irritant to the eyes and skin. It may be harmful by
inhalation, ingestion, or skin absorption. Wear appropriate gloves and safety goggles. Do not breathe the dust.

Sodium toluene sulfinate, see Toluenesulfonic acid 

Streptomycin is toxic and a suspected carcinogen and mutagen. It may cause allergic reactions. It may be harmful
by inhalation, ingestion, or skin absorption. Wear appropriate gloves and safety glasses.

Succinic anhydride is a possible mutagen and is a severe eye irritant. It may be harmful by inhalation, ingestion, 
or skin absorption. Wear appropriate gloves and safety glasses and use only in a chemical fume hood. Do not
breathe the dust. 

Sulfosalicylic acid (dihydrate) is extremely destructive to the mucous membranes and respiratory system. Do not 
breathe the dust. Wear appropriate gloves and safety glasses and use only in a chemical fume hood. 

Sulfuric acid, H₂SO₄, is highly toxic and extremely destructive to tissue of the mucous membranes and upper
respiratory tract, eyes, and skin. It causes burns, and contact with other materials (e.g., paper) may cause fire. Wear
appropriate gloves, safety glasses, and lab coat and use in a chemical fume hood.

SYPRO Orange/Red/Ruby contains DMSO. See DMSO.

TAME, see N-α-Tosyl-L-arginine methyl ester hydrochloride

TBP, see Tributylphosphine 

TCEP, see Tris(carboxyethyl)phosphine hydrochloride

TEMED, see N,N,N',N'-Tetramethylethylenediamine

Tetrabutylammonium thiocyanate may be harmful by inhalation, ingestion, or skin absorption. Wear appropri- 
ate gloves and safety glasses. 

N,N,N',N'-Tetramethylethylenediamine (TEMED) is highly caustic to the eyes and mucous membranes and may
be harmful by inhalation, ingestion, or skin absorption. Wear appropriate gloves and tightly sealed safety goggles.

TFA, see Trifluoroacetic acid

Thiourea may be carcinogenic and may be harmful by inhalation, ingestion, or skin absorption. Wear appropriate
gloves and safety glasses and use in a chemical fume hood.

TMAO, see Trimethylamine N-oxide 

Toluene, C₆H₅CH₃, vapors are irritating to the eyes, skin, mucous membranes, and upper respiratory tract.
Toluene can exert harmful effects by inhalation, ingestion, or skin absorption. Do not inhale the vapors. Wear
appropriate gloves and safety glasses and use in a chemical fume hood. Toluene is extremely flammable. Keep away 
from heat, sparks, and open flame. 

Toluenesulfonic acid is very corrosive, causes burns, and is extremely destructive to the upper respiratory tract.
It may be harmful by inhalation, ingestion, or skin absorption. Wear appropriate gloves and safety glasses and use
only in a chemical fume hood.

N-α-Tosyl-L-arginine methyl ester hydrochloride (TAME) may be harmful by inhalation, ingestion, or skin
absorption. Wear appropriate gloves and safety glasses and use only in a chemical fume hood.

N-Tosyl-L-phenylalanine chloromethyl ketone (TPCK) may be harmful by inhalation, ingestion, or skin
absorption. Wear appropriate gloves and safety glasses and use only in a chemical fume hood.

Tributylphosphine (TBP) may be fatal. It may be harmful by inhalation, ingestion, or skin absorption. Wear
appropriate gloves and safety goggles and use in a chemical fume hood. Do not breathe the vapor.

Triethanolamine may be harmful by inhalation, ingestion, or skin absorption. Wear appropriate gloves and
safety glasses and use only in a chemical fume hood.

Trifluoroacetic acid (TFA) (concentrated) may be harmful by inhalation, ingestion, or skin absorption.
Concentrated acids must be handled with great care. Decomposition causes toxic fumes. Wear appropriate gloves 
and a face mask and use in a chemical fume hood. 

Trimethylamine N-oxide (TMAO) causes eye irritation and may be harmful by inhalation, ingestion, or skin
absorption. Wear appropriate gloves and safety goggles. 

Tris may be harmful by inhalation, ingestion, or skin absorption. Wear appropriate gloves and safety glasses.

Tris(carboxyethyl)phosphine hydrochloride (TCEP) is corrosive to the mucous membranes, upper respiratory
tract, eyes, and skin and can cause burns. It may be harmful by inhalation, ingestion, or skin absorption. Wear
appropriate gloves and safety glasses and use in a chemical fume hood. Do not breathe the vapor or mist.

Triton X-100 causes severe eye irritation and burns. It may be harmful by inhalation, ingestion, or skin
absorption. Wear appropriate gloves and safety goggles.

Urea may be harmful by inhalation, ingestion, or skin absorption. Wear appropriate gloves and safety glasses.

UV light and/or UV radiation is dangerous and can damage the retina. Never look at an unshielded UV light
source with naked eyes. Examples of UV light sources that are common in the laboratory include hand-held lamps
and transilluminators. View only through a filter or safety glasses that absorb harmful wavelengths. UV radiation
is also mutagenic and carcinogenic. To minimize exposure, make sure that the UV light source is adequately
shielded. Wear appropriate protective gloves when holding materials under the UV light source.

4-Vinylpyridine may be fatal by inhalation, ingestion, or skin absorption. Wear appropriate gloves and safety
goggles. Use only in a chemical fume hood. Do not breathe the vapors.

Xylene is flammable and may be narcotic at high concentrations. It may be harmful by inhalation, ingestion, or 
skin absorption. Wear appropriate gloves and safety glasses and use only in a chemical fume hood. Keep away from
heat, sparks, and open flame. 

Xylene cyanol, see Xylene 

Zinc chloride, ZnCl₂, is corrosive and poses possible risk to the unborn child. It may be harmful by inhalation,
ingestion, or skin absorption. Wear appropriate gloves and safety glasses. Do not breathe the dust.

Suppliers


WITH THE EXCEPTION OF THOSE SUPPLIERS LISTED IN THE TEXT with their addresses, all
suppliers mentioned in this manual can be found in the BioSupplyNet Source Book
and on the Web Site at:


http://www.biosupplynet.com 


If a copy of the BioSupplyNet Source Book was not included with this manual, a 
free copy can be ordered by any of the following methods: 


* Complete the Free Source Book Request Form found at the Web Site at:
http://www.biosupplynet.com
* E-mail a request to info@biosupplynet.com
* Fax a request to 1-919-659-2199.

577-879


recovery with solid-phase sequencer,
structures, 842
blocking, 300, 307-308
840
Antibody arrays
direct labeling, 698-699
printing, 785-786
processing, 786
sandwich assay, 698-700, 784-786
Apparent molecular-mass determination, 77-78
Aprotinin, 104, 114, 141
Aptamers, 29
properties, 843
structure, 842
antibody arrays, 698-700
direct labeling, 698-699
printing, 785-786
processing, 786
sandwich assay, 698-700, 784-786
783
structure, 842
structure, 842
Asp-N, 357
Association constant, determination of the, 884-886
ATH. See Alkylated thiohydantoin
Atmospheric sampling glow discharge ionization, 571
Aurodye staining, 358
Average mass, 433
Avidin affinity capture, 523, 529-531
824-825
overview, 815-817
protein expression measurements, 828-829
Blocking, amino terminal, 300, 307-308, 344
BLOCKS database, 810, 814
BLOSUM62, 798
Blue Native PAGE (BN-PAGE)
Bomb, 499
Bottom-up protein characterization, 567
labeling, 780
BRITE database, 824-825
Bromelain, 353
CAD. See Collision-activated dissociation
CAFASP, 813
analysis of results, 888-890
blank experiments, 887
limitations of, 883-884
measuring intermolecular interactions technique, 890
sample preparation, 888
CaMK-1 insulin receptor kinase, 8
Capacity factor (k'), 224, 274, 277-278
Capillary columns for RP-HPLC, packing, 242-248
320-321
examples of chromatograms, 312, 314-317
protocol, 332-335
CASP, 812-813
Cathepsin C, 354
Cation exchange, cleanup of ICAT samples, 528
CBP. See Calmodulin-binding peptide
CCD. See Charge-coupled device
tetrafluoroborate
detergent fractionation of monolayer, 133-134
for immunoblotting, 110-111
Cell mapping
affinity capture methods, 15
Cellular complexity, origins of, 6-12
CFP. See Cyan fluorescent protein
CHAPS, for protein solubilization, 148
CHAPSO, for protein solubilization, 148
details, 465
materials, 556
multiple endoproteinase reactions, 566
Collision-activated dissociation (CAD), 442, 568
Collision-induced dissociation (CID), 19, 27, 876
comparison in collision cell and in-source, 616
detecting and selecting peptides for, 612-613
Coprecipitation, 15
by, 94
Coupled proton transfer process, 887-888
Coupling buffer, 329
Covalent modifications, 7
Cross-linking
choice of cross-linker, 682
difficulty of technique, 683-684
disulfide bond cleavage with performic acid, 347-348, 370-371
MS use with, 725, 728-733
optimization of reaction, 726-728
information, 681-684
site-specific protein-DNA photo-cross-linking
advantages of on-bead cross-linking, 686, 742
case study, 756
site-specifically derivatized DNA frag-
identification of cross-linked proteins, 756
nuclease digestion, 755
overview, 685-687, 741-742
preparation and UV irradiation of protein-DNA complex, 752-754
subsecond photo-cross-linking using water-soluble metal complexes, 734-740
case study, 739-740
materials, 734-735
method, 735-737
optimization and precautions, 738
topology analysis, 725-733
CTAB-PAGE protocol, 78-80
PAGE pore size and, 46
sample buffer, 79
Cyan fluorescent protein (CFP), 687, 757-765
Cyanine-3 (Cy3) dye, 775, 779
456, 462, 478, 546
Cyanogen bromide, 344, 348, 353
cleavage, 361
overview, 362
protocol, 249, 385-387
reaction mechanism, 362
fragments of proteins, 257
purity, 387
Cysteic acid, 547
Cysteine. See also Disulfide bonds
affinity tagging, 520-523
properties, 843
structure, 842
complexes
Data analysis, FRET, 763-765
824
available on BLAST Web server, 800
metabolic and signaling pathway, 824-825
protein interaction, 10-11, 823-824
searching, 467-477, 506-507, 874
DCI. See 3,4-Dichloroisocoumarin
DDF. See Differential detergent fractionation
Dead volume, 240, 267, 409
Deblocking, 307-308
Degassing solvents, 133-184, 238, 864, 891
Denaturing loading buffer, 745
133
of DNA fragments, 747-748
of phosphopeptides, 605-606, 612-615
Desalting, 376-377
of peptides and protein mixtures by RP-HPLC techniques, 265-268
peptides prior to MS, 454-456
proteins prior to MS, 454
Detergents, for protein solubilization, 148. See also Differential detergent fractionation
Dextran, PAGE pore size and, 46
DGS, 810
DIABLO protein, 707
Dialysis, 149, 374
DIEA. See Diisopropylethylamine
Differential centrifugation, cell purification by,
Differential detergent fractionation (DDF), 126-137
analysis using 2D gel electrophoresis, 134-135
cell preparation, 131
degradation studies, use in, 135
examples of cellular analysis using, 127
method for monolayer cell cultures, 133-134
method for suspension cell cultures, 131-133
overview, 101, 126-128
precipitation of tubulins and MAPs using magnesium, 137
protein distribution in hepatocyte fractions, 127
reagents, 128-130
RNA isolation, 127-128, 136-137
Differential in-gel electrophoresis (DIGE)
Discontinuous gels, 41-43
Disulfide bonds
structure, 349
site-specific protein-DNA photo-cross-linking
advantages of on-bead cross-linking, 686, 742
case study, 756
site-specifically derivatized DNA frag-
identification of cross-linked proteins, 756
nuclease digestion, 755
DNA-binding proteins, 165
DNP (dinitrophenyl group)-labeling method, 288
structure prediction, 808, 810-811
transmembrane, 811
Dom Fish server, 810
Dounce homogenizer, 103, 105
DTE. See Dithioerythritol
DTT. See Dithiothreitol
EBI database, 468
ECD. See Electron capture dissociation
EcoCyc/MetaCyc database, 824-825
diimide hydrochloride), 329-350, 725
Edman degradation
amino-terminal blocking, 300, 307-308
electroblotting samples for, 320
history, 290
overview, 288-292
phase sequencer, 309-310
phosphoprotein and phosphopeptide analysis,
sample preparation
sample purity, 298, 300, 345
318-319
using spinning-cup sequencer, 291-293, 295
EDTA (ethylenediaminetetraacetic acid)
in digitonin extraction buffer, 129
in protein extract preparation, 94
in silver staining stop solution, 197
Elastase, 353-354
chemical printer and, 463-465, 556-566
history of, 320
losses during, 306
molecular scanner and, 463, 544-555
blotted proteins, 402-404
sequence analysis and, 306-307
troubleshooting, 323
Electro-osmotic flow (EOF), 862
al gel electrophoresis
native agarose gel electrophoresis, 686, 713-716
nonequilibrium pH gradient electrophoresis
solid-phase extraction capillary electrophoresis (SPE-CE), 612-613
Electrophoresis buffer
ionic strength, 51
pH, 50-51
Electrospray ionization (ESI), 16, 19, 27
advantages and disadvantages, 440
history of development, 430
hypothetical spectra example, 438
IMAC with, 608, 610-612
positive-ion mode, 604-605
microcapillary HPLC integration with, 487-495
microsprayer setup, 491
overview, 436
phosphoprotein analysis, 603
on-line micro-IMAC, 640
protocol, 641-642, 651-635
sample cleanup for, 452-453
schematic representation of process, 438
TOF, 445
Electrotransfer buffers, 321
Elements, isotope and natural abundance, 435
Ellman's reagent, 350, 365, 378-379
Elongins B and C, 712
Elution time, 493
Emotifs database, 814
Metabolism (EcoCyc), 10
Endoproteinase
Arg-C, 354
Asp-N, 353-354, 358
Glu-C, 354, 414
Lys-C, 353-354, 358, 510
Endoproteinases, 354, 358. See also Proteases
Enthalpy of binding, 882-888
Entropy, 882-883
Enzymes and Metabolic Pathways (EMP) database, 10, 825
EOF. See Electro-osmotic flow
Epifluorescence microscope, 758
Epitope mapping, 364
Epitope tags, 676. See also FLAG-epitope-tagged proteins
Equilibration buffer, 327
Escherichia coli, 673
disruption by nitrogen cavitation, 115
enzymatic disruption of cells, 119-120
expression systems, inclusion bodies of, 92
inclusion bodies, 97
lysates for 2D gel electrophoresis, 168-169
molecular scanner analysis of proteins, 554-555
peptide mass fingerprint analysis of, 561-504
RNA polymerase transcription initiation, 756
solubilization of recombinant proteins from inclusion bodies, 121-123
ESI. See Electrospray ionization
in RP-HPLC, 224, 227, 238
in silver stain fixation solution, 193
in capping reagent, 546
interference with sequencing, 301
Ethyl acetate, 290, 313, 335
Ethylenediaminetetraacetic acid. See EDTA
Ethylmorpholine, in electroblotting buffer, 321
Ethylmorpholine acetate buffer, 373 
E-values, BLAST, 802 
Exopeptidase, 354 
ExPASy, 473 
ExPASy Peptident, 560 
Expressed sequence tag (EST), 467-468, 670, 
804, 825 
Expression 
protein expression measurements, 828-829 
spatial expression patterns, 828 
Expression proteomics, 3, 5
Extraction solution, 163, 165, 168 
Extracts, protein 
DDF, 126-137
analysis using 2D gel electrophoresis, 134- 
135 
cell preparation, 131 
degradation studies, use in, 135 
examples of cellular analysis using, 127 
method for monolayer cell cultures, 133- 
134 
method for suspension cell cultures, 131— 
133 
overview, 101, 126—128 
precipitation of tubulins and MAPs using 
magnesium, 137 
protein distribution in hepatocyte fractions, 
127 
reagents, 128—130 
RNA isolation, 127—128, 136-137 
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preparation from bacterial celis 
for 2D gel electrophoresis, 168-169 
large-scale extraction, 119-120
lysis methods 
enzymatic, 96 
French Press, 96 
glass bead vortexing, 96 
grinding cells, 96 
nitrogen cavitation, 96, 112-116
sonication, 96 
overview, 95—96 
small-scale extraction, 117—118 
solubilization of inclusion bodies, 121-123 
preparation from mammalian cultured cells
disruption by nitrogen cavitation, 112-116 
lysates for 2D gel electrophoresis, 165-167 
lysis conditions, 95 
lysis for immunoblotting, 110-111 
lysis for immunoprecipitation 
method for cells grown as monoculture,
108 
method for cells grown in suspension, 
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reagents, 107—108 
overview, 94-95 
radioisotope labeling, 167 
preparation from mammalian tissues 
cell purification methods, 94 
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method, 195-106 
reagents, 102-104 
overview, 93-94
rat liver protein for 2D gel electrophoresis,
163-164 
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trophoresis, 163-164 
proteolytic degradation, 92, 95 
subcellular fractionation 
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Fast atom bombardment ionization (FAB), 430 
FASTA, 798 
Ferric chloride, 624-625, 631, 633 
Fibroblast growth factor (FGF), 14
Ficoll, PAGE pore size and, 46
Filter paper chromatography, 288 
Find Protein, 475 
Fixation solution
for fluorescent staining, 199-200
for silver staining, 193-194, 196
FLAG-epitope-tagged proteins, 677, 701-707
case study, 707 
cell lysis, 704 
concentration of proteins, 706—707 
enrichment, large-scale, 705 
immunoprecipitation for small-scale analysis,
705
materials, 701-703
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steps, summary diagram of, 703 
transfection, 704 
Flow cell, 243, 409
Fluorescence gel-retardation assay, 767-768
Fluorescence polarization assay, 769 
Fluorescence probes, 63-64 
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SY PRO Ruby, 71-72, 157, 199-200 
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Fluorophores, for FRET, 687-690 
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Fold recognition, 812-813 
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Formamide, in denaturing loading buffer, 743 
Formic acid, 553, 387, 626-627, 629-630 
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CR? mass spectrometer, 18, 443 
phosphoprotein analysis, 622 
top-down protein characterization, 568-569 
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Fragmentation of proteins 
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overview, 360—361 
reagents, table of, 353-382
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reagents, table of, 333 
with trypsin, 380-382 
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b, ions, ^80, 381 
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y, lons, 380—582 
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French Press method of cell lysis, 96 
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fluorophores for, 687—690 
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applications uf, 689 
benefits of FRET, 688, 690 
data analysis, 763-765 
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image acquisition, 762-764 
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materials, 757—758 
overview, 686-690 
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symbols and their interpretation, 760 
Frit, column, 243, 247
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domain-based search, 818—819 
expression databases, 825-826 
metabolic and signaling pathway databases, 
824-825 
mutant phenotypes, 829
operon identification, 820—821 
overview, 815—817 
protein expression measurements, 828-829
protein interaction databases, 823-824 


spatial expression patterns, 827 
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molecular, 7 
predicting protein structure and function, 
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annorated domain databases, 814 
domain structure prediction, 808, 810
fold recognition, 812-813
secondary structure prediction, 811
sequence signals of protein function, 813-814
tertiary structure prediction, 812-814 
transmembrane domain prediction, 811 
Functional genomics, 2 
Functional proteomics
definition, 3
strategies for identification and analysis of
proteins, 13
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Cus- phase sequencer, 294- 296 
Gaussian modeling, 161
G-CSF, 14
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Gel electrophoresis See specific types 
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concentrating acrylamide, $26--328 
reduction/alkylation and proteolytic digestion
of, 360, 398-399
Gel staining solution, 713
Gel storage solution, E77 
GelCode Phosphoprotein staining kit, 155-157, 
201-202 
GELoader tips, 460-462, 482-484, 486, 627,
629-630
Gel-retardation assay, of protein-protein inter-
action, 767-768
GenBank, 360, 804 
Gene expression databases, 825-826 
Gene identification, 798-807
orthologs 
importance of, 804 -803 
rapid identification by method of bidirec-
tional best hits, 806-807
sequence database searching 
overview, 798 
using Gapped-BLAST, 799
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GeneNet database, 825 
Genome sequencing, 1, 5-6
Genomics, functional, 2 
GenPept database, 468, 799, 802-803 
GenTHREADER, 412
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interactions with, 766—771 
Gibbs energy of binding, 882-883
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Of 7 
Glass plates, cleaning, 54, 182
Global profiling proteomics, 3, 5
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description, 457 
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Cilutamic acid 
phosphoglutamate, 599 
properties, 843 
structure, 842 
Glutamine 
properties, 844
structure, 842
Glutaraldehyde sensitization solution, 193-194
Glutathione 
to avoid blocking in gels, 308 
for reducing proteins, 348
in urea solubilization buffer, 122-123 
Glutathione-S thioreductase, 675-677
Glutathione-S-transferase, 97
Glycerol 
interference with sequencing, 301
in protein microarray samples, 693 
in SDS equilibration buffer, 185 
in sulfhydryl blocking solution, 547 
Glycinamide, a, ion of, 579 
Glycine, 46
in electroblotting buffer, 321
interference with sequencing, 301 
in Laemmli electrophoresis running buffer, 
327 
properties, 843 
in regeneration buffer, 529
in SDS electrophoresis buffer, 187 
structure, 842 
in transfer buffers, 203, 206 
Glycine, a, ion of, 57 
Glycosylation, 561 
Glycyl betaine, 98 
GM-CSF, 14
Gold, staining membrane-bound proteins with
colloidal, 208, 212 
GOLD (Genomes OnLine Database), 1, 5
Gradient SDS-PAGE gels, 181-184 
Granulocyte colony-stimulating growth factor 
(G-CSF), 14
Granulocyte-macrophage colony-stimulating
growth factor (GM-CSF), 14
Graphite powder microcolumns, 453, 458—453 
Green fluorescent protein (GFP), 687, 766—771 
Grinding of cells with abrasive materials, 96 
Guanidine hydrochloride 
in alkylation buffer, 372 
for denaturation prior to reduction/alkylation, 
351 
as humectant, 694 
for inclusion body solubilization, 94, 121-122 
in protein microarrays, 694 
quality of, 351—352 
in reduction buffer, 373 
trypsin digestion and, 380, 382 
Guanidinium isothiocyanate, in RNA isolation 
from detergent extracts, 136


HCCA. See Cyano-4-hydroxy-cinnamic acid

HCl. See Hydrochloric acid

Heat. See Isothermal titration calorimetry 

Heat capacity change of binding, 887 

Heat of dilution experiments, 887

Heat shock protein 60, 304, Mn 

Helicobacter pylori, 675, 823

Helium gas, in RP-HPLC, 259 

Helix predictors, 811

Hepatocytes, protein distribution in detergent 
fractions of, 127


HEPES buffer, 693, 776
Heptafluorobutyric acid (HFBA), 387
in ESI-MS solvent, 641
in RP-HPLC, 227-231, 514, 532 
High-performance capillary electrochromatog- 
raphy (HPCEC), 862-868 
analytical RPCEC, 866 
background electrolyte, preparation of, 864 
case study, 868
electrochromatographic conditions, selection
of, 866
instrument setup, 863 
isocratic separation of peptides and proteins, 
86^ 
materials, 863 
protocol, 864-867 
testing instrument, 865-866 
High-performance capillary electrophoresis 
(HPCE), 253
High-performance liquid chromatography. See 
HPLC 
Histidine 
phosphohistidine, 599, 658-660
properties, 843
structure, 842 
Histidine phosphate, localization of, 658— 
660 
Homogenization 
aims of, 100 
in DDF, 132 
of mammalian tissue
equipment, 103, 105
method, 105-106
mucin removal from homogenate, 106
reagents, 102-104
methods, table of, 93 
Homogenization buffers A and B, 102 
Homogenization medium, for nitrogen cavita- 
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Horrogemzers, 103, 105. See alsa specific homog- 


entzers 
Homologous proteins, 804 
Homoserine, 362 
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trochromatography 
HPCZE, See High-performance capillary elec- 
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HPLC. See also Reversed-phase high perfor- 
mance liquid chromatography 
equipment for gradient elution, 410 
microcapillary HPLC (μLC)
μLC-ESI-MS/MS
off-line micro-IMAC, 638-642
phosphoprotein analysis, 638-640 
preparation and use of columns, 487-495
PTH amino acid detection by, 292, 295
schematic diagram of system, 411
HT Analyzer software system, 160 
Human Genome Project, 670 
Human plasma proteomes, peptide mass finger-
printing of, 561, 564-565
Humectant, use in protein arrays, 692-693 
Hydrobromic acid, 370-371 
Hydrochloric acid (HCl)
in acid hydrolysis, 288 
chemical cleavage by, 353
for deblocking, 308 
Hydrophobicity, reversed-phase chromatogra- 
phy and, 220-221, 456—460 


Hydroxylamine cleavage
overview, 353, 362-364
protocol, 388-389


IAA. See Iodoacetic acid
ICATs. See Isotope-coded affinity tags
ICPMS. See Inductively coupled plasma mass
spectrometer
IDA. See Iminodiacetic acid
Identification and analysis of proteins 
amino acid sequence determination, 10-19
in complexes, 15
low-abundance proteins, 14
overview, 12-19
separation strategies, 12-14 
IEF. See Isoelectric focusing (IEF)
IEF buffer, 701
IEF-strip equilibration buffer, 356, 558
IgBLAST database, 798
[1 -6, 5365- 369 
IMAC (immobilized metal ion affinity chroma- 
tography) 
column and pressure cell, diagram of, 610 
column regenerating media, 60 
phosphopeptide enrichment, 24
of phosphoproteins 
alkaline phosphatase digestion, 634
carboxypeptidase digestion, 534
column preparation, 654
Fe(III) or Ga(III) use, 624-626
MALDI-TOF-MS and, 607, 627, 631-637, 639
microcolumn preparation and use, 627, 639
μLC conditions and, 610
nanoscale, 626-627
off-line micro-IMAC enrichment, 608-612,
638-640
on-line micro-IMAC enrichment, 812
overview, 60) 612 
recovery by bel flL]- MAU of phosphopro- 
tein isolated by gel electrophoresis, o9 


resin 
loading column with, 626
preparation of, 625-626
Image acquisition 
from 2D gels, 158-160
CCD camera, 134
densitometer, laser-based, 159-160
document scanner, 159 
FRET, 762- 763 
Image visualization and manipulation, to) 
ImageMaster 2D software system, 160
Imidazole, 137, 394-395
Iminodiacetic acid, 608
Immobilization buffer, 708-716
Immobilized enzyme microcolumns, prepara-
tion of, 462, 485-486
Immobilized metal ion affinity chromatography.
See IMAC
Immobilized pH gradient. See IPG gels
Immobilon CL}, 463 
Immobilon P= membrane, 558 
Immunoblotting 
lysis conditions and, 95 
lysis of cultured animal cells, yeast, and bacte-
ria for, 110-111
Immunoprecipitation
lysis conditions and, 95
lysis of cultured cells for, p - pen
of phosphoproteins, 607-608
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Immunoprecipitation (continued) 
for protein-protein interaction, detection of, 
675 
Impurities, 2D gel electrophoresis and, 149—150 
In vivo isotopic labeling of proteins for quanti-
tative proteomics, 534-539
Inclusion bodies, 92 
advantages, 98 
prevention of formation, 97-98 
recovery of recombinant protein from, 97-98 
solubilization of recombinant proteins from, 
121-123
India ink, staining membrane-bound proteins 
with, 208, 21! 
Inductively coupled plasma mass spectrometer 
(ICPMS), 622 
Injector, RP-HPLC, 243—245 
InstaStain Blue Gel Stain Paper stain, 69-70
Insulin, 288 
Integrated biology, 7—8 
INTERACT database, 824 
Interactome, 671 
Interferon, human, 14 
Interlogs, 823
Intermolecular interactions, measuring, 882—892 
International Human Genome Sequencing
Consortium, 5 
Intrasteric regulation, 8-9 
Iodoacetamide, 185, 373, 514, 557, R72
for alkylation of cysteine residues, 351 
in sulfhydryl blocking solution, 547 
Iodoacetic acid, 351, 373-375, 379 
Iodosobenzoic acid, 364, 390-391
Ion parking, 571
Ion-exchange chromatography, 5 
Ionic strength, electrophoresis buffer, 51
Ion/ion reactions, 569-571, 573
Ionization methods, 434-440
electrospray ionization 
advantages and disadvantages, 440 
hypothetical spectra example, 438 
overview, 436 
schematic representation of process, 438
MALDI, 435 
advantages and disadvantages, 440 
overview, 435—436 
schematic representation of process, 437 
Ion/molecule reactions, 573
Ion-pairing agents, 228-230, 369
IPG buffer, 165-166
IPG (immobilized pH gradient) gels, 172-176 
IEF, 151-152
strips, 172 
embedding in agarose, 188 
equilibration, 185-186 
rehydration of, 173-174 
Isocratic separations of peptides and proteins 
with CEC, 865 
Isoelectric focusing (IEF), 5, 151-152, 172-176
in 2D gel electrophoresis, 144, 146-147 
DDF and, 134-135 
in IEF-dedicated electrophoresis unit, 173, 
175-176 
immobilized pH gradients, 151-152 
interfering substances, 149 
in multipurpose flatbed electrophoresis unit, 
173-175 
overview, 151-152 
rehydration of IPG strips, 173-174 


Isoelectric point (pl), 151, 161, 680 
Isoleucine 
properties, 843 
structure, 842 
Isopropanol, 55, 224 
Isotachophoresis, 52 
Isothermal titration calorimetry, 882-892 
advantages of, 885 
amount of sample required, 888 
analysis of results, 888—890 
information available, 884—888 
blank experiments, 887 
coupled proton transfer process, 887—888 
enthalpy of binding, 886-887
heat capacity change of binding, 887 
simultaneous determination of association 
constant and enthalpy of binding, 
884-886
limitations of, 883-884
measuring intermolecular interactions tech-
nique, 890
materials, 890 
method, 891-892 
overview, 882—883 
principle of technique, 884 
sample preparation, 888 
Isotope
labeling
stable isotope dilution technique, 20-21
in vitro and in vivo, 21 
mass and natural abundance, 435 
Isotope-coded affinity tags (ICATs), 21, 524-
533, 615
avidin column use, 529—531 
cation exchange cleanup of samples, 528 
labeling proteins, 526-527 
MS analysis of peptides, 532 
overview, 524
reagent calculation, 527 
strategy diagram, 525 


Japan database, 468 
JPRED, 811 


KCl. See Potassium chloride
KEGG database, 11, 824—825 
Kinase buffer, 782 
Kinases 
protein microarray assay for, 698
regulation of, 8 
sequence specificity of, 600 
Web resources on, 600 
Klebsiella pneumoniae, 119
Kohlrausch boundary, 43 
Kyoto Encyclopedia of Genes and Genomes 
(KEGG), 11, 824-825 


Labeling 
biotin 
Biotin-HPDP, 520-522
of phosphopeptides, 24-25 
of site-specifically derivatized DNA frag- 
ment, 743-751 
ICATS, 524- 534 
for quantitative proteomics 
in vitro 
affinity capture cysteine-containing pro- 
teins, 466, 520-523 


isotope-coded affinity tagging, 524-533
overview, 466 
in vivo 
materials, 534-536
methods, 536-539
overview, 466, 534 
radiolabeling, 167 
phosphorylation site mapping, 600, 602 
32P-labeling
of cells, 167 
peptides, solid-phase microsequencing of, 
309-310, 329-331 
phosphoprotein and phosphopeptide 
analysis, 22-23, 26 
protein arrays, 782-783 
Lactone derivative coupling and sequencing, 387 
Laemmli electrophoresis running buffer, 327 
Laemmli gels, 41-42, 44-46 
Laemmli sample buffer, 110, 117-118 
LCQ Xcalibur software, 503-504
Leucine 
properties, 843 
structure, 842 
Leupeptin, 104, 119, 151 
LIGAND database, 824-825
Ligands, protein state alteration and, 7 
Light source for photopolymerization, 81 
Linear gradient gels, 59 
Linear regression analysis, 274, 278 
Linear Solvent Strength theory, 274 
Lipids, as impurities in protein samples for 2D 
gel electrophoresis, 150 
Liquid chromatography. See also RP-HPLC
coupling MS/MS to 1D-LC protocol, 496-507
coupling MS/MS to 2D-LC
overview, 450—451 
protocol, 508-513 
microcapillary HPLC (μLC)
μLC-ESI-MS/MS
off-line micro-IMAC, 638-642
phosphoprotein analysis, 638-640 
preparation and use of columns, 487-495 
microscale LC cleanup of protein samples
improves value of mass spectra, 
451-453 
nanoliter-LC, 450, 496-507
coupled to nano-LC-MS/MS, 496-507
column preparation, 497—500 
HPLC programming, 503 
ion-source setup, 502
loading sample onto column, 501 
materials, 496—497 
MS analysis, 503-505
S. cerevisiae proteome analysis, 507 
sample concentration, 500-501 
separation of phosphoproteins, 652-654
overview, 449 
sensitivity, 450 
Liquid secondary ion mass spectrometry 
(LSIMS), 430
Live Database of Interacting Proteins (LiveDIP),
10
Loading dye, for BN-PAGE, 718 
Lowry method, for protein concentration deter- 
mination, 134 
LSIMS. See Liquid secondary ion mass spec-
trometry 
Lyophilization, 227-278 
Lys-C, 356 


Lysine 
phospholysine, 599
properties, 845
structure, 847

Lysis buffer 
for bacterial cell lysis, 119 
protease inhibitor addition to, 95 
for protein extraction from mammalian cells, 

95 

Lysosomal proteases, 94 

Lysozyme, 96, 120

Lysyl side chains 
citraconylation of, 384 
succinylation of, 383 


Madin-Darby canine kidney (MDCK) cells, 
fractionation of, 128 
Magnesium acetate, in elution buffer, 743 
Magnesium-induced precipitation of tubulins 
and MAPs, 137 
Magnetic particle concentrator, 752-754 
MALDI (matrix-assisted laser desorption ion- 
ization), 16, 19, 27, 435. See also 
MALDI-TOF-MS
advantages and disadvantages, 440 
alkaline phosphatase treatment of phospho-
peptides before or after analysis, 628-
630
calibration standards, 478 
peptide mass, 479 
protein mass, 480 
of electroblotted proteins, 462—463 
excipients compatible with, 452 
general method of protein/peptide analysis, 
478-481
history of development, 430 
ionization efficiency of phosphopeptides in 
positive-ion mode, 604—605 
matrix choice, 479 
overview, 435—436 
phosphoprotein analysis, 603, 622 
protocol, 643—650 
sample cleanup 
excipients compatible with MALDI-MS, 
table of, 452 
microcolumns 
desalting and concentration of peptides 
and proteins, 454—456 
homemade and commercial compared, 
460-461 
immobilized enzyme, 462, 485-486
on-column enzymatic digestion, 462 
resins, 453 
reversed-phase, preparation of, 453, 
482—484 
sequential elution of peptides, 456 
use of increasing hydrophobic micro- 
columns, 456—460 
microscale- LC, 451—453, 487-495 
nanoliter-LC, 496 
sample cleanup for, 452-462
schematic representation of process, 437 
MALDI-MS. See MALDI 
MALDI-TOF-MS, 444—446. See also MALDI 
analysis of proteomes using the molecular 
scanner, 544
calibration standards, 478—480 
general method of protein and peptide analy- 
sis, 478—481 
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peptide mass fingerprinting, 465, 556-566 
phosphoprotein analysis, 604 
IMAC and, 612, 631—637 
Mammalian cultured cells, protein extract 
preparation from 
disruption by nitrogen cavitation, 112-116 
lysates for 2D gel electrophoresis, 165-167 
lysis conditions, 95 
lysis for immunoblotting, 110-111 
lysis for immunoprecipitation 
method for cells grown as monoculture, 108 
method for cells grown in suspension,
108-109 
reagents, 107-108 
overview, 94-95 
radioisotope labeling, 167 
Mammalian tissue, protein extract preparation 
from 
cell purification methods, 94 
choice of tissue, 94 
homogenization of tissue, 102—106 
equipment, 103, 105 
method, 105-106 
reagents, 102-104 
overview, 93-94 
rat liver protein for 2D gel electrophoresis, 
163-164 
MAP kinase ERK2, 8 
Mascot, 473, 475, 874 
Mass accuracy, 433 
Mass analyzer configurations, 443—442 
FT-ICR, 448 
quadrupole ion trap, 446—447 
QTOF, 444-445 
tandem-in-space, 443—446 
tandem-in-time, 446-448
TOF, 444
TOF/TOF, 446
triple quadrupole, 443-444 
Mass changes, due to posttranslational modifi-
cations of peptides and proteins, 840 
Mass spectrometry (MS), 3, 425-595 
2D chromatography combined with, 514—519 
RPC chromatography, 518-519
SCX chromatography, 517
analysis of the topology of protein complexes 
using cross-linking and, 725, 728—733 
charge state of multiply charged ion, calculat- 
ing, 439 
chemical printer 
details, 465 
equipment, photographs of, 464 
overview, 463, 465 
peptide mass fingerprinting 
analysis of E. coli proteome, 561-564 
analysis of human plasma proteome, 561, 
564-565 
materials, 556 
multiple endoproteinase reactions, 566 
protocol, 557—560 
CID, 19, 27 
compatible silver staining, 75-76, 155-156 
components of mass spectrometer, 427, 429 
ESI, 16, 19, 27 
advantages and disadvantages, 440 
history of development, 430 
hypothetical spectra example, 438 
IMAC with, 608, 610-612
ionization efficiency of phosphopeptides in 


positive-ion mode, 604-605
microcapillary HPLC integration with, 487-
495 
microsprayer setup, 491 
overview, 436 
phosphoprotein analysis, 603 
on-line micro-IMAC, 640
protocol, 641-642, 651-655
sample cleanup for, 452—453 
schematic representation of process, 438
TOF, 445 
essential elements of, 16, 18 
fragmentation mechanisms of protonated 
peptides in the gas phase 
a, ions, 577
glycinamide, 579 
glycine, 578 
glycylglycine, 579—580 
b, ions, 580-581 
b, ions, 580 
concepts, summary of important, 583 
future developments, 583 
nonsequence ions, 582
overview, 577 
y, ions, 580-582
FT-ICR mass spectrometer, 14, 448 
phosphoprotein analysis, 622 
top-down protein characterization, 568—569 
general principles, 427, 429 
history, 430 
identification methods used in proteomics, 19, 
27 
ionization methods, 434-440
ESI 
advantages and disadvantages, 440 
hypothetical spectra example, 438
overview, 436
schematic representation of process, 438
MALDI, 435 
advantages and disadvantages, 440
overview, 435-436 
schematic representation of process, 437 
labeling of proteins for quantitative proteomics
in vitro 
affinity capture cysteine-containing pro-
teins, 466, 520-523
isotope-coded affinity tagging, 524-533
overview, 466 
in vivo 
materials, 534—536 
methods, 536-539 
overview, 465, 534 
MALDI, 16, 19, 27, 435 
advantages and disadvantages, 440 
alkaline phosphatase treatment of phospho- 
peptides before or after analysis, 
628—630 
calibration standards, 478 
peptide mass, 479 
protein mass, 480 
of electroblotted proteins, 462-463
excipients compatible with, 452 
general method of protein/peptide analysis, 
478-481
ionization efficiency of phosphopeptides in 
positive-ion mode, 604-605 
matrix choice, 479 
overview, 435-436 
phosphoprotein analysis, 603, 622, 643-650
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Mass spectrometry (MS) (continued) 


sample cleanup
desalting and concentration of peptides and proteins, 454-456
excipients compatible with MALDI-MS, table of, 452
homemade and commercial compared, 460-461
immobilized enzyme, 462, 485-486
microscale-LC, 451-453, 487-495
nanoliter-LC, 496
on-column enzymatic digestion, 462
resins, 453
reversed-phase, preparation of, 453, 482-484
sequential elution of peptides, 456
use of increasing hydrophobic microcolumns, 456-460
sample cleanup for, 452-462
schematic representation of process, 437
MALDI-TOF-MS, 444-446
analysis of proteomes using the molecular scanner, 544
calibration standards, 478-480
general method of protein and peptide analysis, 478-481
peptide mass fingerprinting, 465, 556-566
phosphoprotein analysis, 604
IMAC and, 612, 631-637
molecular scanner for proteome analysis
application to analysis of E. coli proteins, 554-555
materials, 546-549
overview, 463, 544
protocol
data acquisition, 552
digestion of proteins, 551
protein identification, 552-556
separation of proteins, 544-545, 549-551
virtual map creation, 554
scheme, 545
MudPIT combined with, 508-519
overview, 426
peptide mass fingerprinting, 1%
of phosphopeptides, 27, 603-623
affinity-bound phosphoproteins, 617, 631-637
alkaline phosphatase treatment of phosphopeptides, 628-630
after MALDI-MS analysis, in-solution, 629-630
after MALDI-MS analysis, on-probe, 630
prior to MALDI-MS analysis, 629
derivatization of phosphopeptides, 605-606, 612-615
emerging techniques for, 622-623
enrichment of phosphopeptides, 607-615
IMAC, 607-612
by immunoprecipitation, 607-608
by ESI-MS, 651-655
histidine phosphate localization, 658-660
IMAC
Fe(III) or Ga(III) use, 624-626
MALDI-TOF-MS and IMAC combined, 627, 639
microcolumn preparation and use, 627, 634
nanoscale, 626-627
off-line micro-IMAC enrichment, 638-640
in-source CID, 619
men PRA circumventing prob-
by μLC-ESI-MS/MS, 641-642
by MALDI-MS, 643-650, 656-657
MALDI-TOF-MS and IMAC combined, 631
methods, 615-617
multidimensional strategies, 621
neutral loss scanning, 620
overview, 603-604, 615-617
precursor loss scanning, 620
problems associated with, 604-605
reporter ion, 618
phosphoprotein and phosphopeptide analysis, 22-23, 26
for posttranslational modification study, 21
precursor ion scanning, 27
protein arrays on membranes, 462-463
quadrupole time of flight, 444-445, 620, 622, 635
sample cleanup
excipients compatible with MALDI-MS, table of, 452
microcolumns
desalting and concentration of peptides and proteins, 454-456
homemade and commercial compared, 460-461
immobilized enzyme, 462, 485-486
on-column enzymatic digestion, 462
resins, 453
reversed-phase, preparation of, 453, 482-484
sequential elution of peptides, 456
use of increasing hydrophobic microcolumns, 456-460
microscale-LC, 451-453, 487-495
nanoliter-LC, 496
sample purity and, 345
search engines for protein identification using MS data, 506
history, 467
peptide fragment ion searches, 470, 474-477
peptide mass fingerprinting and, 468-470, 472-473
sequence databases, 467-468
summary, 477
stable isotope dilution technique, 20-21
steps in protein identification and characterization, 425
strategy for identification of proteins and their posttranslational modifications, 17
tandem MS (MS/MS), 18-19
automated 2D-LC-MS/MS for large-scale protein analysis, 869-876
coupling to 1D-LC, 449
coupling to 2D-LC, 450-451
data-dependent, 611-612
ESI and, 18-19
IMAC with, 608, 610-612
intermediate reaction events, 441-44?
mass analyzer configurations
FT-ICR, 448
quadrupole ion trap, 446-447
QTOF, 444-445
tandem-in-space, 443-446
tandem-in-time, 446-448
TOF, 444
TOF/TOF, 446
triple quadrupole, 443-444
nano-LC, coupled to, 496-507
phosphopeptide sequencing, 620-621
emerging techniques, 622-623
search engines, 470, 474-477
top-down protein identification and characterization
FT-ICR MS, 568-569
future developments, 572-573
ion/ion reactions, 569-571, 573
overview, 567-568
quadrupole ion-trap MS, 569-571
top-down protein identification and characterization
FT-ICR MS, 568-569
future developments, 572-573
ion/ion reactions, 569-571, 573
quadrupole ion-trap MS, 569-571
terminology, 431-434
mass accuracy, 433
mass-to-charge ratio, 431
monoisotopic mass versus average mass, 431, 433
resolution, 431-432, 444
WWW resources, 595
Mass-spectrometry compatible silver staining, 196-198
MassFrag, 47^
MassLynx software, 874
MassSearch, 472
Mass-to-charge ratio (m/z), 431
Mister Gel, 162
Matrix-assisted laser desorption ionization. See MALDI
Medline, linking Entrez with, 813
Melanie LE software system, 160-162
Membranes
contamination of, 158
proteolytic digestion of electroblotted proteins, 402-404
transfer, 158
Mercaptoethylamine, for reducing proteins, 348
MES buffer
for protein microarray use, 644
for protein microarrays, 776
Metabolic labeling, 704
Metabolomics, !
Metal ion binding, detection by, 64. See also Silver staining
MetaMorph Imaging System, 758
META-PP, 808, 810, 812, 814, H 30
Metastable decay, 444
Methanol
in destaining solution, 327
in electroblotting buffer, 321
in fixative solution, 190, 199, 304
in RP-HPLC, 224, 227, 238
in sequence analysis protocols, 318-319
in silver stain fixation solution, 196
in transfer buffers, 204, 206
for wetting membranes, 158, 204, 207, 322-325
Methionine
cyanogen bromide cleavage and, 361-362, MS-ÓB7
properties and structure, 842-843
Methionine sulfoxide, 362, 387
Methyl Green, 156, 201-207
Methylene blue, 45, 81
Methylimidazole, 343
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Methyltrichloroacctate, 359 
Metric prefixes, 838
MGD database, 829 
Michael addition, 605—607, 614—615 
Microarrays, See Arrays, protein 
Microbore column chromatography, 230-235 
band broadening, 232 
buffer exchange, 233-235 
principles, 231—232 
protocol, 242-248 
resolution, 232—233 
sample capacity, 232-233
sample preparation for sequence analysis, 
305-306 
sample recovery, 235 
trace enrichment, 233-235, 305 
Microcapillary HPLC (μLC)
μLC-ESI-MS/MS
off-line micro-IMAC, 638-642 
phosphoprotein analysis, 638—640 
preparation and use of columns, 487—495 
Micrococcal nuclease, 755
Microgram/picomole conversion guide, 336 
Microscope, epifluorescence, 758 
Microtubule-associated proteins, precipitation 
using magnesium, 137 
MINT database, 11, 673, 824 
MIPS database, 824-825 
Mobile phase, RP-HPLC, 222-230 
ion-pairing agents, 228-230
organic solvent, 222, 224, 226—227 
pH, 226-227 
preparation, 259-261, 266, 270-271
ModBase, 812-813 
Molarity, of concentrated acids and ammonia,
840
Molecular crowding, 672 
Molecular function, protein, 7 
Molecular INTeraction Database (MINT), 11,
674, 824 
Molecular scanner, 463 
Moles/weight conversion table, 834 
Monoisotopic mass, 433
MOPS buffer, 693, 776 
MOWSE (molecular-weight search), 472
mRNA 
alternate splicing of, 670—671 
differential detergent fractionation and, 137
expression level, 6 
profiling, disparity with protein profiling, 6 
MS-Fit, 472 
MS-Tag, 474
MTPXt polypeptide, 255—256 
Mucin, removal from tissue homogenates, 106
Multidimensional protein identification tech-
nology (MudPIT), 507-519
analysis of complex protein mixtures 
digestion of insoluble protein fractions for, 
512 
digestion of soluble protein extracts for, 
510-511 
materials, 508—509 
protocol, 511-512
S, cerevisiae proteome analysis, 513 
SEQUEST analysis, 512 
2D chromatography and MS combined 
materials, 514-515
proteolysis of soluble protein, 516 
RPC chromatography, 518-519 


SCX chromatography, 516-517

Multiphasic zone electrophoretic (MZE) theory,
44 

Multipotential colony-stimulating factor, 14 

Munich Information Center for Protein 
Sequences (MIPS), 824—825 

Muscarinic acetylcholine receptor, 14 

Mycoplasma genitalium, 6 

Mylar, 183 


15N labeling, 536
NaCl. See Sodium chloride 
Nanoliter-LC, 450, 496-507
coupled to MS/MS (nano-LC-MS/MS), 496-507
column preparation, 497—500 
HPLC programming, 503 
ion-source setup, 502
loading sample onto column, 501 
materials, 496-497
MS analysis, 503—505 
S. cerevisiae proteome analysis, 507 
sample concentration, 500-501
separation of phosphoproteins, 652—654 
Native agarose gel electrophoresis, for isolation 
of protein complexes, 680, 713-716
Native gel buffer, 714 
NCBI database, 468 
NCBInr database, 468 
Nearest-neighbor relations in protein complexes,
581
Needleman-Wunsch, 798
Negative staining, 358-359, 394-395
Nematode Expression Pattern Database, 829 
NEpHGE. See Nonequilibrium pH gradient 
electrophoresis 
NetOglyc database, 814 
NetPhos database, 814
Neutral gain scan, 443 
Neutral loss scanning, for phosphopeptide detection
and analysis, 441, 445, 620
NH4HCO3. See Ammonium bicarbonate
Nile Red dye, 42
NitPico database, 814 
Nitrilotriacetic acid (NTA), 608, 624 
Nitrocellulose membranes 
age of, 158 
digestion of proteins on, 403-404
electroblotting to, 306, 403 
in semidry blotting of 2D gels, 206-207 
staining proteins on, 403 
supported, 158 
in tank transfer of 2D gels, 203-205
wetting, 158
Nitrogen cavitation, 96, 112-116 
equipment, 113 
overview, 96, 112 
protocol, 114-116 
reagents, 113-114
Nondenaturing loading buffer, 744 
Nondenaturing PAGE 
overview, 50
protocol, 85-86 
variables, 50-51 
Nonequilibrium pH gradient electrophoresis 
(NEpHGE), 134-135 
Nonidet P-40 
for cell lysis, 95, 107, 119-120
in denaturing gel electrophoresis, 49
for protein solubilization, 148
Nonionic detergents, interference with sequencing, 301
Normalization, 162
NP-40. See Nonidet P-40
NTA. See Nitrilotriacetic acid
NTCB (2-nitro-5-thiocyanobenzoic acid), 153,
Anh
Nuclear magnetic resonance (NMR) studies, 810
Nuclear pore complex, 689
Nuclease digestion, 755
Nucleic acids, as impurities in protein samples
for 2D gel electrophoresis, 149-150.
See also DNA; RNA
Nup84p complex, topology of the, 729-733
Nylon membranes, 158

18O labeling, 21
Octyl-glycopyranoside (OGP), 557-558
O'Farrell lysis buffer, 134
Oligo R3 resin, 453, 458
Oligodeoxyribonucleotide synthesis, 746
Oligomeric state, 8
Open reading frame (ORF), &
Operons, 816-817, 820-821
Optical density, 162
Organic solvent for RP-HPLC, 222, 224, 226, 238
Orthologs
importance of, 804-805
rapid identification of orthologs by the method
of bidirectional best hits, 806-807
Orthophosphoric acid (H3PO4), in RP-HPLC,
227, 229-230
Overlay solution, 547. 344
OWL database, 468
Oxazolone, 580-581
Oxidation of proteins, by performic acid,
370-371

32P labeling
of cells, 167
peptides, solid-phase microsequencing of,
309-310, 329-334
phosphoprotein and phosphopeptide analysis, 
22-23, 26 
Packing materials, for HPLC, 223
PAGE. See Polyacrylamide gel electrophoresis
Palladium(II) (tetrakis-4-N-[methyl]-pyridyl
porphyrin), 684-685, 734-740
Pancreatic ribonuclease A, 890-892 
Papain, 354, 358
p-azidophenacyl bromide, 743, 747
Paralogs, 804 
Paramagnetic beads, 752-754 
Partition chromatography method, 288
Pasteur pipette concentration gel system, 326-328 
PathCalling Yeast Interaction Database, 11
PAUP, 805
PBS. See Phosphate-buffered saline 
pCAT, 676
PCXC-54 detergent, 23, 54 
PDCH. See Perfluoro-1,3-dimethylcyclohexane
PDGF. See Platelet-derived growth factor
PDinteract database, 824 
PDQuest software, 5, 160-162
PPOWeh software, 162 
Peak parking, 450, 611, 613
Peak shape, 493
PEI. See Polyethylenimine
Pentafluoropropionic acid (PFPA), in RP-HPLC, 
228-231 
PepFrag, 474-475 
Pepident2, 473 
Pepsin, 353—354, 357 
Pepstatin, 104, 106, 151 
Peptide mapping 
analytical, 344 
comparative, 344 
covalent cross-links, breaking, 347 
definition, 344 
disulfide bond cleavage, 347-348, 370—371 
disulfide linkages, determination of, 364-366, 
378—379 
fragmenting proteins 
chemical 
cyanogen bromide, 361-362, 385-387 
hydroxylamine, 361—364, 388—389 
o-iodosobenzoic acid, 364, 390-391
overview, 360-361 
reagents, table of, 353-382 
enzymatic 
choosing a proteinase, 357-358 
in gel digestion, 358 
information resources, 356 
optimized conditions, 356 
overview, 352-356 
proteases, list of, 354 
reagents, table of, 353 
with trypsin, 380—382 
in-gel digestion and extraction of peptides, 
360, 400-401 
macroscale, 345 
microscale, 345—346 
on-membrane digestion of electroblotted pro- 
teins, 360, 402—404 
overview, 344-345 
reduction and S-carboxymethylation of proteins
large-scale method, 372-374
microscale method, 375-377
by SDS-PAGE, 364, 412-416 
S-pyridylethylation of gel-resolved proteins, 
360, 398—399 
staining proteins in gels 
Coomassie Blue, 358-359, 392-393
negative staining, 358—359, 396-397 
overview, 358-359 
silver staining, 359, 396—397 
using RP-HPLC, 366, 378-379, 405-411 
WWW resources, 423 
Peptide mass fingerprinting, 19 
analysis of E. coli proteome, 561—564 
analysis of human plasma proteome, 561, 564— 
565 
materials, 556 
multiple endoproteinase reactions, 566 
of phosphopeptides, 604 
protein identification by peptide mass map- 
ping, 468—470, 472-473 
protocol, 557-560 
Peptide YY (PYY), 14
PeptideMass, 473 
PeptideSearch, 472 
Perfluoro-1,3-dimethylcyclohexane (PDCH),
561, 570
Performic acid
disulfide bond cleavage by, 347-348, 370-371
oxidation of proteins, 370—371 
production of, 371 
pET fusion system, 676 
Pfam database, 810, 814 
PFBP database, 825 
PFPA. See Pentafluoropropionic acid
pH 
charge of proteins and, 50 
electrophoresis buffer, 50-51 
electrophoretic mobility and, 40, 46 
gel porosity and, 45 
immobilized pH gradient gels, 151-152 
isoelectric point and, 151 
mobile phase in RP-HPLC, 226—227 
of stacking gel, 43 
Phage display, 671—672 
PHDhtm, 811
PHDtopology, 811
Phenanthroline, 104 
Phenol, in RNA isolation from detergent 
extracts, 136 
Phenolic compounds, as impurities in protein
samples for 2D gel electrophoresis, 150 
Phenomics, 2 
Phenylalanine 
properties, 843 
structure, 842 
Phenylisocyanate (PIC), 299, 311, 333, 333 
Phenylisothiocyanate (PITC), 290, 292
Phenylmethylsulfonyl fluoride (PMSF), 103-
104, 106, 119, 163, 168-169, 380 
inactivation by sulfhydryl reagents, 150 
Phenylthiocarbamyl (PTC), 290, 313
Phenylthiohydantoin (PTH) amino acid, 290, 
292, 295, 297, 309, 329 
PhIAT. See Phosphoprotein isotope-coded affinity
tags
Phoretix software, 160-162 
Phosphatases 
treatment of affinity-bound phosphoproteins, 
617 
treatment of IMAC-enriched peptide samples, 
642 
Web resources on, 600 
Phosphate buffer, 693, 776 
Phosphate-buffered saline (PBS), 108, 113, 124, 
130 
Phosphoamino acids 
identification by thin-layer electrophoresis,
880-881 
modification to facilitate identification, 877— 
879 
Phosphoarginine, 599 
Phosphoaspartate, 599 
Phosphocysteine, 599 
Phosphoglutamate, 599 
Phosphohistidine 
localization of, 658-660
structure, 599 
Phospholysine, 599 
Phosphopeptides 
biotin labeling, 24-25 
chemical modifications of, 24-25 
detection methods, flow chart for, 601 
enrichment methods for, 23-24 
lability of phospho-moiety, 617-618 
MS analysis of, 600, 603-623 
affinity-bound phosphoproteins, 617, 631- 
637
alkaline phosphatase treatment of phosphopeptides, 623-630
after MALDI-MS analysis, in solution,
629-630
after MALDI-MS analysis, on-probe, 630
prior to MALDI-MS analysis, 629
derivatization of phosphopeptides, 605-607, 612-615
emerging techniques for, 622-623
enrichment of phosphopeptides, 607-615
IMAC, 607-617
by immunoprecipitation, 607-608
by ESI-MS, 651-655
histidine phosphate localization, 658-660 
IMAC 
Fe(III) or Ga(III) use, 624-626
MALDI-TOF-MS and IMAC combined,
627, 639
microcolumn preparation and use, 627, 
639 
nanoscale, 626—627 
off-line micro-IMAC enrichment, 638-640
in-source CID, 619
ionization suppression, circumventing problem of, 605-615
by MALDI-MS, 643-650, 656-657
MALDI-TOF-MS and IMAC combined, 631
methods, 615-617
by µLC-ESI-MS/MS, 641-642
multidimensional strategies, 621 
neutral loss scanning, 620 
overview, 603-604, 615—617 
precursor loss scanning, 620 
problems associated with, 604—605 
reporter ion, 618 
post-source decay, 27 
radiolabeling, 602 
reporter ion, 618
western blotting, 602 
Phosphoprotein isotope-coded affinity tags 
(PhIAT), 615 
Phosphoprotein stains, 155—157, 201-202 
Phosphoproteins
alkylation, 652 
chemical modifications of, 24—25 
enrichment methods for, 23-24 
proteolytic cleavage, 603 
reduction, 652 
Phosphoramidate derivatization, 607, 614-615
Phosphoric acid, in RP-HPLC, 229 
PhosphorImager, 159
Phosphorylation 
chemical stability of phosphorylated amino 
acids, 598 
enrichment methods for phosphoproteins and 
phosphopeptides, 23-24 
mass increase from, 600 
overview, 22-23 
as regulatory mechanism, 598 
sites, analysis of, 26, 308-309, 597-667. See
also Phosphoproteins
autophosphorylation sites, 656—657 
by radiolabeling, 600, 602 
sample requirements, 604 
Web resources, 607 
Phosphorylation buffer, 744, 748 
Phosphoserine, 598
β-elimination, 605-606, 614
collision-induced dissociation, 605
derivatization of, 877-879
enzymatic cleavage, effect on, 605 
phosphate elimination from, 614 
structure, 599, 614 
Phosphothreonine 
β-elimination, 24, 605-606, 614, 621
CID, 605 
enzymatic cleavage, effect on, 605 
phosphate elimination from, 614 
structure, 599, 614 
Phosphotyrosine
enrichment by immunoprecipitation, 607-608
phosphate elimination from, 614
structure, 599, 614 
Phosvitin, 201 
Photochemical reactor, 752, 754 
Photo-cross-linking. See Cross-linking
Photodissociation, 568
Photo-induced cross-linking of unmodified
proteins (PICUP), 684, 734
Photomultiplier tube, 699 
Phylip, 805
pI (isoelectric point), 151, 161, 680
PIC. See Phenylisocyanate 
PicoGreen dsDNA quantitation kit, 744, 751 
Picomole/microgram conversion guide, 334 
PICUP. See Photo-induced cross-linking of
unmodified proteins
PIMRider database, 824
PinPoint, 676 
Piperidine thiocyanate, 333 
PIPES buffer 
in differential detergent fractionation, 126, 
128, 130 
for protein microarray use, 693, 776 
PIR database, 468 
PITC. See Phenylisothiocyanate 
pK values, buffer, 839 
Platelet-derived growth factor (PDGF), 14
PLEX. See Protein Link Explorer 
PlusOne silver stain kit, 156, 196 
pMAL, 676 
PMF. See Peptide mass fingerprinting
PMSF. See Phenylmethylsulfonyl fluoride
PNGase F, 561 
Polyacrylamide 
polymerization, initiation of, 44—45 
pore sizes, 45—46 
structure, 44 
Polyacrylamide gel electrophoresis (PAGE). See
also SDS-PAGE
acid-urea PAGE, 49, 81—82 
apparent molecular-mass determination, 77-78 
applications of, 40 
BN-PAGE, 680-681, 717—724 
commercial slab gel systems, 54 
CTAB-PAGE, 49, 79-80 
electroblotting proteins from gels to PVDF 
membranes, 320-323 
gel additives, 45—46 
general principles, 41-46 
continuous versus discontinuous gels, 41-43 
pore sizes, 44-45 
resolution, 43-46 
in-gel digestion and extraction of peptides, 
360, 400-401 
mobility, influences on, 41 
concentration of polyacrylamide, 41 
pH, 49
nondenaturing, 50-51, 85-86
sample preparation for sequence analysis, 306-
307
SDS-PAGE. See SDS-PAGE
sensitivity of protein detection in, 40 
S-pyridylethylation of gel-resolved proteins, 
360, 398-399 
staining proteins in gels 
Coomassie Blue, 358-359, 392—395 
negative staining, 358-359, 394—395 
silver staining, 359, 396—397 
visualization procedures, 40-42, 47-48, 63-76
Coomassie Blue, 63 
conventional, 65-66 
rapid, 67-68 
fluorescent probes, 63-64
SYPRO Orange, 73-74 
SYPRO Ruby, 71-72 
InstaStain Blue Gel Paper stain, 64-70 
overview, 63-64 
silver stain, 64, 75-76 
WWW resources, 89 
Polybrene, 294 
Polyethylene glycol, polyacrylamide pore size
and, 45
Polyethylenimine (PEI), 120, 702, 704
Polyphenol oxidase, 150 
Polytron homogenizer, 105 
Polyvinylidene difluoride (PVDF) membranes, 
158 
Amido Black staining of proteins, 552, 554—555 
column frit, 247 
detecting proteins on, 324-325 
electroblotting to, 306, 403 
electrotransfer of proteins from gels to, 320-323
mass spectrometry and, 463 
in semidry blotting of 2D gels, 206-207 
in sequencing protocols, 309-310, 313 
staining with Amido Black, 403 
in tank transfer of 2D gels, 203-205 
transfer of proteins to 
adsorption of protein in solution, 334
electroblotting, 334 
Polyvinylpolypyrrolidone, 150 
Polyvinylpyrrolidone, 46, 150 
Ponceau S, 208—209, 324, 402—403 
Poros R1 resin, 453—454 
Poros R2 resin, 453—454, 456, 458, 460, 462, 490 
Poroszyme, 510 
Position-specific iterated-BLAST (PSI-BLAST),
78, 798, 804
Post-source decay (PSD), 27, 444
Posttranslational modifications, 6, 598
chemical, 24-26
difficulty of studying, 22
mass changes due to, 840
mass spectrometry for study of, 21 
overview, 21-26 
phosphorylation, 22-24 
Potassium acetate/chloride staining, 358 
Potassium chloride, in SCX chromatography 
buffers, 515 
Potassium ferricyanide, 539 
Potter-Elvehjem homogenizer, 103, 105 
Power supply, for SDS-PAGE, 53 
Precursor ion scanning, 27, 441, 443, 620 
Predicting protein structure and function,
808-814
annotated domain databases, 814
domain structure prediction, 808, 810
fold recognition, 812-813
secondary structure prediction, 811
sequence signals of protein function, 813-814
tertiary structure prediction, 812-813
transmembrane domain prediction, 811
Predictome, 11 
Pressure cell, 489-490 
Pressure conversion table, 838 
Pressurization device, 499 
Printer, chemical 
details, 465 
equipment, photographs of, 464 
overview, 463, 465 
peptide mass fingerprinting 
analysis of E. coli proteome, 561-564
analysis of human plasma proteome, 561,
564-565
materials, 556
multiple endoproteinase reactions, 566
protocol, 557-560
PRINTS database, 814
Prochymosin, 98
ProDom database, 810, 814, 818-819
Product ion scan, 441, 443 
PROF, 811 
Profiling, protein, 691—692, 698—699 
ProFound, 473 
Proline 
properties, 843 
structure, 842
Proline effect, 581 
PRONET. See Protein Interaction on the Web 
Propanol, in RP-HPLC, 224, 227, 248 
ProSal database, 808, 810, 812, 814 
ProSites database, 814 
Protamine sulfate, 120 
Protease inhibitors, 104, 150-151. See also specific
inhibitors
Proteases 
choosing, 357-358, (See also Fragmentation of 
proteins, enzymatic; specific proteases) 
control of, & 
examples, # 
list of common, 354
optimized proteolysis conditions, 356 
Schechter and Berger nomenclature for the 
description of protease subsites, 355 
selectivity, 352-354
specificity, 353, 355 
WWW resources, 356 
Protein-affinity-interaction chromatography, 15 
Proteinase digestion buffer, 414 
Proteinase K, 354, 358 
Protein-binding buffer, 752, 754 
Protein complexes, 669-786. See also Protein-protein
interactions
alternate mRNA splicing and, 670—671 
cross-linking 
difficulty of technique, 683-684
MS use with, 725, 728-733 
optimization of reaction, 726-728 
protein proximity and spatial organization 
information, 681—684 
site-specific protein-DNA photo-cross-linking 
advantages of on-bead cross-linking, 685, 
741 
case study, 756
design and preparation of biotin-labeled
site-specifically derivatized DNA fragment, 743-751
identification of cross-linked proteins,
750
nuclease digestion, 755
overview, 685-687, 741-742
preparation and UV irradiation of protein-DNA
complex, 752-754
subsecond photo-cross-linking using water- 
soluble metal complexes, 734-740 
case study, 739—740 
materials, 734—735 
method, 735-737
optimization and precautions, 738 
overview, 684-685 
topology analysis, 725—733 
examples of, 15 
identification of proteins in, 15 
isolation using electrophoresis, 680-681
BN-PAGE, 680-681, 717-724
native agarose gel electrophoresis, 680, 713-
716
overview, 671-673
temporal variation, 672 
topology analysis, 725-733 
WWW resources, 794 
Protein concentration determination 
BCA assay, 165, 846-856 
Bradford assay, 857-859
in DDF, 134
Protein Data Bank (PDB), 812
Protein-DNA complexes, structural organization
of, 685-686 
Protein extracts. See Extracts, protein 
Protein fingerprinting, 344. See also Peptide
mapping 
Protein interaction databases, 10-11, 823-824 
Protein Interaction on the Web (PRONET), 11
Protein kinase C, 600 
Protein kinases, 308—309, 698 
Protein Link Explorer (PLEX), 822 
ProteinLynx Global SERVER, 473, 475 
Protein Prediction Meta Server, 813
Protein profiling, 6, 691-692, 698-699 
ProteinProspector, 472, 474 
Protein separation strategies, overview, 12-14 
Protein solubilization, 147-149 
ampholytes or buffers, 148-149
chaotropes, 147-148 
detergents, 148 
reductants, 148 
Protein state, attributes of 
biological interactions and, 9 
cellular localization, 7
conformation, 8 
covalent modifications, 7 
ligands, presence of, 7 
oligomeric state, 8 
proteolytic cleavage, 71 
splicing, 7 
Protein-protein interactions. See also Protein
complexes 
analysis 
affinity capture techniques, 675—679 
from cell lysates, 677, 708—712 
optimization of variables for, 678 
tandem affinity purification, 679
using FLAG-epitope-tagged proteins, 677,
7 707
by binding constant determination, 690-691
in vivo with FRET
data analysis, 763-765
experimental design, 759-760
exposure settings calibration, 761-762
image acquisition, 762-763
materials, 757-758
overview, 686-690
strain construction, 760-761
with protein arrays, 691—700, 772—786 
yeast two-hybrid system, 673—674 
binding constants, 690-691
cell-mapping proteomics, 3, 5
importance of, 671 
integrated biology approach, 7 
monitoring with GFP chimeras, 766-771
case study, 768
fluorescence gel-retardation assay, 767-768
fluorescence polarization assay, 769-771
overview, 766 
physical versus biological, 9 
protein states and, 7-8
Protein transfer, from 2D gels. See Transfer of 
proteins from 2D gels 
Proteolytic cleavage 
denaturation of polypeptide prior to, 355 
phosphoprotein, 603 
protein state and, 7-8
Proteolytic degradation, in extract preparation, 92 
Proteome, definition, 3 
Proteomic analysis, 5 
Proteomics 
capture molecule, 29 
cell-mapping, 3, 5, 8, 12, 671, 675
definition, 3, 5 
differential display (comparative), 19-21 
expression, 3, 5 
global profiling, 4, 5 
identification and analysis of proteins, 12-19
amino acid sequence determination, 16-19
in complexes, 15
protein separation strategies, 12-14
integrated biology approach, 7-8 
microarrays, 26-28 
modification-specific, 598
need for, 6-12 
posttranslational modifications, 21-26 
chemical, 24—26 
phosphorylation, 22—24 
protein-protein interactions, 7-11
target molecule, 29
technology platforms for, 12
WWW resources, 37
Protonated peptides, fragmentation mechanisms
in the gas phase, 577-583
Prowl software suite, 473—475 
PSD. See Post-source decay
Pseudomolecular ions, 441
PSI-BLAST. See Position-specific iterated-BLAST
PSIpred, 811
PSORT, 829-830
PTC. See Phenylthiocarbamyl
PTH. See Phenylthiohydantoin 
Pulsed liquid-phase sequencer, 295, 297 
P-values, BLAST, 802 
PVDF. See Polyvinylidene difluoride membranes 
PVP-40, 402-404
Pyridylethylation, $4.4, 360, 398-399
Pyroglutamate aminopeptidase, 308, 354


QIAexpress, 676
QM ORF, 195
QTOF2 MassLynx software, 503
Quadrupole ion trap, 446-447, 569-571
Quadrupole time-of-flight (QTOF) MS,
444-445, 520, 622, 645
Quantifying protein by BCA, 846-856
96-well microtiter plate procedure, 854-856
calibration curve, construction of, 850-851
microprocedure, 852
standard procedure, 848-849
Quantitative proteomics, 5
Quench solution, 752
Quenching arrays, 695 


Radiolabeling, 167
phosphorylation site mapping, 600, 602
32P labeling
of cells, 167
peptides, solid-phase microsequencing of,
309-310, 329-331
phosphoprotein and phosphopeptide analysis, 22-24, 26
protein arrays, 782-783
Rat liver protein extract for 2D gel electrophoresis,
Led ded
REBASE database, 824 
Recombinant proteins 
extraction from bacterial cells, 92, 95-98, 117-120
secretion of, 92
solubilization from inclusion bodies, 121-123
Reductants
in protein microarrays, 693
for protein solubilization, 148
Reduction 
disulfide bond cleavage by, 348—352, 372-377 
gel spot, 399 
large-scale method, 372-374 
microscale method, 375-377
phosphoprotein, 657
whole-gel, 360-361, 398-399
Reduction buffer, 373, 375
Reference tables
amino acid properties, 843
amino acid structure, 842 
amino acids not encoded in the genetic code, 
nal 
buffers and pK values, 834
mass changes due to posttranslational modifications,
840
metric prefixes, 848 
molarities and specific gravities of concentrat- 
ed acids and ammonia, 840 
moles/weight conversion table, 838
pressure conversion table, 838 
Refolding of recombinant proteins, 98, 121, 123
Regular expression, 814 
RegulonDB database, 817, 823-825
Rehydration buffer, 555
Rehydration solution for IPG strips, 172-174
Relative mobility, calculating, 75
RELIBASE database, 824
Reporter ion, 618 
Resins, chromatographic, 453
Resolution, mass spectrometry 
10% valley definition, 431
examples, 431-433
full-width, half-maximum (FWHM) definition, 431
Resolving gel buffer, 52
Resolving gel mixture, CTAB-PAGE, 74
Resolving gel recipes, SDS-PAGE, 75
Restriction endonucleases, 745, 749
Retention factor, 224 
Reversed-phase high-performance liquid chro- 
matography (RP-HPLC), 449 
analytical separation of crude peptide and 
protein mixtures, 261, 273 
capillary columns, 242-248 
construction, 246-247
equipment, 243-245
flow generation, 245-246
packing, 247-248
reagents, 242 
UV detection, 246 
chromatographic support, 221-222 
column storage, 409 
connecting column to HPLC plumbing, 239 
dead volume determination, 409 
desalting of peptide and protein mixtures, 
263-268 
equipment for gradient elution, 410 
evaluating the system, 405 
examples of polypeptides and proteins puri- 
fied by, 225 
fittings, interchangeability of, 241 
G2 LabExpert control, 254 
hydrophobicity and, 220-221, 224, 228-230, 
251 
limitations, 219-220 
microbore, 230-235 
band broadening, 232 
principles, 231-232 
resolution, 232-233 
sample capacity, 232-233 
sample recovery, 235 
trace enrichment and buffer exchange, 233-
235
microsequencing, sample preparation for, 304— 
306 
mobile phase, 222-230 
ion-pairing agents, 228-230
organic solvent, 222, 224, 226-227
pH, 226-227
optimization of gradient conditions, 274-280
calculations, 277-279 
gradient shape, 279 
gradient time optimization, 279 
initial experiments, 274—275 
peak tracking and assignment, 278-279 
starting conditions, choice of, 275-277 
verification of results, 27¢ 
overview, 219-220 
packing capillary columns for, 242—248 
packing materials, table of, 223
particle size, 221-223 
peptide mapping using, 366, 405-411 
of peptides, 405—411 
pore size, 221-223 
preparative separation of peptides, 262 
principle, 220-221 
purification of large polypeptides, 249-257
case studies, 252-253, 255, 257
protocol, 251-256
of radioactive fraction from anion-exchange
chromatography eluent, 645-647
sample recovery, 235 
solid-phase peptide synthesis, purification of 
peptides from, 258-264 
solvent preparation, 407 
standard chromatographic conditions, 236-
241, 248
storage of columns, 241
storage of samples, 346
trace enrichment, 233-234
WWW resources, 224, 233, 286 
Reversed-phase microcolumns, preparation of, 
453, 482—484 
Reversed-phase Solvent A, 532 
RF10 medium, 702, 704 
Riboflavin 5'-phosphate, 45
Ribosome, 6 
Ribosome display, 672 
RIPA lysis buffer, 95, 107, 124 
RNA 
DDF and, 137 
as impurities in protein samples for 2D gel 
electrophoresis, 149—150 
isolation by DDF, 127-128, 136-137
RNA extraction buffer, 136 
RNA polymerase II transcription complexes, 6 
RNA polymerase transcription initiation complexes,
E. coli, 756
RNase A, 150
RP chromatography solvents, 514
RPC chromatography, of SCX chromatography
fractions, 518-519
RP-HPLC. See Reversed-phase high-performance
liquid chromatography
RPMI 1640 media, 702, 704
RPMI-based medium, 167
Ru(II)(bpy)₃²⁺, 684
Running buffer, SDS-PAGE, 53


35S labeling of cells, 167, 704
S65T GFP chimeras, 768
Saccharomyces cerevisiae, 673 
FRET experiments, 689, 758 
MuDPIT analysis of proteome, 513 
protein identification by nano-LC-MS/MS
and database searching, 507 
SAGE. See Serial Analysis of Gene Expression 
SAGEmap database, 825 
Sample buffer, nondenaturing PAGE, 85
Sapphire knife, 245, 247
SBASE, 810 
Schiff bases, 300 
Science's Signal Transduction Knowledge 
Environment (STKE), 11, 824-825 
Scintillation fluid, 329-331 
SCOP, 813 
SCX. See Strong cation exchange chromatography 
SCX chromatography buffers, 515
SDS 
10% stock solution, 54 
interference with sequencing, 301 
in Laemmli electrophoresis running buffer, 
327 
PAGE, pore size and, 46 
for protein solubilization, 148
in sulfhydryl blocking solution, 547
in transfer buffers, 158, 203, 206
trypsin digestion and, 380, 382
in zinc/imidazole staining, 394-395
SDS electrophoresis buffer, 178, 182, 187
SDS equilibration buffer, 185-186
SDS-loading buffer, 755
SDS-PAGE, 4—19 
anomalous behavior in, 49 
concentration of acrylamide gel spots for
sequencing, 326-328 
denaturing gels, 46-49 
isolating the protein complex components by, 
723-7 
limitations of, 49
loading buffer, 702, 705, 709, 711
peptide mapping by, 364, 412-416
protocol, 52-61
equipment, 53-54
linear gradient gels, 59
pouring a slab gel, 54-57
reagents, 52-53
running discontinuous slab gel, 58
sample preparation, 57-58
sample buffer, 53 
stacking gel, 52 
staining proteins in gels, Coomassie Blue, 358-
359, 392-393
Tricine, 20
troubleshooting, 60-61
tube versus slab gels, 52
2D gel electrophoresis, 144, 146
IPG strip equilibration, 185-186
overview, 152-154
protocol, 187-189
recipes for single-percentage gels, 178
vertical gels, 134 
vertical slab gels 
casting single homogeneous gel, 177-180 
electrophoresis conditions, 189 
simultaneous casting of multiple gradient 
gels, 181-184 
Search engines for protein identification using
MS data, 506
history, 467
peptide fragment ion searches, 470, 474-477
peptide mass fingerprinting and, 468-470, 
472-473 
sequence databases, 467-468 
summary, 477 
Searching for functionally linked proteins, using
gene fusions, 813-819
Secondary structure prediction, 811
Selective immunoseparation procedures, cell
purification by, 94
Semidry blotting of 2D gels, 206-207
Sensitization solution, silver staining, 196
Sequelon-AA membranes, 310, 329-330
Sequelon-DITC membranes, 310, 3428
Sequence analysis, 287-541. See also Edman degradation;
Peptide mapping; Sequencer
amino-terminal, 290-310 
blocking, 300, 307-308 
carboxy-terminal combined with, 313, 317
carboxy-terminal compared, 310-311
concentration of gel-separated protein for, 307 
overview, 290-292 
variations, 310 
carboxy-terminal 
amino-terminal combined with, 313, 317
amino-terminal compared, 310-311 
automated, 310-311 
examples of chromatograms, 312, 314—317 
overview, 28], 310 
performance characteristics, 332 
protocol, 332-335 
sensitivity and performance, improvement 
of, 311-313 
concentrating acrylamide gel spots for, 326-328 
core facilities for, 289 
electroblotting for, 320-323 
experimental approaches, flow chart of, 346 
history of, 288 
microsequencing, sample handling considera- 
tions, 346 
of phosphorylation sites, 308-309 
rate-limiting steps 
contaminants, chemical, 300—301 
sample attributes for successful sequencing, 
301 
sample purity, 298, 300 
sample preparation 
blocking, 300, 307-308
by PAGE, 306-307 
by RP-HPLC, 304-306 
WWW resources, 341 
yield 
initial, 302
repetitive, 302-303
Sequence database searching with Gapped- 
BLAST and PSI-BLAST, 798-807 
Sequencer 
biphasic column, 295, 297-299, 338-319 
gas-phase, 294-296 
initial yield, 302 
performance, improvement in, 302 
pulsed-liquid phase, 295, 297 
repetitive yield, 302—303 
spinning-cup, 291-293, 295
SEQUEST, 474, 495, 506 
Serial Analysis of Gene Expression (SAGE) 
libraries, 825 
Serine 
phosphoserine, 598—599, 605—606, 614 
properties, 843 
structure, 842 
Serratia marcescens, 168 
S-ethylcysteinyl peptides, 879
SGD database, 829
SID. See Surface-induced dissociation
Signaling pathway, 6, 308-309
Signaling pathway database (SPAD), 11
SignalP, 823-830
Signal-to-noise ratio, 493-494
Signal Transduction Knowledge Environment,
11, 824-825
Silica resin, derivatized, 488 
Silica-based reversed-phase materials, 221, 223, 226 
Silver staining 
2D gels, 155-156 
ammoniacal, 155, 192-195, 396 
interference, minimizing, 194, 197 
MS-compatible, 155-156, 196-198 
for protein microanalysis, 359 
of proteins in gels, 359 
sensitivity, 64, 192 
silver nitrate, 396—397 
steps, 192
techniques, 40-41
Silver staining solution, 193, 195, 197
Single-ion monitoring (SIM), 619
SMCC (succinimidyl-4-[N-maleimidomethyl]cyclohexane-1-carboxylate),
cross-linking agent, 725
Smith-Waterman, 798 
SnapDragon, 810 
SOCS (suppressors of cytokine signaling) family 
of proteins, 677, 679 
Sodium 
acetate staining, 358 
bicarbonate buffer, in capping reagent, 546 
borate, in electroblotting buffer, 321 
carbonate, in developing solution, 196, 396 
chloride (NaCl) 
NaCl/acetonitrile in RP-HPLC, 367 
in RP-HPLC, 227, 410 
UV transparency, 227 
hexylsulfonate, 369 
iodide (Nal), 752 
p-toluenesulfinate, 81 
phosphate, in RP-HPLC, 367, 410 
tetrathionate (Na2S4O6), 348
thioglycolate, to avoid blocking in gels, 308 
thiosulfate, 539 
Sodium dodecyl sulfate. See SDS
Solid-phase extraction capillary electrophoresis 
(SPE-CE), 612-613 
Solid-phase peptide synthesis, 258-264 
Solid-phase sequencing of 32P-labeled phospho-
peptides, 309-310, 329—331 
Solubilization buffer, 122, 170, 718 
Sonar MSMS, 474-475 
Sonication, 96, 111, 169 
Sorbitol, 98 
SPAD. See Signaling pathway database 
Spatial expression patterns, 828
Spatial organization of proteins, cross-linking 
use to investigate, 682 
SPE-CE. See Solid-phase extraction capillary 
electrophoresis 
Specific gravity, of concentrated acids and 
ammonia, 840 
Spectra Focus software, 246 
Spermine, 149 
Spinning-cup sequencer, 291-293, 295 
Splicing 
alternate proteins and, 7 
of mRNA, 670—671 
Split tee junctions, 245 
Spot detection and quantification, 160-161
SpotArray Enterprise, 694
Spots
matching, 161
normalizing, 161
SSpro, 811 
Stable isotope dilution, 20-21 
Stacking gel 
CTAB-PAGE, 80 
resolution of proteins and, 43 
SDS-PAGE, 55, 57 
Stacking gel buffer, 52 
Staining, 40-42, 47-48, 63-76. See also specific 
stains 
Coomassie Blue, 63, 324-325 
conventional, 65-66 
rapid, 67-68 
fluorescent probes, 42, 63-64
SYPRO Orange, 73-74
SYPRO Ruby, 71-72
of gel-separated proteins for proteolytic digestion,
358-359, 392-397
InstaStain Blue Gel Paper stain, 69-70
negative, 358-359, 394-395
overview, 63-64
proteins in gels
Coomassie Blue, 358-359, 392-393
negative staining, 358-359, 396-397
Overview, 358—359 
silver staining, 359, 396—397 
PVDF membranes, 324-325 
silver stain, 64, 75—76 
for 2D gel electrophoresis
Coomassie Blue, 154-155, 190-191, 208, 210
fluorescent, 155-156, 199-200
membranes, 208-212
colloidal gold, 208, 212
Coomassie Blue, 208, 210
india ink, 208, 211
Ponceau S, 208-209
phosphoprotein, 155-157, 201-202
silver stain, 155-156, 192-198
Stanford Microarray Database, 825, 827
STKE database, 11, 824-825
STRING Web server, 820
Strong cation exchange (SCX) chromatography,
450 
column preparation, 516-517 
shallow gradient for chromatography, 516 
of yeast peptides, 517-518 
Structural proteomics, 13 
Structure Prediction Meta Server, 812 
ST-Sepharose column, 541-543 
Subcellular fractionation 
differential detergent fractionation and, 101 
reasons for use, 98—99 
steps in, 100-101 
2D gel electrophoresis and, 99-100
Substitution matrix, 798 
Subtilisin, 353-354, 356, 358 
Succinic anhydride, modification of lysyl side 
chains using, 383 
Succinylation, of lysine, 385 
Sucrose, in SDS-PAGE gels, 182 
Sulfhydryl blocking solution, 547, 548 
Sulfite, 150 
Sulfitolysis, 347
Sulfosalicylic acid solution, 20% 

Suppressors of cytokine signaling (SOCS) family
of proteins, 677, 679, 712 
Surface-induced dissociation (SID), 442, 568

Swiss-Model, 812
SwissProt database, 468, 561, 802, 813, 818
Symmetric best hits method, 806-807
SYPRO 
Orange protein gel stain, 42 
Orange stain, 42, 73-74 
Red stain, 42, 73 
Rose Plus protein blot stain, 42 
Rose protein blot stain, 42 
Ruby fluorescent staining, 71-72 
Ruby stain, 42, 157, 199—200 
Tangerine protein gel stain, 42 


T4 DNA ligase, 745, 749
T4 DNA polymerase, 745, 749 
T4 polynucleotide kinase, 745, 749


T7 DNA polymerase, 745, 749 
TAME. See Tosyl-L-arginine methyl ester
Tandem affinity purification (TAP), 679
Tandem MS (MS/MS)
automated 2D LC-MS/MS for large-scale protein
analysis, 869-876
coupling to 1D-LC, 449 
coupling to 2D-LC, 450-451 
data-dependent, 611-612 
ESI and, 18-19 
IMAC with, 608, 610-612 
intermediate reaction events, 441-442 
mass analyzer configurations 
FT-ICR, 448 
quadrupole ion trap, 446-447 
QTOF, 444-445
tandem-in-space, 443-446 
tandem-in-time, 446—445 
TOF, 444 
TOF/TOF, 446 
triple quadrupole, 443—444 
nano-LC coupled to, 496-507
overview, 441—442 
phosphopeptide sequencing, 620—621 
emerging techniques, 622—623 
search engines, 470, 474—477 
top-down protein identification and charac- 
terization 
FT-ICR MS, 568-569
future developments, 572—573 
ion/ion reactions, 569-571, 573 
overview, 567—568 
quadrupole ion-trap MS, 569-571
Tank transfer of 2D gels, 157, 203-205 
TBLASTN, 798, 804 
TBLASTX, 798 
TBP. See Tributylphosphine
TCA, for removal of impurities, 149
TCEP. See Tris (2-carboxyethyl) phosphine
TEAP. See Triethylammonium phosphate
Techniques, 845-896 
2D-LC-MS for large-scale protein analysis,
869-876
Coomassie Blue dye-binding assay (Bradford 
Assay) for protein concentration deter- 
mination, 857—859 
isothermal titration calorimetry, 882—892 
advantages of, 883 
amount of sample required, 888
analysis of results, 888—890 
information available, 884-888
blank experiments, 887 
coupled proton transfer process, 887—888 
enthalpy of binding, 886—887 
heat capacity change of binding, 887 
simultaneous determination of association 
constant and enthalpy of binding, 
884-886 
limitations of, 883—884 
measuring intermolecular interactions technique,
890
materials, 890 
method, 891-892 
overview, 882-883 
principle of technique, 884 
sample preparation, 888 
phosphoamino acids 
identification by thin-layer electrophoresis, 
880-881
modification to facilitate identification,
877-878
quantifying protein by BCA, 846-856 
96-well microtiter plate procedure, 854-856
calibration curve, construction of, 850-851 
microprocedure, 852 
standard procedure, 848—849 
separation of peptides and proteins by 
HPCEC, 862—868 
urea purification, 860-861 
TEMED (tetramethylethylenediamine), 44-45, 178 
CTAB-PAGE, 79 
SDS-PAGE, 53, 55 
Tertiary structure prediction, 812 
Tetrabutylammonium chloride, in RP-HPLC, 229 
Tetrabutylammonium thiocyanate, 333, 335
Tetraethylthiuram disulfide/acetonitrile, 744, 746 
Tetrahydrofuran (THF), 224, 227, 511-512
Tetramethylammonium chloride, in RP-HPLC,
229 
TFA. See Trifluoroacetic acid
TGF-β. See Transforming growth factor-β
Thermolysin, 353-354, 358 
Thermus thermophilus ribosomal L36 protein, 
263-264 
THF. See Tetrahydrofuran
Thin-layer chromatography (TLC), 292, 294, 309 
Thin-layer electrophoresis, identification of 
phosphoamino acids by, 880-881 
Thioglycollic acid, for reducing proteins, 348 
Thiohydantoin, 290—291, 310, 335 
Thiols, estimation of free, 378-379 
Thioredoxin, 97 
Thiosulfate, in silver staining, 192 
Thiourea, 147, 150 
Threonine 
phosphothreonine, 599-600 
properties, 843 
structure, 842 
Thrombin, 353 
Thrombin receptor antagonist peptide analogs 
(TRAP analogs), 868 
TIC. See Total ion chromatogram 
Time of flight (TOF)
ESI, 445 
MALDI, 444-446 
mass analyzer, 444
QTOF, 444-445, 620, 622, 635
TOF/TOF, 446
Titin kinase, 8 
TMHMM, 811
TMPyP (5,10,15,20-tetrakis[1-methyl-4-pyridinio]porphyrin
tetra[p-toluene sulfonate]), 735-737
TNB (2-nitro-5-thiobenzoate), 365
TNF. See Tumor necrosis factor
TOF. See Time of flight
Toluene, in 2D paper chromatography, 289 
Top-down protein characterization 
FT-ICR MS, 568—569 
future developments, 572-573 
novel chemistries, 573 
novel instrumentation, 572-573 
understanding the fragmentation behavior 
of whole-protein ions, 572 
ion/ion reactions, 569-571, 573 
overview, 567-568
quadrupole ion-trap MS, 569-571
Topology prediction, 811
TopPRED, 811
Tosyl-L-arginine methyl ester (TAME), 546
Total cellular protein, sample preparation, 58 
Total ion chromatogram, 876
Total systems biology, 7-8 
Towbin transfer buffer, 203, 206
Trace enrichment, 13, 233-234, 305 
Transblotting buffer, 546 
Transcriptomics, 1, 6
TRANSFAC database, 823—824 
Transfection
using polyethylenimine, 704
with vectors encoding epitope-tagged proteins,
704
Transfer buffer, for BN-PAGE, 718 
Transfer of proteins from 2D gels 
equipment, 157 
membranes, 158 
semidry blotting, 157, 206—207 
tank transfer, 157, 203—205 
tips for, 158 
Transferrin receptor, murine, 14 
Transforming growth factor-β (TGF-β), 14
Transilluminator, 200
Transmembrane domain prediction, 811
Transpath, 11 
TRAP. See Thrombin receptor antagonist pep- 
tide analogs 
Tree View, 805, 826 
Trialkylphosphines, for reductive cleavage of 
disulfides, 349 
Tributylphosphine (TBP), 526-527 
for protein solubilization, 148
for reducing proteins, 348—350 
structure, 549 
Tricine, 46
Tricine running buffer, 80 
Tricine-SDS-PAGE
protocol, 83-84 
for separation of small polypeptides, 50 
Triethoxysilylbutanal, for aldehyde slide fabrication,
772-773
Triethylamine, 229, 294 
Triethylammonium acetate, 744, 747 
Triethylammonium phosphate (TEAP), 229-230
Trifluoroacetic acid (TFA)
for deblocking, 308 
in microsequencing with a gas-phase 
sequencer, 294 
neutralization by contaminants, 300 
in pulsed-liquid-phase sequencer protocol,
295, 297
in RP-HPLC, 227-231, 237-238, 248, 250-256,
259, 270, 367, 407-410
in sequence analysis protocols, 318-319
Triple-quadrupole mass spectrometer, 619 
overview, 445 
scan mode feature, 443—444 
Tris, interference with sequencing, 301
Tris (2-carboxyethyl) phosphine (TCEP) 
in protein microarrays, 694
for reducing proteins, 348, 350 
structure, 349 
Tris-acetate electrophoresis buffer, 51
Tris-bipyridyl ruthenium(II) dication
(Ru[bpy]₃²⁺), 734-735
Tris-borate electrophoresis buffer, 51
Tris-citrate electrophoresis buffer, 51
Tris-glycine electrophoresis buffer, 51
Tris-Cl
in electroblotting buffer, 321 
in rehydration buffer, 535 
in sulfhydryl blocking solution, 547 
Triton X-100 
for bacterial cell lysis, 117-118 
in BN-PAGE of protein complexes, 681 
in denaturing gel electrophoresis, 49
in DDF, 126-130, 132-133
interference with sequencing, 301 
PAGE pore size and, 46 
for protein solubilization, 148 
in rehydration buffer, 535 
tRNA, DDF and, 137
Troubleshooting
electroblotting, 323 
SDS-PAGE, 60-61 
YAPI, 758, 761
TRRD database, 824, 825 
Truncations, 7-8 
Trypan blue, 132-133 
Trypsin, 353-354, 400
anhydrotrypsin, 540-543
electrophoretic mobilities of, 40 
fragmentation of protein using, 380-382 
immobilized, 462, 549, 551 
optimized proteolysis conditions, 356 
peptide mass fingerprinting, 19, 556-566
phosphorylated amino acid effect on cleavage,
605
trypsin, 382 
Tryptophan 
cleavage at, 361, 364, 390-391 
properties, 843 
structure, 842 
Tubing, RP-HPLC, 244—247 
Tubulins, precipitation using magnesium, 137 
Tumor necrosis factor (TNF), 14
Tween-40 
in DDF, 126-127, 129-130, 132-133
interference with sequencing, 301 
Twitchin kinase, 8
Two-dimensional equilibration solution, 547 
Two-dimensional (2D) gel electrophoresis, 3, 
143-218 
analysis, 160-162 
annotating gels, 162 
editing matches, 161 
image visualization and manipulation, 160
molecular weight and pl calibration, 161 
normalization, 162 
software systems, 160 
spot detection and quantification, 160-161
spot matching, 161
synthetic and average gels, 161—162 
blocked protein in, 307 
concentration of acrylamide gel spots for 
sequencing, 326—328 
DDF and, 134-135 
electroblotting and, 557-559 
electrophoresis conditions, 189 
IEF, 151-152, 172-176 
in IEF-dedicated electrophoresis unit, 173, 
175-176
in multipurpose flatbed electrophoresis unit,
172-175
rehydration of IPG strips, 173-174
image acquisition, 159-160
CCD camera, 159
densitometer, laser-based, 159-160
document scanner, 159
limitations, 13, 99-100
molecular scanner and, 463, 544-555 
overview, 144-145 
sample preparation, 146-151
E. coli lysates, 168-169
eukaryotic lysates, 165-167
interfering substances, 146, 149-150 
lipids, 150 
nucleic acids, 149-150 
phenolic compounds, 150 
small ionic molecules, 149
overview, 146-147 
protection against proteolysis, 150-151 
AEBSF, 150
EDTA, 151
peptide protease inhibitors, 151 
PMSF, 150 
protein solubilization, 147-149 
ampholytes or buffers, 148—149 
chaotropes, 147-148
detergents, 148 
reductants, 148 
rat liver protein, 163-164 
SDS-PAGE 
IPG strip equilibration, 185-186
overview, 152-154 
protocol, 187-189 
recipes for single-percentage gels, 178 
vertical slab gels 
casting single homogeneous gel, 177-180
electrophoresis conditions, 189 
simultaneous casting of multiple gradient 
gels, 181—184 
staining 
Coomassie Blue, 154—155, 190-191, 208, 210 
fluorescent, 155-156, 199-200
phosphoprotein, 155-157, 201-202 
silver stain, 155-156, 192-198 
staining membranes, 208-212
colloidal gold, 208, 212 
Coomassie Blue, 208, 210 
india ink, 208, 211
Ponceau S, 208-209
trace enrichment, 13 
transfer of proteins from 
equipment, 157 
membranes, 158 
semidry blotting, 157, 206-207 
tank transfer, 157, 203-205 
tips for, 158 
WWW resources, 213-215 
Two-dimensional (2D) gel rehydration solution,
547
Two-dimensional (2D)-LC-MS
for large-scale protein analysis, 869-876 
installation of system, 870-871
materials, 871-872 
method, 872-873 
overview, 869-870 
schematic diagram of system, 871
Two-dimensional paper chromatography, 289


Tyrosine 
phosphotyrosine, 594, 600 
properties and structure, 842—843 


UniGene database, 825
URAA, 758, 761
Urea, 752 
ammonium cyanate contamination of, 352
for denaturation prior to reduction/alkylation,
351 
for inclusion body solubilization, 98, 121-122 
polyacrylamide pore size and, 45 
for protein solubilization, 147-148
purification of, 860-861
quality of, 351-352 
recrystallization of, 861
in rehydration buffer, 535 
in SDS equilibration buffer, 185
in sulfhydryl blocking solution, 547
in trypsin digestion protocol, 382 
UV transparency, importance in RP-HPLC, 224, 
226 


V8 protease, Staphylococcus aureus, 358, 366, 414 
Valine
properties, 843
structure, 842
van't Hoff enthalpy, 843 
Vinylpyridine, 351, 372, 374, 379, 398-399
Virtual maps, 533, 546 


Waring Blendor, 105 
Web servers, for identifying protein features, 809 
Western blotting, 157, 602 
Whole protein ions, fragmentation of, 572
WIT database, 821, 825 
WEZ Web Site, 820 
WormPD database, 829 
WWW resources
bioinformatics, 809, 824—825, 834-835 
cellular and subcellular extract preparation, 142 
HPLC information, 224, 286 
mass spectrometry, 595
PAGE, 89 
peptide mapping, 423 
phosphorylation site mapping, 667 
proteases, 356 
protein interactions, 794
RP-HPLC, 235 


Xcalibur software, 503-504


Yeast 
preparation of protein extracts from, 97,
124-125
two-hybrid system, 673-674 
Yellow fluorescent protein (YFP), 687, 757-765 
YPD database, 829


Z3 software system, 160
Zinc chloride staining, 358
Zinc-imidazole staining, 64, 358, 394-395
Zinc sulfate, 394-395
ZipTips, C18, 460-461
Zone electrophoresis, 41, 45. See also Polyacrylamide
gel electrophoresis
Zymogens, 8